arXiv Open Access 2023

Categorical Foundations of Explainable AI: A Unifying Theory

Pietro Barbiero Stefano Fioravanti Francesco Giannini Alberto Tonda Pietro Lio +1 lainnya
Lihat Sumber

Abstrak

Explainable AI (XAI) aims to address the human need for safe and reliable AI systems. However, numerous surveys emphasize the absence of a sound mathematical formalization of key XAI notions -- remarkably including the term "explanation" which still lacks a precise definition. To bridge this gap, this paper presents the first mathematically rigorous definitions of key XAI notions and processes, using the well-funded formalism of Category theory. We show that our categorical framework allows to: (i) model existing learning schemes and architectures, (ii) formally define the term "explanation", (iii) establish a theoretical basis for XAI taxonomies, and (iv) analyze commonly overlooked aspects of explaining methods. As a consequence, our categorical framework promotes the ethical and secure deployment of AI technologies as it represents a significant step towards a sound theoretical foundation of explainable AI.

Topik & Kata Kunci

Penulis (6)

P

Pietro Barbiero

S

Stefano Fioravanti

F

Francesco Giannini

A

Alberto Tonda

P

Pietro Lio

E

Elena Di Lavore

Format Sitasi

Barbiero, P., Fioravanti, S., Giannini, F., Tonda, A., Lio, P., Lavore, E.D. (2023). Categorical Foundations of Explainable AI: A Unifying Theory. https://arxiv.org/abs/2304.14094

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2023
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓