A Functionally-Grounded Benchmark Framework for XAI Methods: Insights and Foundations from a Systematic Literature Review

Dulce Canha, Sylvain Kubler, Kary Främling, Guy Fagherazzi

Research output: Contribution to journalArticleResearchpeer-review

1 Citation (Scopus)

Abstract

Artificial Intelligence (AI) is transforming industries, offering new opportunities to manage and enhance innovation. However, these advancements bring significant challenges for scientists and businesses, with one of the most critical being the ‘trustworthiness” of AI systems. A key requirement of trustworthiness is transparency, closely linked to explicability. Consequently, the exponential growth of eXplainable AI (XAI) has led to the development of numerous methods and metrics for explainability. Nevertheless, this has resulted in a lack of standardized and formal definitions for fundamental XAI properties (e.g., what do soundness, completeness, and faithfulness of an explanation entail? How is the stability of an XAI method defined?). This lack of consensus makes it difficult for XAI practitioners to establish a shared foundation, thereby impeding the effective benchmarking of XAI methods. This survey article addresses these challenges with two primary objectives. First, it systematically reviews and categorizes XAI properties, distinguishing them between human-centered (relying on empirical studies involving explainees) or functionally-grounded (quantitative metrics independent of explainees). Second, it expands this analysis by introducing a hierarchically structured, functionally grounded benchmark framework for XAI methods, providing formal definitions of XAI properties. The framework’s practicality is demonstrated by applying it to two widely used methods: LIME and SHAP.

Original languageEnglish
Article numberART320
Number of pages40
JournalACM Computing Surveys
Volume57
Issue number12
DOIs
Publication statusPublished - 14 Jul 2025

Keywords

  • Artificial intelligence
  • eXplainable AI (XAI)
  • interpretability
  • machine learning
  • responsible AI
  • transparency
  • trustworthiness

Fingerprint

Dive into the research topics of 'A Functionally-Grounded Benchmark Framework for XAI Methods: Insights and Foundations from a Systematic Literature Review'. Together they form a unique fingerprint.

Cite this