"I do not know! but why?"- Local model-agnostic example-based explanations of reject

Artelt, André; Visser, Roel; Hammer, Barbara

"I do not know! but why?"- Local model-agnostic example-based explanations of reject

Artelt A, Visser R, Hammer B (2023)
Neurocomputing 558: 126722.

Zeitschriftenaufsatz | Veröffentlicht | Englisch

Download

Es wurden keine Dateien hochgeladen. Nur Publikationsnachweis!

DOI

https://doi.org/10.1016/j.neucom.2023.126722

Autor*in

Artelt, André^UniBi ; Visser, Roel^UniBi; Hammer, Barbara^UniBi

Einrichtung

SFB/Transregio 318 Constructing Explainability > Projektbereich C: Darstellung und Berechnung von Erklärungen > Teilprojekt C03: Interpretierbares maschinelles Lernen: Erklärbarkeit in dynamischen Umgebungen
SFB/Transregio 318 Constructing Explainability > Projektbereich C: Darstellung und Berechnung von Erklärungen > Teilprojekt C01: Gesundes Misstrauen in und durch Erklärungen
Technische Fakultät > AG Machine Learning
Center of Excellence - Cognitive Interaction Technology CITEC > Machine Learning

Abstract / Bemerkung

Machine learning based decision making systems in safety critical areas place high demands on the accuracy and generalization ability of the underlying model. A common strategy to deal with uncertainties and possible mistakes is offered by learning with reject option, i.e. a model can refrain from prediction in ambiguous cases and leave the decision to a human expert. Yet, as for the models themselves, human decision-making is hampered by the fact that reject options are often implemented as black-box rules: Experts cannot readily understand the reasons for rejection.In this work, we propose a model-agnostic framework that enriches classification with reject option by explanation mechanisms. More specifically, we combine conformal prediction as a popular mathematically based technology of certainty estimation with local surrogates derived for the region of interest. This allows us to provide local explanations in terms of example-based explanation methods, including counterfactual, semi-factual, and factual methods. We demonstrate the performance of this technology through a series of benchmarks using 6 different data sets; the associated code is open source.1

Stichworte

Reject option; XAI; Example-based explanations; Model-agnostic; explanations; Local explanations; Counterfactual explanations; Conformal; prediction

Erscheinungsjahr

2023

Zeitschriftentitel

Neurocomputing

Band

558

Art.-Nr.

126722

ISSN

0925-2312

eISSN

1872-8286

Page URI

https://pub.uni-bielefeld.de/record/2983728

Zitieren

Artelt A, Visser R, Hammer B. "I do not know! but why?"- Local model-agnostic example-based explanations of reject. Neurocomputing. 2023;558: 126722.

Artelt, A., Visser, R., & Hammer, B. (2023). "I do not know! but why?"- Local model-agnostic example-based explanations of reject. Neurocomputing, 558, 126722. https://doi.org/10.1016/j.neucom.2023.126722

Artelt, André, Visser, Roel, and Hammer, Barbara. 2023. “"I do not know! but why?"- Local model-agnostic example-based explanations of reject”. Neurocomputing 558: 126722.

Artelt, A., Visser, R., and Hammer, B. (2023). "I do not know! but why?"- Local model-agnostic example-based explanations of reject. Neurocomputing 558:126722.

Artelt, A., Visser, R., & Hammer, B., 2023. "I do not know! but why?"- Local model-agnostic example-based explanations of reject. Neurocomputing, 558: 126722.

A. Artelt, R. Visser, and B. Hammer, “"I do not know! but why?"- Local model-agnostic example-based explanations of reject”, Neurocomputing, vol. 558, 2023, : 126722.

Artelt, A., Visser, R., Hammer, B.: "I do not know! but why?"- Local model-agnostic example-based explanations of reject. Neurocomputing. 558, : 126722 (2023).

Artelt, André, Visser, Roel, and Hammer, Barbara. “"I do not know! but why?"- Local model-agnostic example-based explanations of reject”. Neurocomputing 558 (2023): 126722.

Export

Markieren/ Markierung löschen
Markierte Publikationen

Open Data PUB

Web of Science

Dieser Datensatz im Web of Science®

Suchen in

Google Scholar

PUB - Publikationen an der Universität Bielefeld