DeSBi

Publications

August, 2024

Sparse Explanations of Neural Networks Using Pruned Layer-Wise Relevance Propagation

Authors:
Paulo Yanez Sarmiento
Simon Witzke
Nadja Klein
Bernhard Y. Renard

Published in:
Machine Learning and Knowledge Discovery in Databases. Research Track. ECML PKDD 2024. Lecture Notes in Computer Science

Abstract:
Explainability is a key component in many applications involving deep neural networks (DNNs). However, current explanation methods for DNNs commonly leave it to the human observer to distinguish relevant explanations from spurious noise. This is no longer feasible when moving from easily human-accessible data such as images to more complex data such as genome sequences. To facilitate the accessibility of DNN outputs from such complex data and to increase explainability, we present a modification of the widely used explanation method layer-wise relevance propagation. Our approach enforces sparsity directly by pruning the relevance propagation for the different layers. Thereby, we achieve sparser relevance attributions for the input features as well as for the intermediate layers. As the relevance propagation is input-specific, we aim to prune the relevance propagation rather than the underlying model architecture. This allows pruning different neurons for different inputs and hence may be better suited to the local nature of explanation methods. To demonstrate the efficacy of our method, we evaluate it on two types of data: images and genome sequences. We show that our modification indeed leads to noise reduction and concentrates relevance on the most important features compared to the baseline.
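
To give a concrete picture of the idea, here is a minimal sketch of an LRP-epsilon backward pass over dense layers with per-layer pruning of the relevance flow (plain numpy; the pruning criterion, the value of k, and the helper names are illustrative assumptions, not the authors' implementation):

```python
import numpy as np

def lrp_epsilon_step(a, W, R_out, eps=1e-6):
    """Propagate relevance from a dense layer's output back to its input (LRP-epsilon).
    a: input activations [d_in], W: weights [d_in, d_out], R_out: relevance [d_out]."""
    z = a @ W
    z = z + eps * np.where(z >= 0, 1.0, -1.0)   # stabilizer
    s = R_out / z
    return a * (W @ s)                          # relevance of the layer's inputs [d_in]

def prune_topk(R, k):
    """Keep only the k largest-magnitude relevances; zero out the rest."""
    pruned = np.zeros_like(R)
    keep = np.argsort(np.abs(R))[-k:]
    pruned[keep] = R[keep]
    return pruned

def pruned_lrp(weights, activations, R_output, k=10):
    """Backward relevance pass with pruning applied to the propagation itself,
    so different neurons can be dropped for different inputs."""
    R = R_output
    for W, a in zip(reversed(weights), reversed(activations)):
        R = prune_topk(lrp_epsilon_step(a, W, R), k)
    return R   # sparse relevance attribution for the input features
```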

Access:
Access publication

June, 2024

TransferGWAS of T1-weighted Brain MRI Data from UK Biobank

Authors:
Alexander Rakowski
Remo Monti
Christoph Lippert

Published in:
medRxiv preprint

Abstract:
Genome-wide association studies (GWAS) traditionally analyze single traits, e.g., disease diagnoses or biomarkers. Nowadays, large-scale cohorts such as the UK Biobank (UKB) collect imaging data with sample sizes large enough to perform genetic association testing. Typical approaches to GWAS on high-dimensional modalities extract predefined features from the data, e.g., volumes of regions of interest. This limits the scope of such studies to predefined traits and can ignore novel patterns present in the data. TransferGWAS employs deep neural networks (DNNs) to extract low-dimensional representations of imaging data for GWAS, eliminating the need for predefined biomarkers. Here, we apply transferGWAS to brain MRI data from the UKB. We encoded 36,311 T1-weighted brain magnetic resonance imaging (MRI) scans using DNN models trained on MRI scans from the Alzheimer’s Disease Neuroimaging Initiative, and on natural images from the ImageNet dataset, and performed a multivariate GWAS on the resulting features. Furthermore, we fitted polygenic scores (PGS) of the deep features and computed genetic correlations between them and a range of selected phenotypes. We identified 289 independent loci, associated mostly with bone density, brain, or cardiovascular traits, and 14 regions having no previously reported associations. We evaluated the PGS in a multi-PGS setting, improving predictions of several traits. By examining clusters of genetic correlations, we found novel links between diffusion MRI traits and type 2 diabetes.
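
As a rough sketch of the feature-extraction step only (a torchvision ImageNet encoder standing in for the ADNI- and ImageNet-trained models used in the study; the multivariate association testing itself is not shown):

```python
import torch
from torchvision import models, transforms
from PIL import Image

# ImageNet-pretrained backbone with the classification head removed,
# so each scan slice is encoded into a low-dimensional feature vector.
backbone = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V2)
encoder = torch.nn.Sequential(*list(backbone.children())[:-1]).eval()

preprocess = transforms.Compose([
    transforms.Resize(256),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
])

@torch.no_grad()
def encode(image_path):
    x = preprocess(Image.open(image_path).convert("RGB")).unsqueeze(0)
    return encoder(x).flatten(1)   # [1, 2048] deep features

# Stacking these vectors over individuals (optionally reduced, e.g. by PCA)
# yields the multivariate phenotype matrix passed to the GWAS.
```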

Access:
Access publication

September, 2024

Metadata-guided Feature Disentanglement for Functional Genomics

Authors:
Alexander Rakowski
Remo Monti
Viktoriia Huryn
Marta Lemanczyk
Uwe Ohler
Christoph Lippert

Published in:
Bioinformatics Volume 40 Supplementary Issue – ECCB 2024 Conference Proceedings

Abstract:
With the development of high-throughput technologies, genomics datasets rapidly grow in size, including functional genomics data. This has allowed the training of large Deep Learning (DL) models to predict epigenetic readouts, such as protein binding or histone modifications, from genome sequences. However, large dataset sizes come at the price of data consistency, often aggregating results from a large number of studies, conducted under varying experimental conditions. While data from large-scale consortia are useful as they allow studying the effects of different biological conditions, they can also contain unwanted biases from confounding experimental factors. Here, we introduce Metadata-guided Feature Disentanglement (MFD)—an approach that allows disentangling biologically relevant features from potential technical biases. MFD incorporates target metadata into model training, by conditioning weights of the model output layer on different experimental factors. It then separates the factors into disjoint groups and enforces independence of the corresponding feature subspaces with an adversarially learned penalty. We show that the metadata-driven disentanglement approach allows for better model introspection, by connecting latent features to experimental factors, without compromising, and in some cases even improving, performance in downstream tasks, such as enhancer prediction, or genetic variant discovery. The code will be made available at https://github.com/HealthML/MFD.
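
A loose sketch of the two ingredients described above, with hypothetical module names (a metadata-conditioned output layer plus a gradient-reversal adversary as a stand-in for the adversarially learned independence penalty):

```python
import torch
import torch.nn as nn

class GradReverse(torch.autograd.Function):
    """Identity in the forward pass, sign-flipped gradient in the backward pass."""
    @staticmethod
    def forward(ctx, x):
        return x.view_as(x)
    @staticmethod
    def backward(ctx, grad):
        return -grad

class MetadataConditionedHead(nn.Module):
    """Output-layer weights are generated from experiment metadata; the latent space
    is split into two groups whose mutual predictability is adversarially penalized."""
    def __init__(self, d_feat, d_meta):
        super().__init__()
        assert d_feat % 2 == 0
        self.half = d_feat // 2
        self.weight_gen = nn.Linear(d_meta, d_feat)      # metadata -> output-layer weights
        self.adversary = nn.Linear(self.half, self.half)

    def forward(self, features, metadata):
        w = self.weight_gen(metadata)                    # [B, d_feat]
        logit = (features * w).sum(dim=1, keepdim=True)  # metadata-conditioned prediction
        f_a, f_b = features[:, :self.half], features[:, self.half:]
        # The adversary learns to predict one feature group from the other; the
        # reversed gradient pushes the encoder to make the groups independent.
        pred_b = self.adversary(GradReverse.apply(f_a))
        indep_penalty = ((pred_b - f_b.detach()) ** 2).mean()
        return logit, indep_penalty

# training loss: task_loss(logit, target) + lambda_indep * indep_penalty
```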

Access:
Access publication

November, 2024

Arctique: An artificial histopathological dataset unifying realism and controllability for uncertainty quantification

Authors:
Jannik Franzen
Claudia Winklmayr
Vanessa Emanuela Guarino
Christoph Karg
Xiaoyan Yu
Nora Koreuber
Jan Philipp Albrecht
Philip Bischoff
Dagmar Kainmueller

Published in:
NeurIPS’24 Datasets and Benchmarks Track

Abstract:
Uncertainty Quantification (UQ) is crucial for reliable image segmentation. Yet, while the field sees continual development of novel methods, a lack of agreed-upon benchmarks limits their systematic comparison and evaluation: Current UQ methods are typically tested either on overly simplistic toy datasets or on complex real-world datasets that do not allow one to discern true uncertainty. To unify both controllability and complexity, we introduce Arctique, a procedurally generated dataset modeled after histopathological colon images. We chose histopathological images for two reasons: 1) their complexity in terms of intricate object structures and highly variable appearance, which yields challenging segmentation problems, and 2) their broad prevalence for medical diagnosis and the corresponding relevance of high-quality UQ. To generate Arctique, we established a Blender-based framework for 3D scene creation with intrinsic noise manipulation. Arctique contains 50,000 rendered images with precise masks as well as noisy label simulations. We show that by independently controlling the uncertainty in both images and labels, we can effectively study the performance of several commonly used UQ methods. Hence, Arctique serves as a critical resource for benchmarking and advancing UQ techniques and other methodologies in complex, multi-object environments, bridging the gap between realism and controllability. All code is publicly available, allowing re-creation and controlled manipulations of our shipped images as well as creation and rendering of new scenes.

Access:
Access publication

October, 2024

DeepRepViz: Identifying Potential Confounders in Deep Learning Model Predictions

Authors:
Roshan Prakash Rane
JiHoon Kim
Arjun Umesha
Didem Stark
Marc-André Schulz
Kerstin Ritter

Published in:
Medical Image Computing and Computer Assisted Intervention – MICCAI 2024. MICCAI 2024. Lecture Notes in Computer Science, vol 15010. Springer, Cham.

Abstract:
Deep Learning (DL) has emerged as a powerful tool in neuroimaging research. DL models predicting brain pathologies, psychological behaviors, and cognitive traits from neuroimaging data have the potential to discover the neurobiological basis of these phenotypes. However, these models can be biased by spurious imaging artifacts or by the information about age and sex encoded in the neuroimaging data. In this study, we introduce a lightweight and easy-to-use framework called ‘DeepRepViz’ designed to detect such potential confounders in DL model predictions and enhance the transparency of predictive DL models. DeepRepViz comprises two components – an online visualization tool (available at https://deep-rep-viz.vercel.app/) and a metric called the ‘Con-score’. The tool enables researchers to visualize the final latent representation of their DL model and qualitatively inspect it for biases. The Con-score, or the ‘concept encoding’ score, quantifies the extent to which potential confounders like sex or age are encoded in the final latent representation and influence the model predictions. We illustrate the rationale of the Con-score formulation using a simulation experiment. Next, we demonstrate the utility of the DeepRepViz framework by applying it to three typical neuroimaging-based prediction tasks (n = 12000). These include (a) distinguishing chronic alcohol users from controls, (b) classifying sex, and (c) predicting the speed of completing a cognitive task known as ‘trail making’. In the DL model predicting chronic alcohol users, DeepRepViz uncovers a strong influence of sex on the predictions (Con-score = 0.35). In the model predicting cognitive task performance, DeepRepViz reveals that age plays a major role (Con-score = 0.3). Thus, the DeepRepViz framework enables neuroimaging researchers to systematically examine their model and identify potential biases, thereby improving the transparency of predictive DL models in neuroimaging studies.

Access:
Access publication

April, 2024

Sparse Explanations of Neural Networks Using Pruned Layer-Wise Relevance Propagation

Authors:
Paulo Yanez Sarmiento
Simon Witzke
Nadja Klein
Bernhard Y. Renard

Published in:
arXiv

Abstract:
Explainability is a key component in many applications involving deep neural networks (DNNs). However, current explanation methods for DNNs commonly leave it to the human observer to distinguish relevant explanations from spurious noise. This is no longer feasible when moving from easily human-accessible data such as images to more complex data such as genome sequences. To facilitate the accessibility of DNN outputs from such complex data and to increase explainability, we present a modification of the widely used explanation method layer-wise relevance propagation. Our approach enforces sparsity directly by pruning the relevance propagation for the different layers. Thereby, we achieve sparser relevance attributions for the input features as well as for the intermediate layers. As the relevance propagation is input-specific, we aim to prune the relevance propagation rather than the underlying model architecture. This allows pruning different neurons for different inputs and hence may be better suited to the local nature of explanation methods. To demonstrate the efficacy of our method, we evaluate it on two types of data: images and genomic sequences. We show that our modification indeed leads to noise reduction and concentrates relevance on the most important features compared to the baseline.

Access:
Access publication

March, 2023

Explainable AI for Audio via Virtual Inspection Layers 

Authors:
Johanna Vielhaben
Sebastian Lapuschkin
Grégoire Montavon
Wojciech Samek

Published in:
NeurIPS’23 Workshop on Machine Learning for Audio, 2023

Abstract:
The field of eXplainable Artificial Intelligence (XAI) has made significant advancements in recent years. However, most progress has focused on computer vision and natural language processing. There has been limited research on XAI specifically for audio or other time series data, where the input itself is often hard to interpret. In this study, we introduce a virtual inspection layer that transforms time series data into an interpretable representation and enables the use of local XAI methods to attribute relevance to this representation.
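
To make the construction concrete, here is a hedged sketch with an STFT front end (the `classifier` is a placeholder waveform model, and Gradient x Input stands in for the relevance method; the transform and attribution rule follow the paper only loosely):

```python
import torch
import torch.nn as nn

class InspectionWrapper(nn.Module):
    """Prepends an invertible time-frequency transform to a waveform classifier so
    that attributions can be computed on the interpretable STFT coefficients."""
    def __init__(self, classifier, n_fft=512, hop=128):
        super().__init__()
        self.classifier, self.n_fft, self.hop = classifier, n_fft, hop

    def forward(self, coeffs_ri, length):
        # coeffs_ri carries real/imaginary parts; the STFT is inverted *inside*
        # the model, so gradients flow back to the time-frequency representation.
        win = torch.hann_window(self.n_fft)
        wave = torch.istft(torch.view_as_complex(coeffs_ri), self.n_fft,
                           hop_length=self.hop, window=win, length=length)
        return self.classifier(wave)

def explain(wrapper, waveform, target_class):
    coeffs = torch.stft(waveform, wrapper.n_fft, hop_length=wrapper.hop,
                        window=torch.hann_window(wrapper.n_fft), return_complex=True)
    coeffs_ri = torch.view_as_real(coeffs).detach().requires_grad_(True)
    score = wrapper(coeffs_ri, waveform.shape[-1])[..., target_class].sum()
    score.backward()
    # Gradient x Input on the spectrogram coefficients (summed over real/imag parts).
    return (coeffs_ri * coeffs_ri.grad).sum(-1)
```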

Access:
Access publication

October, 2023

Reveal to Revise: An Explainable AI Life Cycle for Iterative Bias Correction of Deep Models

Authors:
Frederik Pahde
Maximilian Dreyer
Wojciech Samek
Sebastian Lapuschkin

Published in:
Medical Image Computing and Computer Assisted Intervention – MICCAI 2023, LNCS, 14221:596-606, Springer, Cham

Abstract:
State-of-the-art machine learning models often learn spurious correlations embedded in the training data. This poses risks when deploying these models for high-stakes decision-making, such as in medical applications like skin cancer detection. To tackle this problem, we propose Reveal to Revise (R2R), a framework entailing the entire eXplainable Artificial Intelligence (XAI) life cycle, enabling practitioners to iteratively identify, mitigate, and (re-)evaluate spurious model behavior with a minimal amount of human interaction. In the first step (1), R2R reveals model weaknesses by finding outliers in attributions or through inspection of latent concepts learned by the model. Secondly (2), the responsible artifacts are detected and spatially localized in the input data, which is then leveraged to (3) revise the model behavior. Concretely, we apply the methods of RRR, CDEP and ClArC for model correction, and (4) (re-)evaluate the model’s performance and remaining sensitivity towards the artifact. Using two medical benchmark datasets for Melanoma detection and bone age estimation, we apply our R2R framework to VGG, ResNet and EfficientNet architectures and thereby reveal and correct real dataset-intrinsic artifacts, as well as synthetic variants in a controlled setting. Completing the XAI life cycle, we demonstrate multiple R2R iterations to mitigate different biases. Code is available on https://github.com/maxdreyer/Reveal2Revise.

Access:
Access publication

April, 2024

Reactive Model Correction: Mitigating Harm to Task-Relevant Features via Conditional Bias Suppression

Authors:
Dilyara Bareeva
Maximilian Dreyer
Frederik Pahde
Wojciech Samek
Sebastian Lapuschkin

Published in:
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Abstract:
Deep Neural Networks are prone to learning and relying on spurious correlations in the training data, which, for high-risk applications, can have fatal consequences. Various approaches to suppress model reliance on harmful features have been proposed that can be applied post-hoc without additional training. While these methods can be applied efficiently, they also tend to harm model performance by globally shifting the distribution of latent features. To mitigate unintended overcorrection of model behavior, we propose a reactive approach conditioned on model-derived knowledge and eXplainable Artificial Intelligence (XAI) insights. While the reactive approach can be applied to many post-hoc methods, we demonstrate the incorporation of reactivity in particular for P-ClArC (Projective Class Artifact Compensation), introducing a new method called R-ClArC (Reactive Class Artifact Compensation). Through rigorous experiments in controlled settings (FunnyBirds) and with a real-world dataset (ISIC2019), we show that introducing reactivity can minimize the detrimental effect of the applied correction while simultaneously ensuring low reliance on spurious features.
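
The reactive idea can be sketched in a few lines (a P-ClArC-style shift along a given artifact CAV, applied only to samples whose latent alignment exceeds a threshold; the CAV, the clean reference level, and the threshold are assumed to be estimated beforehand):

```python
import torch

def reactive_suppression(latent, cav, clean_level, threshold):
    """Shift activations along an artifact CAV back to a 'clean' level, but only
    for samples whose alignment with the CAV exceeds a threshold (the reactive part).

    latent:      [B, D] activations at the chosen layer
    cav:         [D]    unit-norm concept activation vector of the artifact
    clean_level: scalar mean CAV-alignment of artifact-free samples
    """
    alignment = latent @ cav                                # [B]
    affected = (alignment > threshold).unsqueeze(1)         # [B, 1] reactive condition
    shift = (alignment - clean_level).unsqueeze(1) * cav    # [B, D] projective correction
    return torch.where(affected, latent - shift, latent)
```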

Access:
Access publication

June, 2024

Explainable Artificial Intelligence (XAI) 2.0: A Manifesto of Open Challenges and Interdisciplinary Research Directions

Authors:
Luca Longo
Mario Brcic
Federico Cabitza
Jaesik Choi
Roberto Confalonieri
Javier Del Ser
Riccardo Guidotti
Yoichi Hayashi
Francisco Herrera
Andreas Holzinger
Richard Jiang
Hassan Khosravi
Freddy Lecue
Gianclaudio Malgieri
Andrés Páez
Wojciech Samek
Johannes Schneider
Timo Speith
Simone Stumpf

Published in:
Information Fusion, Volume 106

Abstract:
Understanding black box models has become paramount as systems based on opaque Artificial Intelligence (AI) continue to flourish in diverse real-world applications. In response, Explainable AI (XAI) has emerged as a field of research with practical and ethical benefits across various domains. This paper highlights the advancements in XAI and its application in real-world scenarios and addresses the ongoing challenges within XAI, emphasizing the need for broader perspectives and collaborative efforts. We bring together experts from diverse fields to identify open problems, striving to synchronize research agendas and accelerate XAI in practical applications. By fostering collaborative discussion and interdisciplinary cooperation, we aim to propel XAI forward, contributing to its continued success. We aim to develop a comprehensive proposal for advancing XAI. To achieve this goal, we present a manifesto of 28 open problems grouped into nine categories. These challenges encapsulate the complexities and nuances of XAI and offer a road map for future research. For each problem, we provide promising research directions in the hope of harnessing the collective intelligence of interested stakeholders.

Access:
Access publication

January, 2024

Explaining Predictive Uncertainty by Exposing Second-Order Effects

Authors:
Florian Bley
Sebastian Lapuschkin
Wojciech Samek
Grégoire Montavon

Published in:
arXiv

Abstract:
Explainable AI has brought transparency into complex ML black boxes, enabling us, in particular, to identify which features these models use for their predictions. So far, the question of explaining predictive uncertainty, i.e. why a model ‘doubts’, has been scarcely studied. Our investigation reveals that predictive uncertainty is dominated by second-order effects, involving single features or product interactions between them. We contribute a new method for explaining predictive uncertainty based on these second-order effects. Computationally, our method reduces to a simple covariance computation over a collection of first-order explanations. Our method is generally applicable, allowing for turning common attribution techniques (LRP, Gradient x Input, etc.) into powerful second-order uncertainty explainers, which we call CovLRP, CovGI, etc. The accuracy of the explanations our method produces is demonstrated through systematic quantitative evaluations, and the overall usefulness of our method is demonstrated via two practical showcases.
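
The covariance step reduces to a few lines of numpy once a collection of first-order explanations is available (e.g., one attribution vector per ensemble member or Monte Carlo dropout draw; an illustration of the principle, not the paper's exact estimator):

```python
import numpy as np

def second_order_uncertainty_explanation(first_order_explanations):
    """first_order_explanations: array of shape [M, D] holding one first-order
    attribution vector (e.g. LRP or Gradient x Input) per model draw."""
    E = np.asarray(first_order_explanations)
    cov = np.cov(E, rowvar=False)               # [D, D] covariance over the M explanations
    per_feature = np.diag(cov)                  # single-feature (diagonal) contributions
    interactions = cov - np.diag(per_feature)   # off-diagonal product interactions
    return per_feature, interactions
```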

Access:
Access publication

July, 2024

Model guidance via explanations turns image classifiers into segmentation models

Authors:
Xiaoyan Yu
Jannik Franzen
Wojciech Samek
Marina Höhne
Dagmar Kainmüller

Published in:
Proceedings of the 2nd World Conference on Explainable Artificial Intelligence (XAI)

Abstract:
Heatmaps generated on inputs of image classification networks via explainable AI methods like Grad-CAM and LRP have been observed to resemble segmentations of input images in many cases. Consequently, heatmaps have also been leveraged for achieving weakly supervised segmentation with image-level supervision. On the other hand, losses can be imposed on differentiable heatmaps, which has been shown to serve for (1) improving heatmaps to be more human-interpretable, (2) regularization of networks towards better generalization, (3) training diverse ensembles of networks, and (4) explicitly ignoring confounding input features. Due to the latter use case, the paradigm of imposing losses on heatmaps is often referred to as “Right for the Right Reasons”. We unify these two lines of research by investigating semi-supervised segmentation as a novel use case for the Right for the Right Reasons paradigm. First, we show formal parallels between differentiable heatmap architectures and standard encoder-decoder architectures for image segmentation. Second, we show that such differentiable heatmap architectures yield competitive results when trained with standard segmentation losses. Third, we show that such architectures allow for training with weak supervision in the form of image-level labels and small numbers of pixel-level labels, outperforming comparable encoder-decoder models.
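
A compact sketch of imposing a segmentation loss on a differentiable Grad-CAM-style heatmap (the split into `features_net` and `classifier_head` and the choice of loss are assumptions; in training, the usual classification loss would be added on top):

```python
import torch
import torch.nn.functional as F

def heatmap_segmentation_loss(features_net, classifier_head, images, masks, target_class):
    """Differentiable Grad-CAM-style heatmap penalized against pixel-level labels.
    masks: [B, 1, H, W] binary (possibly sparse) segmentation labels."""
    feats = features_net(images)                        # [B, C, h, w]
    logits = classifier_head(feats.mean(dim=(2, 3)))    # global-average-pooled head
    score = logits[:, target_class].sum()
    # create_graph=True keeps the heatmap differentiable so this loss can be
    # backpropagated into the network ("right for the right reasons").
    grads = torch.autograd.grad(score, feats, create_graph=True)[0]
    weights = grads.mean(dim=(2, 3), keepdim=True)      # channel importance
    cam = F.relu((weights * feats).sum(dim=1, keepdim=True))
    cam = F.interpolate(cam, size=masks.shape[-2:], mode="bilinear", align_corners=False)
    cam = cam / (cam.amax(dim=(2, 3), keepdim=True) + 1e-8)
    return F.binary_cross_entropy(cam.clamp(0, 1), masks.float())
```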

Access:
Access publication

April, 2024

PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits

Authors:
Maximilian Dreyer
Erblina Purelku
Johanna Vielhaben
Wojciech Samek
Sebastian Lapuschkin

Published in:
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Abstract:
The field of mechanistic interpretability aims to study the role of individual neurons in Deep Neural Networks. Single neurons, however, have the capability to act polysemantically and encode for multiple (unrelated) features, which renders their interpretation difficult. We present a method for disentangling polysemanticity of any Deep Neural Network by decomposing a polysemantic neuron into multiple monosemantic “virtual” neurons. This is achieved by identifying the relevant sub-graph (“circuit”) for each “pure” feature. We demonstrate how our approach allows us to find and disentangle various polysemantic units of ResNet models trained on ImageNet. While evaluating feature visualizations using CLIP, our method effectively disentangles representations, improving upon methods based on neuron activations.
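
One hedged way to picture the decomposition: collect each sample's lower-layer contributions to the target neuron and cluster them into “virtual” neurons (k-means here is merely a stand-in for the relevance-based circuit identification used in the paper):

```python
import numpy as np
from sklearn.cluster import KMeans

def split_polysemantic_neuron(lower_acts, weight_to_neuron, n_virtual=2):
    """lower_acts:       [N, D] activations of the preceding layer for N samples
    weight_to_neuron:    [D]    weights connecting those units to the target neuron
    Returns one cluster label per sample, i.e. which 'virtual' neuron fired."""
    contributions = lower_acts * weight_to_neuron          # per-unit contributions [N, D]
    # Normalize so clustering reflects the *pattern* of contributing units,
    # not the overall activation strength.
    norm = np.linalg.norm(contributions, axis=1, keepdims=True) + 1e-9
    labels = KMeans(n_clusters=n_virtual, n_init=10).fit_predict(contributions / norm)
    return labels
```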

Access:
Access publication

December, 2024

From Hope to Safety: Unlearning Biases of Deep Models via Gradient Penalization in Latent Space

Authors:
Maximilian Dreyer
Frederik Pahde
Christopher J. Anders
Wojciech Samek
Sebastian Lapuschkin

Published in:
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence

Abstract:
Deep Neural Networks are prone to learning spurious correlations embedded in the training data, leading to potentially biased predictions. This poses risks when deploying these models for high-stakes decision-making, such as in medical applications. Current methods for post-hoc model correction either require input-level annotations which are only possible for spatially localized biases, or augment the latent feature space, thereby hoping to enforce the right reasons. We present a novel method for model correction on the concept level that explicitly reduces model sensitivity towards biases via gradient penalization. When modeling biases via Concept Activation Vectors, we highlight the importance of choosing robust directions, as traditional regression-based approaches such as Support Vector Machines tend to result in diverging directions. We effectively mitigate biases in controlled and real-world settings on the ISIC, Bone Age, ImageNet and CelebA datasets using VGG, ResNet and EfficientNet architectures.
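
The gradient penalty at the heart of the method can be sketched as follows (the layer choice, the weighting, and the use of a single given CAV are assumptions made for illustration):

```python
import torch

def bias_gradient_penalty(model_head, latent, cav, target_class):
    """latent: encoder output [B, D] (requires grad during training);
    cav: unit-norm concept activation vector [D] modeling the bias concept.
    Penalizes the directional derivative of the target logit along the CAV."""
    logits = model_head(latent)
    score = logits[:, target_class].sum()
    grad = torch.autograd.grad(score, latent, create_graph=True)[0]   # [B, D]
    return ((grad @ cav) ** 2).mean()

# total training loss: task_loss + lambda_bias * bias_gradient_penalty(head, h, v, y)
```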

Access:
Access publication

April, 2024

Understanding the (Extra-)Ordinary: Validating Deep Model Decisions with Prototypical Concept-based Explanations

Authors:
Maximilian Dreyer
Reduan Achtibat
Wojciech Samek
Sebastian Lapuschkin

Published in:
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 3491-3501

Abstract:
Ensuring both transparency and safety is critical when deploying Deep Neural Networks (DNNs) in high-risk applications, such as medicine. The field of explainable AI (XAI) has proposed various methods to comprehend the decision-making processes of opaque DNNs. However, only a few XAI methods are suitable for ensuring safety in practice, as they heavily rely on repeated labor-intensive and possibly biased human assessment. In this work, we present a novel post-hoc concept-based XAI framework that conveys not only instance-wise (local) but also class-wise (global) decision-making strategies via prototypes. What sets our approach apart is the combination of local and global strategies, enabling a clearer understanding of the (dis-)similarities in model decisions compared to the expected (prototypical) concept use, ultimately reducing the dependence on long-term human assessment. Quantifying the deviation from prototypical behavior not only allows us to associate predictions with specific model sub-strategies but also to detect outlier behavior. As such, our approach constitutes an intuitive and explainable tool for model validation. We demonstrate the effectiveness of our approach in identifying out-of-distribution samples, spurious model behavior and data quality issues across three datasets (ImageNet, CUB-200, and CIFAR-10) utilizing VGG, ResNet, and EfficientNet architectures. Code is available at https://github.com/maxdreyer/pcx.
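
A rough illustration of the prototype idea, assuming per-sample concept relevance vectors are already computed and using a Gaussian mixture as the class-wise prototypes (sklearn as a stand-in for the paper's exact procedure):

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def fit_prototypes(concept_relevances, n_prototypes=4):
    """concept_relevances: [N, C] relevance of each latent concept per sample of one
    class. Each mixture component acts as a prototypical decision strategy."""
    return GaussianMixture(n_components=n_prototypes, covariance_type="diag",
                           random_state=0).fit(concept_relevances)

def deviation_from_prototypes(prototypes, concept_relevances):
    """Low log-likelihood means the sample deviates from all prototypical strategies,
    flagging potential outliers, spurious behavior, or data-quality issues."""
    return -prototypes.score_samples(concept_relevances)   # [N] deviation scores
```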

Access:
Access publication

February, 2024

Navigating Neural Space: Revisiting Concept Activation Vectors to Overcome Directional Divergence

Authors:
Frederik Pahde
Leander Weber
Christopher J. Anders
Wojciech Samek
Sebastian Lapuschkin

Published in:
arXiv

Abstract:
With a growing interest in understanding neural network prediction strategies, Concept Activation Vectors (CAVs) have emerged as a popular tool for modeling human-understandable concepts in the latent space. Commonly, CAVs are computed by leveraging linear classifiers optimizing the separability of latent representations of samples with and without a given concept. However, in this paper we show that such a separability-oriented computation leads to solutions, which may diverge from the actual goal of precisely modeling the concept direction. This discrepancy can be attributed to the significant influence of distractor directions, i.e., signals unrelated to the concept, which are picked up by filters (i.e., weights) of linear models to optimize class-separability. To address this, we introduce pattern-based CAVs, solely focussing on concept signals, thereby providing more accurate concept directions. We evaluate various CAV methods in terms of their alignment with the true concept direction and their impact on CAV applications, including concept sensitivity testing and model correction for shortcut behavior caused by data artifacts. We demonstrate the benefits of pattern-based CAVs using the Pediatric Bone Age, ISIC2019, and FunnyBirds datasets with VGG, ResNet, and EfficientNet model architectures.
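
The filter-versus-pattern distinction can be made concrete with a small sketch (logistic-regression weights as the filter CAV and a Haufe-style covariance transform as the pattern CAV; illustrative rather than the paper's exact estimator):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def filter_and_pattern_cav(acts, concept_labels):
    """acts: [N, D] latent activations; concept_labels: [N] binary concept presence."""
    clf = LogisticRegression(max_iter=1000).fit(acts, concept_labels)
    filter_cav = clf.coef_.ravel()                 # separability-optimized direction
    # Haufe-style transform: project the filter through the data covariance to
    # recover the concept *signal* direction, suppressing distractor directions.
    pattern_cav = np.cov(acts, rowvar=False) @ filter_cav
    unit = lambda v: v / (np.linalg.norm(v) + 1e-12)
    return unit(filter_cav), unit(pattern_cav)
```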

Access:
Access publication

February, 2024

DualView: Data Attribution from the Dual Perspective

Authors:
Galip Ü. Yolcu
Thomas Wiegand
Wojciech Samek
Sebastian Lapuschkin

Published in:
arXiv

Abstract:
Local data attribution (or influence estimation) techniques aim at estimating the impact that individual data points seen during training have on particular predictions of an already trained Machine Learning model during test time. Previous methods either do not perform well consistently across different evaluation criteria from the literature, are characterized by a high computational demand, or suffer from both. In this work we present DualView, a novel method for post-hoc data attribution based on surrogate modelling, demonstrating both high computational efficiency, as well as good evaluation results. With a focus on neural networks, we evaluate our proposed technique using suitable quantitative evaluation strategies from the literature against related principal local data attribution methods. We find that DualView requires considerably lower computational resources than other methods, while demonstrating comparable performance to competing approaches across evaluation metrics. Furthermore, our proposed method produces sparse explanations, where sparseness can be tuned via a hyperparameter. Finally, we showcase that with DualView, we can now render explanations from local data attributions compatible with established local feature attribution methods: For each prediction on (test) data points explained in terms of impactful samples from the training set, we are able to compute and visualize how the prediction on the (test) sample relates to each influential training sample in terms of the features recognized by the model. We provide an Open Source implementation of DualView online, together with implementations for all other local data attribution methods we compare against, as well as the metrics reported here, for full reproducibility.
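
As a very loose illustration of what data attribution “from the dual perspective” of a surrogate model can look like (a kernel SVM fitted on frozen penultimate-layer features, binary case only; this is not the DualView algorithm itself):

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.metrics.pairwise import rbf_kernel

def dual_data_attribution(train_feats, train_labels, test_feats, gamma=0.1):
    """Fit a surrogate kernel classifier on frozen (penultimate-layer) features and
    attribute each test prediction to training samples via the dual expansion
    decision(x) = sum_i alpha_i * K(x_i, x) + b over support vectors x_i."""
    svm = SVC(kernel="rbf", gamma=gamma).fit(train_feats, train_labels)
    K = rbf_kernel(test_feats, train_feats[svm.support_], gamma=gamma)   # [T, S]
    contributions = K * svm.dual_coef_.ravel()        # per-support-vector contribution
    attributions = np.zeros((len(test_feats), len(train_feats)))
    attributions[:, svm.support_] = contributions     # sparse: only support vectors matter
    return attributions
```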

Access:
Access publication

June, 2023

Regression in quotient metric spaces with a focus on elastic curves

Authors:
Lisa Steyer
Almond Stöcker
Sonja Greven

Published in:
arXiv

Abstract:
We propose regression models for curve-valued responses in two or more dimensions, where only the image but not the parametrization of the curves is of interest. Examples of such data are handwritten letters, movement paths or outlines of objects. In the square-root-velocity framework, a parametrization invariant distance for curves is obtained as the quotient space metric with respect to the action of re-parametrization, which is by isometries. With this special case in mind, we discuss the generalization of ‘linear’ regression to quotient metric spaces more generally, before illustrating the usefulness of our approach for curves modulo re-parametrization. We address the issue of sparsely or irregularly sampled curves by using splines for modeling smooth conditional mean curves. We test this model in simulations and apply it to human hippocampal outlines, obtained from Magnetic Resonance Imaging scans. Here we model how the shape of the irregularly sampled hippocampus is related to age, Alzheimer’s disease and sex.
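
For readers unfamiliar with the square-root-velocity framework mentioned above, the parametrization-invariant distance can be recalled as follows (standard definitions; the paper's notation may differ):

```latex
% Square-root-velocity (SRV) transform of an absolutely continuous curve c : [0,1] \to \mathbb{R}^d
q(t) = \frac{\dot{c}(t)}{\sqrt{\lVert \dot{c}(t) \rVert}} \quad \text{wherever } \dot{c}(t) \neq 0,
% and the quotient-space metric between curves modulo re-parametrization:
d\bigl([c_1],[c_2]\bigr) = \inf_{\gamma \in \Gamma}
  \bigl\lVert\, q_1 - (q_2 \circ \gamma)\,\sqrt{\dot{\gamma}} \,\bigr\rVert_{L^2([0,1])},
% where \Gamma denotes the orientation-preserving re-parametrizations of [0,1],
% which act by isometries on the SRV representations.
```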

Access:
Access publication