On the limits of neural network explainability via descrambling

Sule, Shashank; Spencer, Richard G.; Czaja, Wojciech

arXiv:2301.07820 (cs)

[Submitted on 18 Jan 2023 (v1), last revised 2 Sep 2024 (this version, v3)]

Abstract: We characterize the exact solutions to neural network descrambling--a mathematical model for explaining the fully connected layers of trained neural networks (NNs). By reformulating the problem as the minimization of the Brockett function arising in graph matching and complexity theory, we show that the principal components of the hidden layer preactivations can be characterized as the optimal explainers or descramblers for the layer weights, leading to descrambled weight matrices. We show that in typical deep learning contexts these descramblers take diverse and interesting forms including (1) matching the largest principal components with the lowest frequency modes of the Fourier basis for isotropic hidden data, (2) discovering the semantic development in two-layer linear NNs for signal recovery problems, and (3) explaining CNNs by optimally permuting the neurons. Our numerical experiments indicate that the eigendecompositions of the hidden layer data--now understood as the descramblers--can also reveal the layer's underlying transformation. These results illustrate that the SVD is more directly related to the explainability of NNs than previously thought and offers a promising avenue for discovering interpretable motifs for the hidden action of NNs, especially in contexts of operator learning or physics-informed NNs, where the input/output data has limited human readability.
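
As a rough illustration of the abstract's central claim, the following sketch computes a PCA-style descrambler for a single fully connected layer. It is not the authors' implementation. The function name `pca_descrambler`, the random weights `W` and inputs `X`, and the use of an uncentered SVD of the preactivations are all assumptions made for the example; the paper's exact objective (the Brockett function) and any smoothness criterion are not reproduced here.

```python
import numpy as np

def pca_descrambler(W, X):
    """Hypothetical helper: build an orthogonal descrambler from the
    left singular vectors of the hidden-layer preactivations Z = W X.

    W: (hidden, inputs) layer weights; X: (inputs, samples) input batch.
    Returns the descrambler P, the descrambled weights P @ W, and the
    singular values of Z (sorted in descending order by NumPy).
    """
    Z = W @ X                                   # preactivations, (hidden, samples)
    U, s, _ = np.linalg.svd(Z, full_matrices=True)
    P = U.T                                     # rows of P are principal directions of Z
    return P, P @ W, s

# Illustrative random data (assumption: no trained network is involved).
rng = np.random.default_rng(0)
W = rng.standard_normal((6, 6))
X = rng.standard_normal((6, 200))

P, W_descrambled, s = pca_descrambler(W, X)
```

Under these assumptions, `P` is orthogonal and the rotated preactivations `P @ (W @ X)` have mutually orthogonal rows, which is the sense in which the principal components align the hidden layer with its dominant directions of variation.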

Subjects:	Machine Learning (cs.LG); Signal Processing (eess.SP); Numerical Analysis (math.NA)
Cite as:	arXiv:2301.07820 [cs.LG]
	(or arXiv:2301.07820v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2301.07820

Submission history

From: Shashank Sule
[v1] Wed, 18 Jan 2023 23:16:53 UTC (1,904 KB)
[v2] Wed, 9 Aug 2023 00:44:38 UTC (3,429 KB)
[v3] Mon, 2 Sep 2024 21:17:39 UTC (3,706 KB)
