Generating Human Interpretable Rules from Convolutional Neural Networks
Abstract
Advancements in the field of Artificial Intelligence have been rapid in recent years and have revolutionized various industries. Deep neural network architectures capable of handling both text and images have been proposed, covering tasks such as code generation from natural language, machine translation, and text summarization. For example, Convolutional Neural Networks (CNNs) perform image classification at a level equivalent to that of humans on many image datasets. These state-of-the-art networks have reached unprecedented levels of success by using complex architectures with billions of parameters, numerous kernel configurations, and a variety of weight initialization and regularization methods. Unfortunately, in reaching this level of success, CNN models have become essentially black boxes, providing little or no human-interpretable information on their decision-making process. This lack of transparency in decision making has raised concerns in sectors of the user community such as healthcare, finance, justice, and defense. This challenge motivated our research, in which we successfully produced human-interpretable influential features from CNNs for image classification and captured the interactions between these features in a concise decision tree that makes accurate classification decisions. The proposed methodology uses a pre-trained VGG16 with fine-tuning to extract the feature maps produced by learnt filters. On the CelebA image benchmark dataset, we produced human-interpretable rules that capture the main facial landmarks responsible for separating males from females with 89.6% accuracy, while on the more challenging Cats vs Dogs dataset the decision tree achieved 87.6% accuracy.
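The sketch below is a minimal illustration, not the authors' exact pipeline, of the two stages the abstract describes: extracting feature maps from a pre-trained VGG16 and fitting a decision tree on summarised filter activations. The pooling choice, tree depth, layer name, and the dummy `images`/`labels` arrays are assumptions made for the example; fine-tuning is omitted.

```python
import numpy as np
from tensorflow.keras.applications import VGG16
from tensorflow.keras.models import Model
from sklearn.tree import DecisionTreeClassifier, export_text

# Pre-trained VGG16 without the classification head (fine-tuning omitted here).
base = VGG16(weights="imagenet", include_top=False, input_shape=(224, 224, 3))

# Expose the feature maps of the last convolutional block.
extractor = Model(inputs=base.input,
                  outputs=base.get_layer("block5_conv3").output)

def filter_activations(images):
    """Reduce each filter's feature map to a single activation score
    (global average pooling), giving one inspectable feature per filter."""
    fmaps = extractor.predict(images, verbose=0)   # shape (n, 14, 14, 512)
    return fmaps.mean(axis=(1, 2))                 # shape (n, 512)

# Placeholder data standing in for a preprocessed dataset such as
# CelebA (male vs. female) resized to 224x224.
images = np.random.rand(8, 224, 224, 3).astype("float32")
labels = np.random.randint(0, 2, size=8)

X = filter_activations(images)
tree = DecisionTreeClassifier(max_depth=4).fit(X, labels)

# The tree's split conditions over filter activations are the candidate
# human-readable rules.
print(export_text(tree,
                  feature_names=[f"filter_{i}" for i in range(X.shape[1])]))
```

In a real pipeline, each filter used by the tree would be mapped back to the image regions it responds to (e.g. facial landmarks in CelebA), which is what makes the resulting rules human interpretable.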