Article
Open Access

Rule Extraction in Trained Feedforward Deep Neural Networks: Integrating Cosine Similarity and Logic for Explainability

Abstract

Explainability is a key aspect of machine learning, necessary for ensuring transparency and trust in decision-making processes. As machine learning models become more complex, the integration of neural and symbolic approaches has emerged as a promising solution to the explainability problem. One effective approach involves using search techniques to extract rules from trained deep neural networks by examining weight and bias values and calculating their correlation with outputs. This article proposes incorporating cosine similarity into this process to narrow the search space and identify the critical path connecting inputs to final results. Additionally, the integration of first-order logic (FOL) is suggested to provide a more comprehensive and interpretable understanding of the decision-making process. By leveraging cosine similarity and FOL, an innovative algorithm capable of extracting and explaining rule patterns learned by a trained feedforward neural network was developed and tested in two use cases, demonstrating its effectiveness in providing insights into model behavior.
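To make the cosine-similarity idea concrete, below is a minimal sketch of how a critical path might be traced backward through a feedforward network by scoring each neuron's outgoing-weight vector against a reference direction. The function names, shapes, and the greedy one-neuron-per-layer selection are illustrative assumptions, not the paper's exact algorithm.

```python
import numpy as np

def cosine(a: np.ndarray, b: np.ndarray) -> float:
    # Cosine similarity between two vectors; epsilon guards against zero norms.
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12))

def critical_path(weights: list, class_index: int) -> list:
    """Trace a single input->output path through the network.

    weights[l] has shape (n_l, n_{l+1}), mapping layer l to layer l+1.
    Starting from a one-hot vector for the chosen output class, at each
    layer (moving backward) we keep the neuron whose outgoing-weight row
    is most cosine-similar to the current reference direction.
    """
    ref = None
    path = []
    for W in reversed(weights):
        n_in, n_out = W.shape
        if ref is None:
            ref = np.eye(n_out)[class_index]  # one-hot for the target class
        scores = [cosine(W[i], ref) for i in range(n_in)]
        best = int(np.argmax(scores))
        path.append(best)
        ref = np.eye(n_in)[best]  # next layer down targets this neuron
    return list(reversed(path))  # neuron indices from input layer upward

# Hypothetical usage on a small random network (4 inputs, 8 hidden, 3 outputs):
rng = np.random.default_rng(0)
W1 = rng.normal(size=(4, 8))
W2 = rng.normal(size=(8, 3))
print(critical_path([W1, W2], class_index=2))
```

Restricting attention to the highest-scoring neuron per layer is what shrinks the search space; a fuller implementation could keep the top-k candidates per layer and convert the resulting weight patterns into FOL rules.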

Keywords
Artificial Intelligence
Black Box Models
Cosine Similarity
Deep Learning
Distance Function
Entropy
Explainability
Feedforward Neural Network
Logic
Regularization
Rule Extraction
http://creativecommons.org/licenses/by/4.0/

This work is published under the Creative Commons Attribution 4.0 International (BY 4.0) license.
