Rule Extraction in Trained Feedforward Deep Neural Networks - Integrating Cosine Similarity and Logic for Explainability

Date
2024-12-30
Authors
Negro, Pablo Ariel
Pons, Claudia Fabiana
Publisher
Universidad Abierta Interamericana. Facultad de Tecnología Informática
Abstract
Explainability is a fundamental aspect of machine learning, particularly for ensuring transparency and trust in decision-making processes. As the complexity of machine learning models increases, the integration of neural and symbolic approaches has emerged as a promising solution to the explainability problem. In this context, search methods for rule extraction in trained deep neural networks have proven effective. These methods examine the weight and bias values generated by the network, typically by calculating the correlation between weight vectors and outputs. The hypothesis developed in this article is that incorporating cosine similarity into this process efficiently narrows the search space to identify the critical path connecting inputs to results. Furthermore, to provide a more comprehensive and interpretable understanding of the decision-making process, this article proposes integrating first-order logic (FOL) into the rule extraction process. By leveraging cosine similarity and FOL, an algorithm capable of extracting and explaining the rule patterns learned by a trained feedforward neural network was designed and implemented. The algorithm was tested on three use cases, demonstrating its effectiveness in providing insights into the model's behavior.
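The core idea of using cosine similarity to narrow the search space can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names (`cosine_similarity`, `rank_units`) and the choice of comparing each unit's incoming weight vector against a reference direction of the same dimensionality are assumptions made for the example.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def rank_units(weight_vectors, reference, top_k=2):
    """Rank units by |cosine similarity| between their incoming weight
    vector and a reference direction, keeping only the top_k as
    candidates on the critical input-to-output path (an assumed,
    simplified stand-in for the paper's search-space pruning)."""
    scores = [(i, abs(cosine_similarity(w, reference)))
              for i, w in enumerate(weight_vectors)]
    scores.sort(key=lambda t: t[1], reverse=True)
    return scores[:top_k]

# Example: three hidden units, reference direction [1, 0].
hidden = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
candidates = rank_units(hidden, [1.0, 0.0], top_k=2)
```

Units whose weight vectors align closely with the reference survive the pruning step; only these candidates would then be translated into FOL rules.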
Keywords
artificial intelligence, cosine similarity, deep learning, explainability, logic, rule extraction
Citation
Negro, P. A., & Pons, C. (2024). Rule extraction in trained feedforward deep neural networks: Integrating cosine similarity and logic for explainability. International Journal of Artificial Intelligence and Machine Learning, 13(1), 1-22.