Giter Site home page Giter Site logo

josesousaribeiro / xai-benchmark Goto Github PK

View Code? Open in Web Editor NEW
2.0 1.0 0.0 47.13 MB

This is a repository for reproducibility purposes. In this research, a large number of datasets were used to create different ML models, which were then explained by XAI measures. Seeking to identify situations where XAI measures agreed or disagreed with each other.

Jupyter Notebook 100.00%
ciu cluster dalex eli5 model shap skater xai lofo

xai-benchmark's Introduction

XAI Benchmark

This repository has research analysis data for Explainable Artificial Intelligence - XAI.

The research group responsible for this study is composed of:

José Ribeiro - site: https://sites.google.com/view/jose-sousa-ribeiro

Raíssa Silva - site: https://sites.google.com/site/silvarailors

Lucas Cardoso - site: http://lattes.cnpq.br/9591352011725008

Ronnie Alves (Leader) - site: https://sites.google.com/site/alvesrco/

Abstract

Strategies based on Explainable Artificial Intelligence - XAI have emerged in computing to promote a better understanding of predictions made by black box models. Most XAI measures used today explain these types of models, generating attribute rankings aimed at explaining the model, that is, the analysis of Attribute Importance of Model. There is no consensus on which XAI measure generates an overall explainability rank. For this reason, several proposals for tools have emerged (Ciu, Dalex, Eli5, Lofo, Shap and Skater). An experimental benchmark of explainable AI techniques capable of producing global explainability ranks based on tabular data related to different problems and ensemble models are presented herein. Seeking to answer questions such as "Are the explanations generated by the different measures the same, similar or different?" and How does data complexity play along model explainability?" The results from the construction of 82 computational models and 592 ranks shed some light on the other side of the problem of explainability: dataset complexity!

Proposal

This research raises discussions for the current moment in the XAI area through two main questions: --- Considering the current tools aimed at explaining black box machine learning models, it can be inferred that they generate global rankings of same, similar, or different explainabilities? --- Following the same idea as in the previous question, are the generations of equal, similar, or different explainabilities related to specific properties of a dataset?

Seeking to answer the two hypotheses presented above, this research emerges as a comparative analysis of different XAI metrics, capable of producing model-agnostic global explainability ranks based on tabular data related to different problems.

Benchmark

In the benchmark proposed by this research, the explanatory ranks of 6 different XAI metrics (Ciu, Dalex, Eli5, Lofo, Shap and Skater) are compared using Spearman coefficients, resulting in conclusive results.

The methodology is presented in the diagram below:

alt text

All the information needed for a better understanding and reproducibility of the procedures performed in the research can be accessed directly on the notebook (Google Colab) available at root, as well as in the files folders.

Cite this work

@inproceedings{ribeiro2021does,
  title={Does Dataset Complexity Matters for Model Explainers?},
  author={Ribeiro, Jos{\'e} and Silva, Ra{\'\i}ssa and Cardoso, Lucas and Alves, Ronnie},
  booktitle={2021 IEEE International Conference on Big Data (Big Data)},
  pages={5257--5265},
  year={2021},
  organization={IEEE}
}

xai-benchmark's People

Contributors

josesousaribeiro avatar

Stargazers

 avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.