Giter Site home page Giter Site logo

katakwar86 / drwhy Goto Github PK

View Code? Open in Web Editor NEW

This project forked from modeloriented/drwhy

0.0 1.0 0.0 8.49 MB

DrWhy is the collection of tools for Explainable AI (XAI). It's based on shared principles and simple grammar for exploration, explanation and visualisation of predictive models.

Home Page: https://ModelOriented.github.io/DrWhy/

R 100.00%

drwhy's Introduction

DrWhy.AI - the collection of tools for Visual Exploration, Explanation and Debugging of Predictive Models

It takes a village to raise a child model.

The way how we do predictive modeling is very ineffective. We spend way too much time on manual time consuming and easy to automate activities like data cleaning and exploration, crisp modeling, model validation. Instead of focusing on model understanding, productisation and communication.

Here we gather tools that can be use to make out work more efficient through the whole model lifecycle. The unified grammar beyond DrWhy.AI universe is described in the Predictive Models: Visual Exploration, Explanation and Debugging book.

Lifecycle for Predictive Models

The DrWhy is based on an unified Model Development Process based on RUP. Find an overview in the diagram below.

images/DALEXverse.png

Tools that are usefull during the model lifetime. MI2 stands for our internal tools.

1. Data preparation

  • dataMaid; A Suite of Checks for Identification of Potential Errors in a Data Frame as Part of the Data Screening Process
  • inspectdf; A collection of utilities for columnwise summary, comparison and visualisation of data frames.
  • validate; Professional data validation for the R environment
  • errorlocate; Find and replace erroneous fields in data using validation rules
  • ggplot2; System for declaratively creating graphics, based on The Grammar of Graphics.

2. Data understanding

  • Model Agnostic Variable Importance Scores. Surrogate learning = Train an elastic model and measure feature importance in such model. See DALEX, Model Class Reliance MCR
  • vip Variable importance plots
  • SAFE MI2 Surrogate learning = Train an elastic model and extract feature transformations.
  • xspliner MI2 Using surrogate black-boxes to train interpretable spline based additive models
  • factorMerger MI2 Set of tools for factors merging paper
  • ingredients MI2 Set of tools for model level feature effects and feature importance.

4. Model assembly

5. Model audit

  • auditor MI2 model verification, validation, and error analysis vigniette
  • DALEX MI2 Descriptive mAchine Learning EXplanations
  • iml; interpretable machine learning R package
  • randomForestExplainer MI2 A set of tools to understand what is happening inside a Random Forest
  • survxai MI2 Explanations for survival models paper

6. Model delivery

DrWhy.AI indicator panel

Active development and maintenance

These packages are actively developed and have active maintainer.

Experimental pre-seed phase (under active development)

Experimental or without maintenance (looking for maintainer!!!)

These packages contain useful features, are still in use but we are looking for an active maintainer.

  • randomForestExplainer CRAN_Status_Badge Build Status Coverage Status Downloads
  • factorMerger CRAN_Status_Badge Build Status Coverage Status Downloads
  • cr17 CRAN_Status_Badge Build Status Coverage Status Downloads
  • MLGenSig CRAN_Status_Badge Build Status Coverage Status Downloads
  • pyBreakDown Build Status Coverage Status

In the sunset phase, without maintenance

Key features from these packages are copied to another packages.

  • ceterisParibus CRAN_Status_Badge Build Status Coverage Status Downloads (development moved to ingredients)
  • ceterisParibus2 CRAN_Status_Badge Build Status Coverage Status (development moved to ingredients)
  • DALEX2 CRAN_Status_Badge Build Status Coverage StatusDownloads (development moved to DALEX)
  • breakDown CRAN_Status_Badge Build Status Coverage Status Downloads (development moved to iBreakDown)
  • live CRAN_Status_Badge Total Downloads Build Status Coverage StatusDownloads (development moved to localModel)

Family of Model Explainers

images/DrWhyAI.png

Family of Model Explainers

Architecture of DrWhy

DrWhy works on fully trained predictive models. Models can be created with any tool.

DrWhy uses DALEX2 package to wrap model with additional metadata required for explanations, like validation data, predict function etc.

Explainers for predictive models can be created with model agnostic or model specific functions implemented in various packages.

Architecture of DrWhy

Hype_Cycle

drwhy's People

Contributors

pbiecek avatar mstaniak avatar kant avatar maksymiuks avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.