Valda

Introduction

Valda is a Python package for data valuation in machine learning. If you are interested in

  • analyzing the contribution of individual training examples to the final classification performance, or
  • identifying some noisy examples in the training set,

you may be interested in the functions provided by this package.

Valda works with any scikit-learn classifier, as well as user-defined classifiers built with PyTorch. The current version supports five data valuation methods:

  • Leave-one-out (LOO)
  • Data Shapley with the TMC algorithm (TMC-Shapley) from Ghorbani and Zou (2019)
  • Beta Shapley from Kwon and Zou (2022)
  • Class-wise Shapley (CS-Shapley) from Schoch et al. (2022)
  • Influence Function (IF) from Koh and Liang (2017)
    • IF only works with classifiers built with PyTorch, because it requires gradient computation.
    • The current version only supports first-order gradient computation; we will add second-order computation soon.
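
To make the first two methods concrete, here is a minimal sketch of leave-one-out valuation and a truncated Monte Carlo estimate of Data Shapley. It is not Valda's API: the function names (`nearest_centroid_accuracy`, `loo_values`, `tmc_shapley_values`) are hypothetical, a small pure-Python nearest-centroid classifier stands in for a scikit-learn or PyTorch model, and scoring the empty training set as 0.0 is a simplifying assumption.

```python
import random


def nearest_centroid_accuracy(train, dev):
    """Utility: dev accuracy of a nearest-centroid classifier fit on `train`.

    `train` and `dev` are lists of (feature_vector, label) pairs.
    An empty training set is scored as 0.0 (a simplifying assumption).
    """
    if not train:
        return 0.0
    sums, counts = {}, {}
    for x, y in train:
        acc = sums.setdefault(y, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
        counts[y] = counts.get(y, 0) + 1
    centroids = {y: [v / counts[y] for v in s] for y, s in sums.items()}

    def predict(x):
        # Pick the label whose centroid is closest in squared Euclidean distance.
        return min(
            centroids,
            key=lambda y: sum((a - b) ** 2 for a, b in zip(x, centroids[y])),
        )

    return sum(predict(x) == y for x, y in dev) / len(dev)


def loo_values(train, dev, utility=nearest_centroid_accuracy):
    """Leave-one-out: value of example i = U(all) - U(all without i)."""
    full = utility(train, dev)
    return [full - utility(train[:i] + train[i + 1:], dev) for i in range(len(train))]


def tmc_shapley_values(train, dev, utility=nearest_centroid_accuracy,
                       n_perms=200, tol=1e-3, seed=0):
    """Truncated Monte Carlo estimate of Data Shapley values.

    Averages each example's marginal contribution over random permutations.
    Once the running score is within `tol` of the full-data score, the
    remaining marginal contributions in that permutation are taken as zero.
    """
    rng = random.Random(seed)
    n = len(train)
    full = utility(train, dev)
    values = [0.0] * n
    for _ in range(n_perms):
        perm = list(range(n))
        rng.shuffle(perm)
        subset, prev = [], utility([], dev)
        for idx in perm:
            if abs(full - prev) < tol:
                marginal = 0.0  # truncated: skip retraining
            else:
                subset.append(train[idx])
                cur = utility(subset, dev)
                marginal = cur - prev
                prev = cur
            values[idx] += marginal / n_perms
    return values


# Toy 1-D data: two clean examples per class plus one mislabeled example (index 4).
train = [([0.0], 0), ([0.2], 0), ([1.0], 1), ([1.2], 1), ([0.0], 1)]
dev = [([0.1], 0), ([0.25], 0), ([0.5], 0), ([0.9], 1), ([1.1], 1)]

loo = loo_values(train, dev)
shapley = tmc_shapley_values(train, dev)
print("LOO:", loo)          # the mislabeled example (index 4) gets a negative value
print("TMC-Shapley:", shapley)
```

On this toy data, removing the mislabeled example raises dev accuracy, so its leave-one-out value is negative while the clean examples score zero. That is the signal one would use to flag noisy training examples. Setting `tol=0.0` disables truncation, in which case the estimated values sum to the full-data utility minus the empty-set utility.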

Tutorial

Please check out the simple tutorial on Google Colab to learn how to use this package.

Contributors

uvanlp, jiyfeng, uvaml
