Giter Site home page Giter Site logo

gganssle / data-day-tx-2018 Goto Github PK

View Code? Open in Web Editor NEW
9.0 2.0 4.0 24.15 MB

Evaluation of gender bias in a graph-based classification algorithm.

License: Other

Python 0.88% Jupyter Notebook 99.12%
machine-learning-bias graph-convolutional-networks node-classification gender-equality gender-gap gender-inequality

data-day-tx-2018's Introduction

data-day-tx-2018

This is the talk given by Graham Ganssle and Steve Purves at Data Day Texas 2018. This talk was given in conjunction with Lynn Pausic and Chris LaCava's talk about how human bias is preserved in machine learning systems.

The Aim

We show how biased training data biases results of model outputs by assessing the qualification of loan applicants based on US Census data. We train our model on a dense, varied dataset and quantify the difference in apparent loan-worthiness with respect to applicant gender.

Are female loan applicants automatically screened out of credit applications by biased computer models?

Methods

We use a graph convolutional network to predict a node property (credit worthiness) from other node properties and edge connections to other credit applicants.

Data

The data used in this experiment is extracted from the 1994 US Census data. It is the commonly referenced Census-Income dataset, AKA the "Adult" dataset. We got it from the UCI ML Repo, here.

How do you run this thing?

You first have to condition the data by running the data_cleaning and test_cleaning notebooks. Then you have to run the graphicator notebook to build the graph and associated files out of the clean csv files.

Before you run train the GCN you have to build the GCN code. Do this by cd gcn; python setup.py install;. Then to train, cd into the one-level-deeper gcn/ and run the training script: cd gcn; python train --dataset credit.

A Tip of Our Hat

Creative Commons License
data-day-TX-2018 by Lynn Pausic, Graham Ganssle, Steve Purves, Expero Inc is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License, except where otherwise noted.

This work borrows heavily from Graph Convolutional Networks by Thomas Kipf and Max Welling, licensed MIT: ©Thomas Kipf, 2016. You can find their excellent paper here.

The data used in this experiment was obtained from the UCI ML Repository: Lichman, M. (2013). UCI Machine Learning Repository [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science.

data-day-tx-2018's People

Contributors

gganssle avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.