michaeljwelsh / med-school-dataset-curation-and-analysis

Medical school admissions dataset curation via web scraping and exploratory data analysis

License: MIT License

Languages: Jupyter Notebook 100.00%
Topics: admissions, dataset-creation, exploratory-data-analysis, medical-school, presentation, webscraping

Medical School Admissions Dataset Curation via Web Scraping and Exploratory Data Analysis

Across the roughly 170 medical schools in the U.S., the average acceptance rate is around 5.5%. Airfare for interviews alone can exceed $500, on top of the hundreds of dollars it costs to submit primary and secondary applications to just a single school. Despite these expenses, applicants generally need to apply to 20-30 schools to get an acceptance, and many cannot afford, literally or figuratively, to go unaccepted and reapply the following year. How do you pick a list of schools that maximizes your chances of acceptance?

In this final project, medical school admission statistics are scraped from the web and assembled into a dataset. The dataset includes MCAT/GPA quantiles, in-state and out-of-state acceptance rates and bias, demographics, geography, funding and institution type, and residency match rates by specialty, among other fields. Exploratory data analysis is then used to determine which schools my fiancée, given her background, should apply to in order to maximize her chances of acceptance this cycle.
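To give a rough idea of the kind of filtering such an analysis might involve, here is a minimal sketch in plain Python. It is not the code or data from this project: the `School` fields, the `shortlist` helper, the 10th-90th percentile rule, and every value in `SCHOOLS` are hypothetical placeholders chosen only for illustration. It keeps schools whose MCAT and GPA ranges contain the applicant's scores, then ranks them by the acceptance rate that matches the applicant's residency.

```python
from dataclasses import dataclass


@dataclass
class School:
    # All field names are assumptions made for this sketch.
    name: str
    state: str
    mcat_p10: int          # 10th-percentile MCAT of accepted students
    mcat_p90: int          # 90th-percentile MCAT of accepted students
    gpa_p10: float
    gpa_p90: float
    in_state_rate: float   # acceptance rate for in-state applicants
    out_state_rate: float  # acceptance rate for out-of-state applicants


def shortlist(schools, mcat, gpa, home_state, top_n=3):
    """Return (name, rate) pairs for schools whose score ranges contain
    the applicant, ranked by residency-specific acceptance rate."""
    def rate(s):
        return s.in_state_rate if s.state == home_state else s.out_state_rate

    in_range = [
        s for s in schools
        if s.mcat_p10 <= mcat <= s.mcat_p90 and s.gpa_p10 <= gpa <= s.gpa_p90
    ]
    ranked = sorted(in_range, key=rate, reverse=True)
    return [(s.name, rate(s)) for s in ranked[:top_n]]


# Placeholder rows, NOT real admissions statistics.
SCHOOLS = [
    School("School A", "PA", 508, 520, 3.50, 3.95, 0.08, 0.03),
    School("School B", "NY", 512, 524, 3.70, 4.00, 0.05, 0.04),
    School("School C", "PA", 505, 517, 3.40, 3.90, 0.10, 0.02),
    School("School D", "OH", 506, 518, 3.45, 3.92, 0.09, 0.05),
]

print(shortlist(SCHOOLS, mcat=510, gpa=3.6, home_state="PA"))
```

With these placeholder rows, School B drops out because its MCAT range starts above the applicant's score, and the in-state schools rank first because their in-state rates are higher. A real analysis would likely weigh more of the fields listed above, such as demographics or match rates.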

Drexel class: Data Science 521, Data Analysis and Interpretation

Data Usage

Note that due to AAMC data usage policies, I am not allowed to share or distribute the data I collected or the code used to parse and transform it into a dataset. I can share my exploratory data analysis and presentation, which I hope you'll enjoy.
