nfl-player-stats's Issues

Resolve stats across players on a team

We should resolve certain stats from one player to other players in the same game. For example, if the QB is sacked, we should add a field for the linemen that tracks the number of sacks they allowed.

It will take some brainstorming to list all the stats we want to resolve to other players after the scrape has finished.
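
As a starting point for that brainstorm, here is a minimal sketch of the sack example. It assumes the scrape yields one dict per player per game; the field names (`game_id`, `team`, `position`, `times_sacked`, `sacks_allowed`) and the position codes in `OFFENSIVE_LINE` are hypothetical, not the scraper's actual schema. It also crudely credits every sack to every lineman on the team, which we would probably want to refine.

```python
# Hypothetical position codes for offensive linemen.
OFFENSIVE_LINE = {"C", "G", "T", "OL"}


def resolve_sacks_allowed(game_rows):
    """Copy each team's QB sack total onto that team's linemen for the same game."""
    # Sum the sacks taken by QBs, keyed by (game, team).
    sacks_taken = {}
    for row in game_rows:
        if row["position"] == "QB":
            key = (row["game_id"], row["team"])
            sacks_taken[key] = sacks_taken.get(key, 0) + row.get("times_sacked", 0)

    # Return copies of the rows; linemen gain a "sacks_allowed" field.
    resolved = []
    for row in game_rows:
        row = dict(row)
        if row["position"] in OFFENSIVE_LINE:
            key = (row["game_id"], row["team"])
            row["sacks_allowed"] = sacks_taken.get(key, 0)
        resolved.append(row)
    return resolved


# Made-up rows, for illustration only.
rows = [
    {"game_id": "g1", "team": "DAL", "position": "QB", "times_sacked": 3},
    {"game_id": "g1", "team": "DAL", "position": "C"},
    {"game_id": "g1", "team": "DAL", "position": "WR"},
]
resolved = resolve_sacks_allowed(rows)
```

Running this as a post-processing pass keeps the scraper itself unchanged; other resolved stats could follow the same two-pass pattern.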

Team IDs

Regarding future improvement 2:

Extract Team IDs to make relating players across teams and games easier

Did you have in mind a Team ID other than the three-letter code (e.g. DAL)? Perhaps a unique team ID for each season of a franchise, or an ID for each unique group of players? This could be used to distinguish high-performing from low-performing teams that belong to the same franchise but played in different seasons.
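
To make the per-season option concrete, a minimal sketch: combine the three-letter code with the season year. The function name and the `CODE-YEAR` format are assumptions for illustration, not something the project already defines.

```python
def team_season_id(team_code, season):
    """Build a per-season team ID from a three-letter code and a season year."""
    return f"{team_code.strip().upper()}-{int(season)}"


# Two seasons of the same franchise get distinct IDs.
id_a = team_season_id("DAL", 2018)
id_b = team_season_id("dal", 2019)
```

An ID for each unique group of players would need roster data and is not covered by this sketch.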

Migrate saving data to SQLite database

In the long term (when we have daily/weekly scrape jobs) we will want a more robust setup, but for now it would be nice to save things into an SQLite database so that we can easily query the data and start to get an idea of how we want to structure the database records going forward.
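
A minimal sketch of what this could look like with Python's built-in `sqlite3` module. The table name, columns, and the sample row are hypothetical placeholders; the real layout is exactly what we still need to decide.

```python
import sqlite3


def save_player_stats(conn, rows):
    """Store (player_id, game_id, stat, value) tuples, replacing duplicates."""
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS player_game_stats (
            player_id TEXT NOT NULL,
            game_id   TEXT NOT NULL,
            stat      TEXT NOT NULL,
            value     REAL,
            PRIMARY KEY (player_id, game_id, stat)
        )
        """
    )
    # INSERT OR REPLACE makes re-running a scrape idempotent for a given key.
    conn.executemany(
        "INSERT OR REPLACE INTO player_game_stats VALUES (?, ?, ?, ?)", rows
    )
    conn.commit()


# In-memory database for experimenting; pass a file path to persist.
conn = sqlite3.connect(":memory:")
save_player_stats(conn, [("player_1", "game_1", "sacks", 1.0)])
save_player_stats(conn, [("player_1", "game_1", "sacks", 2.0)])
row_count = conn.execute("SELECT COUNT(*) FROM player_game_stats").fetchone()[0]
```

The long "one row per stat" shape is only one option; it avoids schema changes when new stats appear, at the cost of more verbose queries.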

Scrape College Data

We could scrape college data either through the link on pro-football-reference.com or through another site.

Problem scraping player stats

The script raised this exception when I tried to scrape player stats. I ran the test in Jupyter:

ConnectionError: HTTPSConnectionPool(host='www.pro-football-reference.comhttps', port=443): Max retries exceeded with url: //stathead.com/football/pgl_finder.cgi?player_id=AaitIs00 (Caused by NewConnectionError('<urllib3.connection.VerifiedHTTPSConnection object at 0x7fc45c241520>: Failed to establish a new connection: [Errno -2] Name or service not known'))
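
The host name in the error (`www.pro-football-reference.comhttps`) suggests that the site's base URL was prepended to a link that was already absolute (`https://stathead.com/...`). This is a guess from the traceback, not a confirmed diagnosis. If so, a possible fix is to build URLs with `urllib.parse.urljoin`, which leaves absolute links untouched. `BASE_URL` and `absolute_url` are hypothetical names, and the relative path below is only illustrative.

```python
from urllib.parse import urljoin

# Assumed base URL of the site being scraped.
BASE_URL = "https://www.pro-football-reference.com"


def absolute_url(href):
    """Resolve a scraped href against BASE_URL without doubling absolute URLs."""
    return urljoin(BASE_URL, href)


# A relative link is resolved against the base URL.
relative = absolute_url("/players/A/AaitIs00.htm")

# An already-absolute link (as in the error above) is returned unchanged.
external = absolute_url(
    "https://stathead.com/football/pgl_finder.cgi?player_id=AaitIs00"
)
```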

Universal Player, Game, and Team IDs

I think we need to come up with a way to resolve IDs across sites for when we expand our scraping. So instead of blindly assigning players integer IDs, we could take the hash of their name, birthday, and hometown, or a similar set of fields. That seems like a fairly unique combination. The only catch is that we should only use info that is likely to be on every site we scrape. Some sites might not list the player's hometown, in which case we would not be able to resolve the ID.

Let's brainstorm the best ways to resolve IDs for players, games, and teams so that we can keep single database records for each while pulling from many data sources.
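
One possible shape for the player case, as a sketch to brainstorm against: hash a normalized name plus birth date. Hometown is left out here because, as noted above, not every site may have it. The function names, the ISO date format, and the 16-character truncation are all assumptions, and two players who share a name and birth date would still collide.

```python
import hashlib
import unicodedata


def _normalize(value):
    """Lowercase, strip accents, and collapse whitespace so sites agree."""
    text = unicodedata.normalize("NFKD", value)
    text = text.encode("ascii", "ignore").decode("ascii")
    return " ".join(text.lower().split())


def player_uid(name, birth_date):
    """Derive a stable ID from a player's name and ISO birth date (YYYY-MM-DD)."""
    key = "|".join([_normalize(name), birth_date.strip()])
    return hashlib.sha256(key.encode("utf-8")).hexdigest()[:16]


# The same (made-up) player formatted differently on two sites maps to one ID.
uid_site_a = player_uid("José  Example", "1990-01-01")
uid_site_b = player_uid("jose example", "1990-01-01")
```

Games and teams could use the same approach with their own key fields, for example date plus the two team codes for a game.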

More Player Data

We should research where we can find more granular player data, such as fumbles, passes defended, etc. We might have to explore scraping other sites.
