Giter Site home page Giter Site logo

pdsnd_github's Introduction

Python Script to Explore US Bikeshare Data This Python script is written for Project 2 (Term 1) of Udacity's Data Analyst Nanodegree (DAND) and is used to explore data related to bike share systems for Chicago, New York City, and Washington. It imports data from csv files and compute descriptive statistics from the data. It also takes in users' raw input to create an interactive experience in the terminal to present these statistics.

How to run the script You can run the script using a Python integrated development environment (IDE) such as Spyder. To install Spyder, you will need to download the Anaconda installer. This script is written in Python 3, so you will need the Python 3.x version of the installer. After downloading and installing Anaconda, you will find the Spyder IDE by opening Anaconda Navigator.

Datasets The datasets used for this script contain bike share data for the first six months of 2017. Some data wrangling has been performed by Udacity's staff before being provided to the students of DAND. Under the permission of Udacity, I have uploaded a copy of the datasets here. The file sizes are too big to be uploaded on GitHub, so they were uploaded on Google Drive instead. After downloading the datasets, place them in the same folder with this Python script.

The data is provided by Motivate, which is a bike share system provider for many cities in the United States. The data files for all three cities contain the same six columns:

Start Time End Time Trip Duration (in seconds) Start Station End Station User Type (Subscriber or Customer) The Chicago and New York City files also contain the following two columns:

Gender Birth Year Questions explored The script answers the following questions about the bike share data:

What is the most popular month for start time? What is the most popular day of week (Monday, Tuesday, etc.) for start time? What is the most popular hour of day for start time? What is the total trip duration and average trip duration? What is the most popular start station and most popular end station? What is the most popular trip? What are the counts of each user type? What are the counts of gender? What are the earliest (i.e. oldest person), most recent (i.e. youngest person), and most popular birth years?

Future scopes In the future, more functions that compute statistics will be added to answer more questions about the data. The possibilities of improving the interactive experience (e.g turning this script into a web app) will also be explored.

Resources referred to complete this project Use parse_dates to recognize datetime columns:

https://stackoverflow.com/questions/21269399/datetime-dtypes-in-pandas-read-csv https://stackoverflow.com/questions/17465045/can-pandas-automatically-recognize-dates https://pandas.pydata.org/pandas-docs/stable/generated/pandas.read_csv.html

pdsnd_github's People

Contributors

tapariak avatar rbudacprojects avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.