Giter Site home page Giter Site logo

jajokine / time-series-modeling Goto Github PK

View Code? Open in Web Editor NEW
4.0 1.0 7.0 811 KB

MITx - MicroMasters Program on Statistics and Data Science - Data Analysis: Statistical Modeling and Computation in Applications - Fourth Project

License: MIT License

Jupyter Notebook 100.00%
linear-regression autoregressive-model autoreg sarimax predictive-modeling forecasting-model time-series time-series-analysis financial-data environmental-modelling

time-series-modeling's Introduction

Time-Series-Modeling

MITx - MicroMasters Program on Statistics and Data Science - Data Analysis: Statistical Modeling and Computation in Applications - Fourth Project

The fourth project of the MIT MicroMasters Program course on Data Analysis focused on time series data where each observation corresponds to one point in time. The objectives included carrying first some descriptive analysis with the help of visualizing the different components and their dependencies before moving towards modeling and eventually forecasting with the model. The three datasets include an environmental dataset that is said to be the most important data set in modern climate research, and two financial ones, which come from the MIT Billion Prices project (www.thebillionpricesproject.com).

The first dataset consists recordings of the concentration of carbon dioxide - CO2 - by Charles David Keeling at Mauna Loa, a volcano on the Hawaii island that he, and later his son, recorded since 1958 till now. What makes this data unique is that the samples have been taken for over a 60 year time span with the same measuring technique with very few interruptions and inhomogeneities. It is the longest continuous record of atmospheric carbon dioxide in the world and is an essential piece of evidence of the man-made increases in greenhouse gases that are believed to be the cause of global warming. This dataset was analyzed with a linear regression model that included non-linear and seasonal components.

The financial dataset contain the Consumer Price Index (CPI) which tracks inflation through a market basket of consumer goods and services and which is released monthly by the Bureau of Labor Statistics, the PriceStats daily consumer price index, and the Break-Even Rate (BER) which is the difference in yield between a fixed rate and inflation adjusted 10 year treasury note, and which can be seen as the market's view for the inflation rate for the next 10 years, on average. Some calculations concerning different ways to calculate the inflation rate was first done before modeling the dataset with a SARIMAX model that included autoregressive and moving average components, as well as seasonal and exogenous variables.

The goal was to analyze each of the datasets and understand its variations before building a model to predict and analyze the results. Various different techniques and statistical methods were used to analyze and build the final models.

Dataset

- co2.csv: Provides a time series of the concentration of CO2 recorded at Mauna Loa for each month starting from March 1958.

- PriceStats_CPI.csv: Contains the Consumer Price index (CPI) and PriceStats consumer price index starting from July 2008.

- T10YIE.csv: Contains a time series of the BER starting from January 2003. 

Access and Requirements

The file project4.ipynb is the Jupyter Notebook that contains all the code, visualizations and analysis of the project.

The dependencies and requirements can be seen from requirements.txt that can be installed in shell with the command:

  pip install -r requirements.txt

time-series-modeling's People

Contributors

jajokine avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.