Sales/Demand forecasting

Dataset

The dataset was originally published for forecasting the total number of products sold in every shop over the test period. Here, the test set is created from the last month of the training data (October 2015) with the target (item_cnt_day) removed. The task is to forecast October 2015 sales for the shops and products in that held-out set, which makes it possible to compare each method's predictions with the actual values. Note that the list of shops and products changes slightly every month, so building a model that is robust to such changes is part of the challenge.

While keeping the original purpose in mind, I use the daily historical sales data here to learn time series forecasting methods. Instead of the provided test dataset (which does not contain the target), I use the last month of the training dataset as the test/validation data for the model.
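The hold-out described above can be sketched as follows. This is a minimal illustration on a tiny made-up DataFrame standing in for sales_train.csv, not the notebook's actual code; the column name `item_cnt_month` and the variable names are assumptions introduced here.

```python
import pandas as pd

# Tiny synthetic stand-in for sales_train.csv (values are made up for illustration).
sales = pd.DataFrame({
    "date_block_num": [31, 32, 32, 33, 33, 33],
    "shop_id":        [5, 5, 5, 5, 5, 6],
    "item_id":        [100, 100, 100, 100, 100, 200],
    "item_cnt_day":   [1.0, 2.0, 1.0, 3.0, 1.0, 2.0],
})

# Aggregate daily counts into monthly totals per shop and item.
# "item_cnt_month" is a hypothetical column name chosen for this sketch.
monthly = (
    sales.groupby(["date_block_num", "shop_id", "item_id"], as_index=False)["item_cnt_day"]
    .sum()
    .rename(columns={"item_cnt_day": "item_cnt_month"})
)

# The last month in the data (block 33, October 2015) becomes the test set.
last_block = monthly["date_block_num"].max()
train = monthly[monthly["date_block_num"] < last_block]
test = monthly[monthly["date_block_num"] == last_block]

# Model input for the held-out month: same rows with the target removed.
test_features = test.drop(columns=["item_cnt_month"])
test_actuals = test["item_cnt_month"]
```

Keeping `test_actuals` separate is what allows each forecasting method to be scored against real October values later.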

CSV files

sales_train.csv - the training set. Daily historical data from January 2013 to October 2015.
items.csv - supplemental information about the items/products.
item_categories.csv - supplemental information about the item categories.
shops.csv - supplemental information about the shops.
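One plausible way to combine these files is to left-join the supplemental tables onto the sales records using the shared ID columns. The sketch below uses small synthetic DataFrames in place of the real CSVs (which would normally be loaded with `pd.read_csv`); the sample values and the name `enriched` are assumptions made for illustration.

```python
import pandas as pd

# Synthetic stand-ins for the four CSV files (made-up values).
sales = pd.DataFrame({"shop_id": [5, 6], "item_id": [100, 200], "item_cnt_day": [1.0, 2.0]})
items = pd.DataFrame({"item_id": [100, 200], "item_name": ["A", "B"], "item_category_id": [10, 20]})
item_categories = pd.DataFrame({"item_category_id": [10, 20], "item_category_name": ["X", "Y"]})
shops = pd.DataFrame({"shop_id": [5, 6], "shop_name": ["S5", "S6"]})

# Left joins keep every sales row even if a lookup table lacks a matching ID.
enriched = (
    sales.merge(items, on="item_id", how="left")
    .merge(item_categories, on="item_category_id", how="left")
    .merge(shops, on="shop_id", how="left")
)
```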

Data fields

shop_id - unique identifier of a shop
item_id - unique identifier of a product
item_category_id - unique identifier of item category
item_cnt_day - number of products sold on a given day. The forecasting target is the monthly total of this measure.
item_price - current price of an item
date - date in format dd/mm/yyyy
date_block_num - a consecutive month number, used for convenience. January 2013 is 0, February 2013 is 1, ..., October 2015 is 33
item_name - name of item (in Russian)
shop_name - name of shop (in Russian)
item_category_name - name of item category (in Russian)
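To make the relationship between date and date_block_num concrete, here is a small sketch that derives the month index from a date string. The helper name and the default `fmt` argument are assumptions based on the dd/mm/yyyy format stated above; if the raw file uses a different separator, the format string would need adjusting.

```python
from datetime import datetime

def date_block_num(date_str, fmt="%d/%m/%Y"):
    """Return the consecutive month number, counting January 2013 as 0.

    Hypothetical helper for illustration; `fmt` assumes dd/mm/yyyy dates.
    """
    d = datetime.strptime(date_str, fmt)
    return (d.year - 2013) * 12 + (d.month - 1)

first_block = date_block_num("15/01/2013")
second_block = date_block_num("01/02/2013")
final_block = date_block_num("31/10/2015")
```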

Process

Exploratory Data Analysis
Feature Engineering
Post-Feature-Engineering EDA
Prediction accuracy measurement
Forecasting
1. Moving average
2. ARIMA
3. SARIMA
4. Exponential Smoothing
5. Regression
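
As a rough illustration of the simplest methods in this list and of how accuracy can be measured, the sketch below implements a moving average forecast, simple exponential smoothing, and RMSE in plain Python. It is a hand-rolled stand-in under stated assumptions, not the statsmodels or pmdarima code used in the notebooks; the function names, the window size, the smoothing factor `alpha`, and the sample series are all made up for this example.

```python
import math

def moving_average_forecast(history, window=3):
    """Forecast the next value as the mean of the last `window` observations."""
    recent = history[-window:]
    return sum(recent) / len(recent)

def simple_exp_smoothing_forecast(history, alpha=0.5):
    """One-step forecast from simple exponential smoothing (no trend or seasonality)."""
    level = history[0]
    for y in history[1:]:
        # Blend the newest observation with the previous smoothed level.
        level = alpha * y + (1 - alpha) * level
    return level

def rmse(actual, predicted):
    """Root mean squared error between two equal-length sequences."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))

# Made-up monthly totals and a made-up held-out value.
monthly_sales = [10.0, 12.0, 14.0, 16.0]
actual_next = 18.0

ma = moving_average_forecast(monthly_sales, window=3)
ses = simple_exp_smoothing_forecast(monthly_sales, alpha=0.5)
ma_error = rmse([actual_next], [ma])
ses_error = rmse([actual_next], [ses])
```

On a steadily rising series like this one, both methods lag behind the trend, which is one reason to also try ARIMA-family models and regression.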

Tools and libraries

Python / pandas / NumPy / matplotlib
JupyterLab
sklearn.metrics
statsmodels
pmdarima
