The puprose of the project is to prepare data for analytical team. The project consists of the script which creates tables in Postgres database and ETL pipeline that loads data from the json files which contain users activity in the app and metadata about the songs into Postgres tables.
- execute create_tables.py - this script creates database sparkifydb and all needed tables in it.
- to test that tables were created run
- execute etl.py - this script will proccess all json files and insert data into Postgres tables
Python 3
Postgres Database
Run test.py to confirm your records were successfully inserted into each table.
Data model is represnted in star schema which consists of fact table: songplays and dimension tables: users,songs,artists,time