An example process for creating a Postgres database with a simple star schema and populating it with sample data.
The database describes and stores data related to an imaginary song streaming service, "Sparkify". This includes information related to songs, artists, users, and song plays.
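As a rough illustration of what a star schema for this kind of data can look like, the sketch below builds one fact table (song plays) that points at three dimension tables (users, songs, artists) and joins them back together. The table and column names are assumptions made for the example, not the project's actual schema, and sqlite3 is used only as a stand-in so the sketch runs without a Postgres server.

```python
import sqlite3

# Hypothetical schema for illustration; the project's real tables are defined
# in sql_queries.py. sqlite3 stands in for Postgres so this runs anywhere.
SCHEMA = """
CREATE TABLE users   (user_id INTEGER PRIMARY KEY, first_name TEXT, last_name TEXT);
CREATE TABLE artists (artist_id TEXT PRIMARY KEY, name TEXT);
CREATE TABLE songs   (song_id TEXT PRIMARY KEY, title TEXT);
-- Fact table: one row per song play, referencing each dimension table.
CREATE TABLE songplays (
    songplay_id INTEGER PRIMARY KEY,
    start_time  TEXT,
    user_id     INTEGER REFERENCES users(user_id),
    song_id     TEXT REFERENCES songs(song_id),
    artist_id   TEXT REFERENCES artists(artist_id)
);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Made-up sample rows, one per table.
conn.execute("INSERT INTO users VALUES (1, 'Ada', 'Lovelace')")
conn.execute("INSERT INTO artists VALUES ('AR1', 'Example Artist')")
conn.execute("INSERT INTO songs VALUES ('SO1', 'Example Song')")
conn.execute("INSERT INTO songplays VALUES (1, '2018-11-01 21:01:46', 1, 'SO1', 'AR1')")

# A typical star-schema query: start at the fact table, join out to dimensions.
rows = conn.execute("""
    SELECT u.first_name, s.title, a.name
    FROM songplays sp
    JOIN users u   ON u.user_id = sp.user_id
    JOIN songs s   ON s.song_id = sp.song_id
    JOIN artists a ON a.artist_id = sp.artist_id
""").fetchall()
print(rows)
```

The point of the layout is that analytical questions about song plays need only single-hop joins from the fact table to each dimension.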
- data/: directory containing sample song files and logs of song play events
- sql_queries.py: SQL commands stored as text strings for table creation, insertion, and querying
- create_tables.py: initializes the sparkify database, and runs queries for table creation from sql_queries.py
- etl.py: connects to sparkify db, processes files from the data/ directory, and inserts transformed data into db using queries from sql_queries.py
- etl.ipynb: workbook for developing processes used in etl.py
- test.ipynb: connects to sparkify db and runs test queries to confirm successful table creation and data insertion
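The division of labour between these files might be sketched roughly as follows: SQL lives in plain strings, a setup step loops over the drop and create strings, and a load step passes transformed rows to an insert string. All variable names, function names, and columns here are hypothetical, and sqlite3 again stands in for the Postgres connection so the sketch is runnable on its own.

```python
import sqlite3

# Hypothetical stand-ins for the kind of strings kept in sql_queries.py.
user_table_drop = "DROP TABLE IF EXISTS users"
user_table_create = (
    "CREATE TABLE IF NOT EXISTS users (user_id INTEGER PRIMARY KEY, level TEXT)"
)
user_table_insert = "INSERT INTO users (user_id, level) VALUES (?, ?)"

drop_table_queries = [user_table_drop]
create_table_queries = [user_table_create]


def reset_tables(conn):
    """Drop and recreate every table, in the spirit of create_tables.py."""
    cur = conn.cursor()
    for query in drop_table_queries + create_table_queries:
        cur.execute(query)
    conn.commit()


def load_users(conn, records):
    """Insert already-transformed rows, in the spirit of etl.py."""
    cur = conn.cursor()
    cur.executemany(user_table_insert, records)
    conn.commit()


conn = sqlite3.connect(":memory:")  # stand-in for the sparkify Postgres db
reset_tables(conn)
load_users(conn, [(1, "free"), (2, "paid")])
count = conn.execute("SELECT COUNT(*) FROM users").fetchone()[0]
print(count)
```

With a Postgres driver such as psycopg2 the structure would be the same, but the parameter placeholder in the insert string is `%s` rather than `?`.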
- run create_tables.py
- run etl.py
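The two steps above can also be driven from a small helper script. This is a minimal sketch, assuming both scripts sit in the current working directory and take no arguments; `build_commands` and `run_steps` are hypothetical names, not part of the project.

```python
import subprocess
import sys

# Order matters: the tables must exist before etl.py inserts into them.
STEPS = ["create_tables.py", "etl.py"]


def build_commands(steps=STEPS):
    """Return one command per script, using the current Python interpreter."""
    return [[sys.executable, script] for script in steps]


def run_steps(steps=STEPS, runner=subprocess.run):
    """Run each script in order; check=True stops at the first failure."""
    for command in build_commands(steps):
        runner(command, check=True)
```

Calling `run_steps()` from the project directory runs both scripts in sequence; test.ipynb can then be used to confirm that the tables were created and populated.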