Hello there! And welcome to my third project about data engineering.
In this one, we'll use some things in AWS as S3, Athena and Glue.
For use this project, you need to follow the steps bellow.
You need to open the /data
folder, there contains the data to run in this project.
Although, the data provided in there is malformed, so follow the instructions of the folder.
Confiture a .aws
folder or export the credentials to run the next steps.
Enter the terraform
folder and follow the instructions to set the enviroment on AWS
Run the command bellow
# This will send to the data from the `/data/trusted` to the S3
make start
Well, this project was maded to run in the AWS Console, so get the code em the sql
folder to create the landing tables.
And create Glue Jobs with the .py
files in the folder jobs
That's all! 😊