Raman Kaur's Projects
The Internals of Apache Spark
ETL Pipeline and Data Modelling in Apache Cassandra and PostgreSQL
Contains all of the queries used within the Complete Guide to Elasticsearch course.
Complete data and ML pipeline for a forecasting model of electricity consumption in Victoria
Data analysis using pandas and GeoPandas, including reverse geocoding
Collecting data for analysis using web scrapers/crawlers and APIs
Conversion from 3NF to dimensional modelling, OLAP operations, and advanced SQL queries (DML)
Data warehouse and data modelling in AWS using S3 and Redshift
Web application deployment using a Docker container and Heroku
Flask app for machine learning predictions with an embedded Dash app
Video analytics in Python using pre-trained deep learning models for face emotion detection, speech-to-text, and text sentiment analysis
The Internals of Apache Kafka
Spinning up an EMR cluster for ELT using Spark and HDFS (data lake), extracting and loading data with S3
Spark analytics using PySpark, Spark DataFrames and Spark SQL: parsing user logs and handling unstructured data
Spark standalone and local architectures, and reading Hadoop file formats (Avro, Parquet and ORC)
Apache Spark™ and Scala Workshops
Course notes for Computational Statistics and Statistical Computing