Omar Mahmoud's Projects
Gather essential product data from Amazon with ease using this Python web scraper and Selenium. Extract product descriptions, prices, ratings, and more for insightful market research and analysis.
Project to analyze A/B test results using Python
Unlock the power of data with our comprehensive Talend project aimed at constructing a robust data warehouse (DWH) from the renowned Northwind dataset. Divided into two pivotal phases, this project integrates data from the Northwind Access Database and the Transactional Database in SQL Server.
Using the AdventureWorks2022 dataset, I built a Sales Data Mart with SQL Server Integration Services (SSIS) and the modeling capabilities of SQL Server. This data mart offers several benefits, making it a valuable component in the main purpose of data management and analytics wi
Converting Nested JSON Structures to Pandas DataFrames
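The core idea can be sketched without any dependencies. The example below is a minimal illustration, not the project's actual code: the `flatten` helper and the sample `records` are hypothetical, and nested keys are joined with a dot to form flat column names.

```python
def flatten(record, parent_key="", sep="."):
    """Flatten nested dicts into one level, joining keys with `sep`."""
    flat = {}
    for key, value in record.items():
        full_key = f"{parent_key}{sep}{key}" if parent_key else key
        if isinstance(value, dict):
            # Recurse into nested objects and merge their flattened keys.
            flat.update(flatten(value, full_key, sep))
        else:
            flat[full_key] = value
    return flat


# Hypothetical nested records, used only for illustration.
records = [
    {"id": 1, "user": {"name": "Ana", "address": {"city": "Cairo"}}},
    {"id": 2, "user": {"name": "Ben", "address": {"city": "Giza"}}},
]

# Each flattened dict is one row; keys become column names.
rows = [flatten(r) for r in records]
print(rows[0])
```

A list of flat dicts like `rows` can be passed straight to `pandas.DataFrame(rows)`. Pandas also ships `pandas.json_normalize`, which performs a similar flattening with a dot separator by default.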
Customer Churn Analytics Data Pipeline using Apache Airflow, AWS Glue, S3, Redshift, and Power BI
Data cleaning is the process of preparing raw, unstructured, or messy data for analysis, here using Python and the Pandas library. It involves tasks such as handling missing values, removing duplicates, correcting data types, and transforming data into a more usable format. Data cleaning is a crucial step in the data preprocessing pipeline.
The Sparkify Music Streaming Analysis project focuses on creating a NoSQL database and an ETL pipeline for Sparkify, a music streaming startup. Sparkify aims to analyze the data collected from its new music streaming app, covering songs and user activities.
This challenge is designed to explore and analyze factors contributing to employee attrition in a simulated HR setting using a dataset from IBM.
When it comes to streaming media, Netflix is the king. The company that was founded 20 years ago as a mail-order DVD rental service has since transformed its business model completely to match the ever-changing tech landscape. As a result of that, the company now boasts more than 200 million subscribers worldwide and secures a spot as one of the bi
Netflix is a leading player in streaming media with over 200 million global subscribers. Its transformation from DVD rental service to media publisher through its Netflix Originals program has made it a dominant player in the industry.
This project encompasses the complete data lifecycle, from data extraction and transformation to in-depth analysis and compelling visualizations. The process is divided into three main phases
Use Case: Data Warehouse Design and ETL Process for Healthcare Data, with Insights from SSAS
Working with JSON (JavaScript Object Notation) in Python is quite straightforward and commonly used, especially when dealing with data interchange between different systems or when storing configuration data. Python provides built-in libraries for working with JSON data. Here's how you can work with JSON in Python
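A minimal sketch of the round trip with the standard-library `json` module. The `config` dictionary and its keys are made up for illustration.

```python
import json

# Hypothetical configuration data, used only for illustration.
config = {"name": "demo", "retries": 3, "tags": ["etl", "json"], "debug": False}

# Serialize a Python object to a JSON string.
text = json.dumps(config, indent=2, sort_keys=True)

# Parse the JSON string back into Python objects.
restored = json.loads(text)

# JSON objects, arrays, numbers, and booleans map to dict, list, int/float, and bool.
print(restored["tags"][0])
print(restored == config)
```

For files, `json.dump(obj, file)` and `json.load(file)` work the same way on open file objects instead of strings.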
This repo hosts the solutions for a comprehensive workshop on mastering Python programming through HackerRank challenges. Whether you're a beginner looking to learn Python from scratch or an experienced programmer aiming to enhance your Python skills, this workshop provides a structured learning path that covers a wide range of topics.
Unveiling Cinematic Brilliance: Illuminating the Future of Movie Production through Data-Driven Insights
Build an OLAP Cube in SSAS (SQL Server Analysis Services) from SQL Server Data
Creating a data pipeline using Airflow. The pipeline will download podcast episodes and automatically transcribe them using speech recognition. We'll store our results in a SQLite database that we can easily query.
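The storage step alone can be sketched with the standard-library `sqlite3` module. This is an illustration under stated assumptions, not the project's code: the `episodes` table, its columns, the helper functions, and the example URLs are all hypothetical, and the Airflow tasks and speech recognition are left out.

```python
import sqlite3


def save_episode(conn, link, title, transcript=None):
    """Insert an episode, skipping links that are already stored."""
    conn.execute(
        "INSERT OR IGNORE INTO episodes (link, title, transcript) VALUES (?, ?, ?)",
        (link, title, transcript),
    )
    conn.commit()


def untranscribed(conn):
    """Return (link, title) rows that still have no transcript."""
    return conn.execute(
        "SELECT link, title FROM episodes WHERE transcript IS NULL ORDER BY link"
    ).fetchall()


# In-memory database for the example; a real pipeline would use a file path.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE IF NOT EXISTS episodes "
    "(link TEXT PRIMARY KEY, title TEXT, transcript TEXT)"
)

save_episode(conn, "https://example.com/ep1", "Episode 1")
save_episode(conn, "https://example.com/ep2", "Episode 2", "hello world")
# Re-running a download should not create a duplicate row.
save_episode(conn, "https://example.com/ep1", "Episode 1 (duplicate)")

pending = untranscribed(conn)
print(pending)
```

Using the episode link as the primary key with `INSERT OR IGNORE` keeps the load step idempotent, so a scheduled run that sees the same feed twice does not duplicate rows.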