azizaghabayli / expedia-analytics-project

Personal project for Data Engineering Zoomcamp

Home Page: https://github.com/DataTalksClub/data-engineering-zoomcamp

big-query google-cloud-platform google-cloud-storage python dbt docker looker-studio mage-ai spark

Expedia Analytics ELT

Objective

The ExpediaAnalytics project analyzes and visualizes hotel price trends in the tourism sector using the Expedia Hotel Dataset. Its end-to-end data pipeline is intended to provide insight into hotel pricing strategies, demand fluctuations, and market competitiveness.

Architecture

[Diagram to be added]

The project architecture encompasses the following components:

  • Data ingestion from Kaggle to Google Cloud Storage (GCS) as the data lake.
  • Data processing and transformation using dbt in BigQuery.
  • Workflow orchestration with Mage to manage the data pipeline.
  • Visualization of insights through a dashboard in Google Looker Studio.
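
The ingestion step in the first bullet could be sketched as follows. This is a minimal sketch, not the project's actual code: it assumes the Kaggle files have already been downloaded locally, that the `google-cloud-storage` package is installed, and that GCP credentials are configured. The function names, the `raw/expedia` prefix, and the bucket name are hypothetical.

```python
from pathlib import Path


def gcs_object_name(local_file, prefix="raw/expedia"):
    """Build the object name under which a local file is stored in the bucket.

    The "raw/expedia" prefix is a hypothetical layout for the data lake.
    """
    return f"{prefix.strip('/')}/{Path(local_file).name}"


def upload_to_gcs(local_file, bucket_name, prefix="raw/expedia"):
    """Upload one local file to GCS and return its gs:// URI."""
    # Imported lazily so the path helper above works without the GCP client.
    from google.cloud import storage

    client = storage.Client()  # uses application default credentials
    blob = client.bucket(bucket_name).blob(gcs_object_name(local_file, prefix))
    blob.upload_from_filename(str(local_file))
    return f"gs://{bucket_name}/{blob.name}"
```

A call such as `upload_to_gcs("data/train.csv", "my-expedia-bucket")` would then place the file under the `raw/expedia/` prefix of that (hypothetical) bucket.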

Technologies and Tools

This project utilizes a range of technologies and tools, including:

  • Google Cloud Platform (GCP) for cloud storage, data warehousing, and computing resources.
  • Google Cloud Storage (GCS) as the data lake for raw data storage.
  • BigQuery for data warehousing and SQL-based transformations.
  • dbt (data build tool) for data transformation within BigQuery.
  • Mage for workflow orchestration across the data pipeline.
  • Google Looker Studio for dashboard creation and data visualization.
  • Pipenv for Python dependency management and virtual environment creation.

Installation

To get started with the ExpediaAnalytics project, follow these steps:

  1. Clone the repository.
  2. Install dependencies using Pipenv:
    pipenv install
    Note: This step will be updated with specific dependencies as the project progresses.
  3. Set up a GCP project and configure GCS and BigQuery services according to your setup.
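
One way to keep the GCP setup from step 3 out of the code is to read it from environment variables. The sketch below is an assumption about how that could look, not the project's configuration: the variable names `GCP_PROJECT_ID`, `GCS_BUCKET`, and `BQ_DATASET` and the `GcpSettings` class are hypothetical.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class GcpSettings:
    project_id: str
    bucket: str
    dataset: str


# Hypothetical environment variable names, one per setting.
REQUIRED_VARS = {
    "project_id": "GCP_PROJECT_ID",
    "bucket": "GCS_BUCKET",
    "dataset": "BQ_DATASET",
}


def load_settings(env=None):
    """Read GCP settings from a mapping (defaults to os.environ)."""
    env = os.environ if env is None else env
    missing = [var for var in REQUIRED_VARS.values() if not env.get(var)]
    if missing:
        # Fail early with a clear message instead of failing mid-pipeline.
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return GcpSettings(**{field: env[var] for field, var in REQUIRED_VARS.items()})
```

Passing a plain dictionary instead of `os.environ` makes the loader easy to exercise without touching the real environment.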

Usage

This section will be updated with detailed instructions on running the data pipeline, executing dbt models, orchestrating workflows with Mage, and accessing the dashboard in Google Looker Studio. Steps will include:

  • Data ingestion commands or scripts.
  • dbt run commands for data transformation.
  • Mage setup and execution steps.
  • Instructions to access and interact with the Looker Studio dashboard.
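
Until those instructions are written, the dbt step could be driven from Python roughly as shown here. This is a hedged sketch that assumes the `dbt` CLI is installed and on the `PATH`; the helper names and the example project directory are hypothetical, while `--project-dir` and `--select` are standard `dbt run` options.

```python
import subprocess


def dbt_run_command(project_dir, select=None):
    """Build the argument list for `dbt run` against a dbt project directory."""
    cmd = ["dbt", "run", "--project-dir", str(project_dir)]
    if select:
        # Limit the run to the selected models.
        cmd += ["--select", select]
    return cmd


def run_dbt(project_dir, select=None):
    """Execute `dbt run`; raises CalledProcessError if dbt exits non-zero."""
    return subprocess.run(dbt_run_command(project_dir, select), check=True)
```

Keeping command construction separate from execution lets the arguments be checked without invoking dbt, for example `dbt_run_command("dbt_project", select="staging")`.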

Dashboard

[Dashboard to be added]

License

This project is licensed under the MIT License.
