Giter Site home page Giter Site logo

diegoct6 / business-intelligence-data-warehousing Goto Github PK

View Code? Open in Web Editor NEW
10.0 2.0 10.0 12.36 MB

End-to-End BI & DW project: Data Warehousing design and modeling (MySQL), ETL (PDI) and Dashboard (Tableau)

TSQL 100.00%
business-intelligence data-warehouse etl data-integration database tableau visualization pdi mysql

business-intelligence-data-warehousing's Introduction

Business-Intelligence-Data-Warehousing

In this repository, I share an End-to-End BI & DW project done during my studies at IE University:

The dataset is an event log that describes the process of how an incident within the ServiceNow Solution for Incident Management System is reported, processed and managed within an IT company. The data is collected from an audit system for one instance of the ServiceNow platform for Year 2016 (https://www.servicenow.com/products/incident-management.html).

When an incident happens, the IT company needs to correct failures as soon as possible to minimize the impact on normal business operations.

The business objective is to help the IT company reduce the completion time for incident resolution (Ticket Completion Time). The analytic method investigates into the repetitive incident categories, resolution steps, and task overload during the processes.

This project was made possible with the help of my colleagues Nalisha Men and Paula San Roman. The original dataset can be downloaded from here: https://archive.ics.uci.edu/ml/datasets/Incident+management+process+enriched+event+log

1. Data Warehouse Modeling

The goal here is to present a design of a data warehouse/data mart for an IT Operations Log System. The main executed task were:

A. Dataset analysis:

  • Analyze the dataset: Review the fields of the dataset and its definition, understand the meaning of all of them and define user requirements considering the context of the dataset.
  • Detect potential data quality issues such as: missing values, incomplete values, errors. Make a decision for each one of these data quality issues.
  • Detect duplicates.

B. Data Warehouse approach selection: Select between using a corporate information, multidimensional approach or data vault approach.

C. Data Warehouse design: Identify the entities based on the selected approach (fact tables, dimension tables, and metrics). Implement the design with MySQL Workbench as we did during the sessions.

Attached you will find the full Data Modeling report and the designed DW Schema.

Tools used:

  • Database: MySQL
  • Database Modeling: MySQL Workbench

DW Star Schema

2. Data Integration Design

This second part is focused on Data Integration. It consists on designing an one-time historic data load using an ETL (Extraction, Transformation, Loading) tool. That means create an ETL process for each dimension and fact table.

Deliverables:

  • Data Mapping Report: this document describes the default strategy for extracting data from the source, data mapping process, data quality tracking, and metadata approach.
  • ETL scripts: The collection of ETL processes created with Pentaho Data Integration.
  • ETL Instructions: How to execute the entire data loading process.
  • Database backup: after the ETL process has been executed with success.

Tools used:

Database: MySQL ETL Tool: Pentaho Data Integration (ktr format)

Data Mapping Process

3. Incident Management System Dashboard

Who will use the dashboard?

  • IT Operations Manager: Responsible against the CIO for the Incident Management Platform and groups executing the job.

What is the decision-making objective of this dashboard? Understand the performance of the IT Service Team on a time frame, being able to identify:

  • What are the main categories associated with the incidents?
  • Is the team complying with the standard SLA?
  • Is the team following the guides and standard processes to solve the inicidents?
  • Is the team validating the priority level, specially for critical incidents?
  • Are the incidents being assigned correctly?
  • How does the reassignment count affect the average time to close an incident?
  • What are the low performing groups? What is driving this situation?

What type of content will appear in the dashboard?

  • Main Incident Management KPIs to optimize
  • Graphs comparing the composition of the KPIs.

What level of data aggregation will be required by the audience?

  • Basic level of excel and graph interpretation.
  • Capacity to filter the evaluation time (Open date of the ticket)

Tools used:

  • Tableau

IMS Dashboard

business-intelligence-data-warehousing's People

Contributors

diegoct6 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.