Giter Site home page Giter Site logo

sql_eda's Introduction

SQL_EDA

Brazilian E-Commerce Public Dataset by Olist

About Dataset Brazilian E-Commerce Public Dataset by Olist Welcome! This is a Brazilian ecommerce public dataset of orders made at Olist Store. The dataset has information of 100k orders from 2016 to 2018 made at multiple marketplaces in Brazil. Its features allows viewing an order from multiple dimensions: from order status, price, payment and freight performance to customer location, product attributes and finally reviews written by customers. We also released a geolocation dataset that relates Brazilian zip codes to lat/lng coordinates.

This is real commercial data, it has been anonymised, and references to the companies and partners in the review text have been replaced with the names of Game of Thrones great houses.

image

image

Conclusions and Recomendations:

Query 1: Count and % of orders purchased in Jan18 with a 5 review score.

With this query, you can identify the quantity and percentage of orders made in January 2018 that received a review score of 5. Recommendation: This information can be useful for assessing customer satisfaction and the quality of products or services.

Query 2: Customer purchase trend year-on-year.

This query shows the count of delivered orders per month, separated by year, allowing you to analyze the customer's purchase trend over time.

Recommendation: Based on this query, you can identify which months and years had a higher volume of orders and detect possible patterns in customer behavior. This information can be used to adjust marketing strategies, seasonal promotions, or other targeted actions to drive sales.

Query 3: Delivery success rate across states.

This query calculates the delivery success rate by state, providing insights into the effectiveness of the delivery process across different regions.

Recommendation: Based on this query, you can identify states with high delivery success rates and states with lower rates. This can help direct efforts towards improving logistics efficiency in areas performing below average, ensuring a better customer experience and avoiding delivery issues. Additionally, this information can be valuable for negotiations with logistics partners and optimizing the geographical distribution of distribution centers.

Sources: kaggle.com and towardsdatascience.com

sql_eda's People

Contributors

beto-amaral avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.