Giter Site home page Giter Site logo

stats_bg's Introduction

It's me, Guilherme Noronha

Data Engineer | Python Developer


  • 📊 Working as a Data Engineer at the moment.
  • 🌱 Learning about Data Quality

Career

📊 Data Engineer - Five Acts (07/2022 - 12/2023)

📊 Data Engineer Professor - IGTI (12/2020 - 11/2022)

📊 Computer Science Professor - IFMG (08/2019 - 12/2019)

📊 Information Science Professor - UFMG (2016; 2017; 2020; 2021)

📊 Software Developer - Sisloc Softwares (10/2011 - 02/2014)

Education

👨‍🎓 Ph.D. in Knowledge Organization and Management - UFMG (2018-2022)

👨‍🎓 Master in Knowledge Organization and Management - UFMG (2015-2017)

👨‍🎓 Bachelor in Computer Science - PUC Minas (2008 - 2012)

Publications

📃 PROCESSAMENTO DE LINGUAGEM NATURAL PARA ANÁLISE DE SENTIMENTO UTILIZANDO PYTHON - Tecnologias utilizadas em pesquisas acadêmicas em Ciências Sociais Aplicadas

📃 ENSINO SUPERIOR DURANTE A PANDEMIA: UMA EXPERIÊNCIA DOCENTE EM SISTEMAS DE RECUPERAÇÃO DA INFORMAÇÃO - Informação & Informação

📃 Representação lógica do conhecimento para ontologias - Fronteiras da Representação do Conhecimento

📃 Análise gramatical de etiquetas em sites colaborativos: estudo de caso Flickr - Comunicação & Informação

📃 DESIDENTIFICAÇÃO DE DOCUMENTOS CLÍNICOS PARA PESQUISA BIOMÉDICA - REPRESENTAÇÃO DO CONHECIMENTO, ONTOLOGIAS E LINGUAGEM: pesquisa aplicada em Ciência da Informação

📃 Using Adaptive Automata as a Phonetic Routine Prototype - IEEE Latin America Transactions

visitors

Tools

Languages

python   sql   bash

Databases

postgresql   mysql   sqlite   mongodb

Frameworks

airflow   dbt

Packages

pandas   scikit_learn   selenium

OS

ubuntu   windows  

Other

aws   aws  

stats_bg's People

Contributors

guilhermenoronha avatar

Watchers

 avatar

stats_bg's Issues

Add last_date on persons and games table

Add a last_date on persons table to indicate which was the last time a person attended to a game. Do the same on games table to indicate what was the last time a game was played.

Create boardgame taxonomy tables

Create 4 tables using Ludopedia Scrapper:

  • BG_DOMAINS: with game domain
  • BG_THEMES: with game themes
  • BG_CATEGORIES: with game cateogories
  • BG_MECHANICS: with game mechanics

Each game may have multiples taxonomies in each table.

Matches table is created wrongly

The game_id on matches table is, in fact, showing the game_owner_id. The correct would to show the unique id from every game instead.

Add data tests

Use great expectations or similar to guarantee data quality

Create a Ludopedia Scrapper

Create a scrapper for ludopedia to get user's collection and board games metadata to enhance the data analysis

Implement database to support multple sessions in the same day

Actually, the database doesn't support multiple sessions in the same day.

Example:

Player 1 hosted a game in his house.
Player 2 hosted a game in his house.

The attendances and matches databases should distinct all the matches and attendances for this day as separated ones.

Add column game_details in stats_bg sheets

The matches sheet in stats_bg must have a game_details column to provide more details about the matches. Below are some examples:

  • Register different abilities used by the players
  • Register the main strategy used by the players
  • Register the game mode played
  • Etc.

Use weighted average on players' ranking

Players ranking should be an average weight which encompass the game weight and the number of opponents in a match:

$$ R = \frac{ \sum_{i=1}^n (gameWeight_i * numPlayers_i * playerRank) } { \sum_{i=1}^n (gameWeight_i * numPlayers_i) } $$

Add KPIs

Implement following KPIs to be loaded on dataviz (subject to changes)

KPIs (Board game):

  • How many board games we have?
  • How distributed the board games are (per member)?
  • Which are the mechanic most popular?
  • Which are the category most popular?
  • Which are the theme most popular?
  • What is the average weight of the board games?
  • What is the recommended number of players (min, max, avg)?

KPIs (Session)

  • How many sessions we had?
  • What is the average games per session?
  • What is the average players per session?
  • How is the interchanging players between sessions?
  • How is the interchanging hosts between sessions?
  • How long a member showed up last time?
  • How long a game was played last time?

Add Awards KPIs

Add the following KPIs on DBT:

  • Host of the year
  • Addicted of the year
  • Group of the year
  • Game of the ear
  • Explorer of the year
  • category of the year
  • mechanic of the year
  • theme of the year
  • strategist of the year
  • sommelier of the year

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.