Giter Site home page Giter Site logo

Portfolio of Data Science Projects by Marcos Brum

Hi there 👋, welcome to my portfolio. Here you will find links to the Data Science projects I have been working on. The purpose of these projects is to demonstrate my skills in solving business problems using techniques and tools of Data Science.

Marcos Brum

I am a data scientist experienced in developing business solutions, from the understanding of the business problem to interpreting the model results in terms of business value.

Skills:
Artificial Intelligence: Machine/Deep Learning; Quantum Machine Learning; Large Language Models; Natural Language Processing
Science: Physics: Quantum Physics, Relativistic Physics; Mathematics: Functional Analysis, Differential Equations
Programming Languages: Python; JavaScript; C++
Technologies: Qiskit; Git; Docker; LaTeX
Languages: Portuguese, English, German

Contacts

LinkedIn

ResearchGate

Data Science Projects

Rossmann Sales forecast

The stores of the Rossmann drugstore chain need to be restored and the CEO needs to decide how much is going to be dedicated to the restoration of each one. To support this decision, the Analytics team is asked to present a sales forecast for each store during a period of six weeks, alongside with the total income expected in the chain. This forecast also informs the CEO which store is able to account for its own restoration with the income within this period.

The gross expected income of the majority of stores is in the range between R$5000.00 and R$22000.00. The chain is expected to obtain R$289,822,112.00, with best and worst case scenarios of R$290,808,412.17 and R$288,835,860.27, respectively. These scenarios are predicted using statistical errors (mean absolute percentage error).

Insurance Cross-sell

A health insurance company intends to offer its customers a new product, a vehicle insurance. In order to achieve this purpose efficiently, it gathered some information about their customers and asked if they would be interested in purchasing a new vehicle insurance. This information was passed on to a Data Science Consulting office.

The office delivered a report informing, among all features gathered, the most relevant ones and the probability of purchase from each customer. Qualitatively, the predicted probability provides a lift gain of 2.5, thus reducing the sales cost to 40%.

Scientific Papers Classification

ArXiv is a public repository of scientific papers where researchers and students from anywhere in the world can find the latest results in many disciplines ranging from Natural Sciences to Mathematics and Computer Science. The papers are categorized by discipline and (possibly multiple) subdiscipline. The platform also displays each paper's abstract without the need to download the file.

The categorization of a new paper is important for authors to make sure their research will reach the intended audience and for the readers to find the most relevant works in their interest area. Presently the categorization process is the authors' sole responsibility. This fact raises some questions:

  1. Is the category chosen for a research paper the most appropriate?
  2. Is it possible to make a reasonable prediction of a paper's category given only a summary or it's abstract?

In this project we display how a Large Language Model can be leveraged to help classify a scientific paper. We will employ the transfer learning technique using a pretrained transformer model to predict paper's categories.

Marcos Brum's Projects

lnn icon lnn

A `Neural = Symbolic` framework for sound and complete weighted real-value logic

scientific_production icon scientific_production

This repository contains the list of my academic production - scientific papers and Graduate courses lectured.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.