Giter Site home page Giter Site logo

meyssa / prep_unit_project Goto Github PK

View Code? Open in Web Editor NEW

This project forked from cla-data-science-bootcamp/prep_unit_project

0.0 0.0 0.0 80 KB

This repository contains the project of the preparatory unit

License: MIT License

Python 2.31% Jupyter Notebook 97.69%

prep_unit_project's Introduction

prep_unit_project

Project Overview:

There are many cases when as data scientists we need to find the best curve’s equation that can fit a given dataset or sample. In other words, you have some data points and you want to know what is the curve that can modelise them the best. This is known as the “Curve-fitting problem”.

In this project we will work you through an example of this problem. This project is divided into sections where in each section you will be exercising to use one of the fundamental concepts we saw in this first unit: From OOP, to SQL to Linear Algebra to the final Machine learning pipeline lesson.

You will have enough guidance throughout the project sections and this project will be graded and reviewed by a teacher assistant as well as the chance for you to reach out to the TA via slack whenever you feel you are stuck.

The key is to know that the best way to learn is the hardest! :)

Details:

We will be using a special dataset called “Longley's Economic Regression Data” and we will study the problem step by step:

  1. You will need to create a new environment for your project including all necessary libraries installed in it. Also Keep in mind that it’s always a good practice to use a Jupyter notebook as a tool to experiment and not to write the whole project’s code in one python file. This will add visibility and organization to what you do.

  2. The first section is about fetching your data which is provided to you in the file longley.db in q SQLlite database format, your first mission is to read this database and retrieve your data.

  3. The second section is about managing the data and variable types. Then there is the linear Algebra part where you get to learn how to make a working curve fitting example.

Getting started:

  1. Fork this repository into your Github account. To do so, click on the fork button in the upper right hand corner of a repo page.
  2. Head to the forked version on your github then clone it on your local space.
  3. Open the notebook "experiment.ipynb" and you will find the necessary steps and guidance you need for this project.
  4. Remember: only the sections which are marked as "To-Do" are the ones you should be changing.
  5. Feel free to manage your versions as you want.
  6. Once you finished your project and you are confident about the results, You have to push those details to your forked version of the project The following Git commands should be helpful for you as a reminder : $git add . $git commit -m "commit msg" $git push origin main
  7. Once you're done working on the project, submit the link to you repository in the platform.

prep_unit_project's People

Contributors

zsoumia avatar meyssa avatar affifboudaoud avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.