jimoh1993 / um6p-sci-data-science-lab-project-data-distribution-stat-tests

Jupyter Notebook 100.00%
normaldistribution binomial-distribution pearson-correlation chi-square-test binomial-distributions bernoulli-distribution data-science-lab

UM6P-SCI-Data-Science-Lab-Project-Data-Distribution-Stat-Tests

Data Distribution

A data distribution is a function that lists all possible values the data can take. It can be continuous or discrete. Several standard probability distributions provide the probabilities of the different possible outcomes of an experiment. The challenge is that real-world data may not follow any well-known probability distribution. In that case, we can approximate the most probable distribution and check its goodness of fit.
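As a rough illustration of approximating a distribution and checking its goodness of fit, the sketch below compares two candidate distributions by the Kolmogorov-Smirnov distance (the largest gap between the empirical CDF and the candidate CDF). The synthetic data, the two candidates, and the helper `ks_distance` are assumptions made for this example and are not taken from the project's notebooks. Only the Python standard library is used.

```python
import math
import random
from statistics import NormalDist, fmean


def ks_distance(sample, cdf):
    """Largest gap between the empirical CDF of `sample` and a candidate `cdf`."""
    xs = sorted(sample)
    n = len(xs)
    gap = 0.0
    for i, x in enumerate(xs):
        f = cdf(x)
        # The empirical CDF jumps from i/n to (i+1)/n at x, so check both sides.
        gap = max(gap, abs(f - i / n), abs((i + 1) / n - f))
    return gap


random.seed(0)
# Synthetic stand-in for real-world data (assumption for this example).
data = [random.gauss(50.0, 5.0) for _ in range(500)]

# Candidate 1: normal distribution with parameters estimated from the sample.
normal = NormalDist.from_samples(data)

# Candidate 2: exponential distribution with rate estimated as 1 / mean.
rate = 1.0 / fmean(data)


def exponential_cdf(x):
    return 1.0 - math.exp(-rate * x) if x > 0 else 0.0


distances = {
    "normal": ks_distance(data, normal.cdf),
    "exponential": ks_distance(data, exponential_cdf),
}
best = min(distances, key=distances.get)
print(distances, "->", best)
```

A smaller distance means a closer fit. Because the parameters are estimated from the same sample, the distance is best read as a relative ranking of candidates rather than as a formal test statistic.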

Advantages of knowing the underlying Probability Distribution of Data

Good practice

Many algorithms, such as linear regression, assume that variables follow a particular distribution. The cost of not meeting these assumptions can be high.

Attaching a confidence interval

Knowing the underlying probability distribution, we can find its probability density function. This helps us attach confidence intervals to the range of values the data is likely to take.

Tracking how the distribution changes over time or during special events/seasons

A distribution has parameters. With these parameters, we can keep track of how the distribution has changed over time or during a particular season or event.

Well-known statistical properties

The standard probability distributions have well-known statistical properties that simplify the job for us. We can explain the data and its behavior with just a few parameters.
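Two of the points above (attaching an interval and tracking parameters across periods) can be sketched with the standard library's `statistics.NormalDist`. The two synthetic samples, the period labels, and the helper `central_interval` are hypothetical and chosen only for illustration. The sketch also assumes a normal distribution is an adequate fit for the data.

```python
import random
from statistics import NormalDist

random.seed(1)
# Hypothetical measurements for two periods (synthetic data, not from the project).
regular = [random.gauss(100.0, 10.0) for _ in range(1000)]
holiday = [random.gauss(120.0, 15.0) for _ in range(1000)]


def central_interval(dist, coverage=0.95):
    """Range that contains `coverage` of the probability mass of `dist`."""
    tail = (1.0 - coverage) / 2.0
    return dist.inv_cdf(tail), dist.inv_cdf(1.0 - tail)


for label, sample in [("regular", regular), ("holiday", holiday)]:
    # Two parameters (mean and standard deviation) summarize each period.
    fitted = NormalDist.from_samples(sample)
    low, high = central_interval(fitted)
    print(f"{label}: mean={fitted.mean:.1f} sd={fitted.stdev:.1f} "
          f"95% range=({low:.1f}, {high:.1f})")
```

Comparing the fitted mean and standard deviation between the two periods shows how the distribution shifted, without comparing the raw samples point by point.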

The following data distributions and statistical tests are implemented in this project:

  1. The Normal Distribution
  2. The Shapiro-Wilk Normality Test
  3. Pearson's Correlation Test
  4. The Chi-Squared Test
  5. Student's t-test
  6. The Bernoulli Distribution
  7. The Binomial Distribution
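For orientation, the listed distributions and tests could each be run with a single call, assuming NumPy and SciPy are installed. The sketch below is not the project's notebook code: the data is synthetic, and the variable names and the contingency table are made up for the example.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(42)
a = rng.normal(loc=0.0, scale=1.0, size=200)  # synthetic sample, mean 0
b = rng.normal(loc=1.0, scale=1.0, size=200)  # synthetic sample, mean 1

# Normal distribution: cumulative probability below 1.96 standard deviations.
p_below = stats.norm.cdf(1.96)

# Shapiro-Wilk: the null hypothesis is that the sample is normally distributed.
shapiro_stat, shapiro_p = stats.shapiro(a)

# Pearson's correlation: strength of the linear relationship between two variables.
y = 2.0 * a + rng.normal(size=200)
r, pearson_p = stats.pearsonr(a, y)

# Chi-squared test of independence on a 2x2 contingency table of counts.
table = np.array([[10, 20], [20, 10]])
chi2, chi2_p, dof, expected = stats.chi2_contingency(table)

# Student's t-test: do two independent samples share the same mean?
t_stat, t_p = stats.ttest_ind(a, b)

# Bernoulli and Binomial probability mass functions.
p_success = stats.bernoulli.pmf(1, 0.3)       # P(X = 1) when p = 0.3
p_three_of_ten = stats.binom.pmf(3, 10, 0.5)  # P(3 successes in 10 trials), p = 0.5

print(f"Shapiro-Wilk p={shapiro_p:.3f}, Pearson r={r:.3f}, "
      f"chi-squared p={chi2_p:.3f}, t-test p={t_p:.3g}")
```

For each test, a small p-value (commonly below 0.05) is evidence against the null hypothesis: normality for Shapiro-Wilk, no linear correlation for Pearson, independence for chi-squared, and equal means for the t-test.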
