Giter Site home page Giter Site logo

ds-classification-intro-nyc-career-ds-062518's Introduction

Classification

Thus far we have looked at regression and investigating how we can predict a continuous variable. Another category of problems in data science is to classify class membership. For example, we might want to predict whether or not someone has cancer, whether a video is appropriate for children, or what species an animal is. These problems are fundamentally different in their formulation because of the desired outputs.

The simplest case of this is a binary classification of 0 or 1. Typically 0 stands for 'not a member' while 1 stands for 'is a member'.

Here's some of the most important classification algorithms which we'll investigate in further depth in coming lessons:

  • Logistic Regression
    • Calculates the probability of class membership using the sigmoid function. Then assigns class membmership.
  • Decision Trees
    • Split the dataset feature by feature according to which feature will improve the accuracy of classification. For example, those with cholosterol higher than a certain value (to be specified using the tree algorithm) have risk for heart disease, those below that value don't incur a risk. Next, you can go onto another feature in the data set, such as age.
  • Random Forests
    • An ensemble method for combining multiple decision trees.
  • Support Vector Machines
    • Draws a decision plane seperating the classes. Maximizes the distance between datapoints and this plane.

Regression or Classification?

For each of the following scenarios determine whether you would apply a regression or classification algorithm.

Determining a child's future height.

#Regression or Classification?

Determining a child's future career.

#Regression or Classification?

Determining a car's brand.

#Regression or Classification?

Determining a car's year.

#Regression or Classification?

Determining a car's mileage.

#Regression or Classification?

Determining a flower's color.

#Regression or Classification?

Determining a flower's species.

#Regression or Classification?

ds-classification-intro-nyc-career-ds-062518's People

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.