Giter Site home page Giter Site logo

gender_classification_challenge's Introduction

gender_classification_challenge

##Overview

This is the code for the gender classification challenge for 'Learn Python for Data Science #1' by @Sirajology on YouTube. The code uses the scikit-learn machine learning library to train a decision tree on a small dataset of body metrics (height, width, and shoe size) labeled male or female. Then we can predict the gender of someone given a novel set of body metrics.

##Dependencies

Install missing dependencies using pip

##Usage

Once you have your dependencies installed via pip, run the script in terminal via

python demo.py

##Challenge

Find 3 more classifiers from the sci-kit learn documentation and add them to the demo.py code. Train them on the same dataset and compare their results. You can determine accuracy by trying to predict testing you trained classifier on samples from the training data and see if it correctly classifies it. Push your code repository to github then post it in the comments. I'll give the winner a shoutout a week from now!

##Credits

Credits for some of the code go to chribsen. I've merely created a wrapper to get people started easily.

gender_classification_challenge's People

Contributors

llsourcell avatar rhnvrm avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

gender_classification_challenge's Issues

Running demo.py gives this error :

Running demo.py gives this error :

manu@kali:~$ python demo.py
Traceback (most recent call last):
File "demo.py", line 1, in
from sklearn import tree
ModuleNotFoundError: No module named 'sklearn'

Decision tree with string values

Hey there how are you,

I am new to python and doing PBL learning, I was looking at the challenge with the classifier, and wanted to run the same decision tree but with other classifiers as string values to determine either male or female by Movie title, hair length and food. Here is the code to see if you can help me, and this is the error I get when running it.

Code:

from sklearn import tree

clf = tree.DecisionTreeClassifier()

#[movie, hair, food]
X = [['Black Panther', 'Long', 'Burger'],
['Avengers', 'Short', 'Taco'],
['96', 'Long', 'Pizza'],
['Spiderman', 'Long', 'Burrito'],
['Roma', 'Short', 'Cheesesteak'],
['Greenbreak', 'Long', 'Icecream'],
['A star is Born', 'Long', 'Hot Pocket'],
['The favourite', 'Short', 'Nachos'],
['A quiet place', 'Long', 'Salad'],
['Ant-man', 'Short', 'Pop-corn'],
['Solo', 'Long', 'Chocolate']]

Y = ['male', 'male', 'female', 'female', 'male', 'male', 'female', 'female',
'female', 'male', 'male']

clf = clf.fit(X, Y)

prediction = clf.predict([['Mary Poppins','Long','Candy']])

print(prediction)

Error:

Traceback (most recent call last):
File "D:\Python\demo2.py", line 21, in
clf = clf.fit(X, Y)
File "D:\Python\lib\site-packages\sklearn\tree\tree.py", line 801, in fit
X_idx_sorted=X_idx_sorted)
File "D:\Python\lib\site-packages\sklearn\tree\tree.py", line 116, in fit
X = check_array(X, dtype=DTYPE, accept_sparse="csc")
File "D:\Python\lib\site-packages\sklearn\utils\validation.py", line 527, in check_array
array = np.asarray(array, dtype=dtype, order=order)
File "D:\Python\lib\site-packages\numpy\core\numeric.py", line 538, in asarray
return array(a, dtype, copy=False, order=order)
ValueError: could not convert string to float: 'Black Panther'

capture

Cannot contribute (create pull request or push to origin)

There is a spelling mistake and it is very distracting

instead of 'reusults' it should be 'results'

I wanted to help. So I created a commit.

image

But I'm not able to push it to origin

I tried to create a pull request but the button is disabled
image

image

scipy does not install

C:\LEARN PYTHON\sirajology\gender_classification_challenge>pip install scipy
Collecting scipy
Using cached scipy-0.18.1.tar.gz
Building wheels for collected packages: scipy
Running setup.py bdist_wheel for scipy ... error
Complete output from command c:\python27\python.exe -u -c "import setuptools, tokenize;file='c:\users\florinb\appd
lapack_opt_info:
openblas_lapack_info:
libraries openblas not found in ['c:\python27\lib', 'C:\', 'c:\python27\libs']
NOT AVAILABLE

lapack_mkl_info:
mkl_info:
libraries mkl,vml,guide not found in ['c:\python27\lib', 'C:\', 'c:\python27\libs']
NOT AVAILABLE

NOT AVAILABLE

atlas_3_10_threads_info:
Setting PTATLAS=ATLAS
c:\python27\lib\site-packages\numpy\distutils\system_info.py:639: UserWarning: Specified path C:\projects\windows-wheel-b
warnings.warn('Specified path %s is invalid.' % d)
<class 'numpy.distutils.system_info.atlas_3_10_threads_info'>
NOT AVAILABLE

atlas_3_10_info:
<class 'numpy.distutils.system_info.atlas_3_10_info'>
NOT AVAILABLE

atlas_threads_info:
Setting PTATLAS=ATLAS
<class 'numpy.distutils.system_info.atlas_threads_info'>
NOT AVAILABLE

atlas_info:
<class 'numpy.distutils.system_info.atlas_info'>
NOT AVAILABLE

c:\python27\lib\site-packages\numpy\distutils\system_info.py:1548: UserWarning:
Atlas (http://math-atlas.sourceforge.net/) libraries not found.
Directories to search for the libraries can be specified in the
numpy/distutils/site.cfg file (section [atlas]) or by setting
the ATLAS environment variable.
warnings.warn(AtlasNotFoundError.doc)
lapack_info:
libraries lapack not found in ['c:\python27\lib', 'C:\', 'c:\python27\libs']
NOT AVAILABLE

c:\python27\lib\site-packages\numpy\distutils\system_info.py:1559: UserWarning:
Lapack (http://www.netlib.org/lapack/) libraries not found.
Directories to search for the libraries can be specified in the
numpy/distutils/site.cfg file (section [lapack]) or by setting
the LAPACK environment variable.
warnings.warn(LapackNotFoundError.doc)
lapack_src_info:
NOT AVAILABLE

c:\python27\lib\site-packages\numpy\distutils\system_info.py:1562: UserWarning:
Lapack (http://www.netlib.org/lapack/) sources not found.
Directories to search for the sources can be specified in the
numpy/distutils/site.cfg file (section [lapack_src]) or by setting
the LAPACK_SRC environment variable.
warnings.warn(LapackSrcNotFoundError.doc)
NOT AVAILABLE

Running from scipy source directory.
Traceback (most recent call last):
File "", line 1, in
File "c:\users\florinb\appdata\local\temp\pip-build-gyfha4\scipy\setup.py", line 415, in
setup_package()
File "c:\users\florinb\appdata\local\temp\pip-build-gyfha4\scipy\setup.py", line 411, in setup_package
setup(**metadata)
File "c:\python27\lib\site-packages\numpy\distutils\core.py", line 135, in setup
config = configuration()
File "c:\users\florinb\appdata\local\temp\pip-build-gyfha4\scipy\setup.py", line 335, in configuration
config.add_subpackage('scipy')
File "c:\python27\lib\site-packages\numpy\distutils\misc_util.py", line 1003, in add_subpackage
caller_level = 2)
File "c:\python27\lib\site-packages\numpy\distutils\misc_util.py", line 972, in get_subpackage
caller_level = caller_level + 1)
File "c:\python27\lib\site-packages\numpy\distutils\misc_util.py", line 909, in _get_configuration_from_setup_py
config = setup_module.configuration(*args)
File "scipy\setup.py", line 15, in configuration
config.add_subpackage('linalg')
File "c:\python27\lib\site-packages\numpy\distutils\misc_util.py", line 1003, in add_subpackage
caller_level = 2)
File "c:\python27\lib\site-packages\numpy\distutils\misc_util.py", line 972, in get_subpackage
caller_level = caller_level + 1)
File "c:\python27\lib\site-packages\numpy\distutils\misc_util.py", line 909, in _get_configuration_from_setup_py
config = setup_module.configuration(*args)
File "scipy\linalg\setup.py", line 20, in configuration
raise NotFoundError('no lapack/blas resources found')
numpy.distutils.system_info.NotFoundError: no lapack/blas resources found


Failed building wheel for scipy
Running setup.py clean for scipy
Complete output from command c:\python27\python.exe -u -c "import setuptools, tokenize;file='c:\users\florinb\appd

setup.py clean is not supported, use one of the following instead:

- `git clean -xdf` (cleans all files)
- `git clean -Xdf` (cleans all versioned files, doesn't touch
                    files that aren't checked into the git repo)

Add --force to your command to use it anyway if you must (unsupported).


Failed cleaning build dir for scipy
Failed to build scipy
Installing collected packages: scipy
Running setup.py install for scipy ... error
Complete output from command c:\python27\python.exe -u -c "import setuptools, tokenize;file='c:\users\florinb\ap
ingle-version-externally-managed --compile:

Note: if you need reliable uninstall behavior, then install
with pip instead of using `setup.py install`:

  - `pip install .`       (from a git repo or downloaded source
                           release)
  - `pip install scipy`   (last SciPy release on PyPI)


lapack_opt_info:
openblas_lapack_info:
  libraries openblas not found in ['c:\\python27\\lib', 'C:\\', 'c:\\python27\\libs']
  NOT AVAILABLE

lapack_mkl_info:
mkl_info:
  libraries mkl,vml,guide not found in ['c:\\python27\\lib', 'C:\\', 'c:\\python27\\libs']
  NOT AVAILABLE

  NOT AVAILABLE

atlas_3_10_threads_info:
Setting PTATLAS=ATLAS
c:\python27\lib\site-packages\numpy\distutils\system_info.py:639: UserWarning: Specified path C:\projects\windows-wheel
  warnings.warn('Specified path %s is invalid.' % d)
<class 'numpy.distutils.system_info.atlas_3_10_threads_info'>
  NOT AVAILABLE

atlas_3_10_info:
<class 'numpy.distutils.system_info.atlas_3_10_info'>
  NOT AVAILABLE

atlas_threads_info:
Setting PTATLAS=ATLAS
<class 'numpy.distutils.system_info.atlas_threads_info'>
  NOT AVAILABLE

atlas_info:
<class 'numpy.distutils.system_info.atlas_info'>
  NOT AVAILABLE

c:\python27\lib\site-packages\numpy\distutils\system_info.py:1548: UserWarning:
    Atlas (http://math-atlas.sourceforge.net/) libraries not found.
    Directories to search for the libraries can be specified in the
    numpy/distutils/site.cfg file (section [atlas]) or by setting
    the ATLAS environment variable.
  warnings.warn(AtlasNotFoundError.__doc__)
lapack_info:
  libraries lapack not found in ['c:\\python27\\lib', 'C:\\', 'c:\\python27\\libs']
  NOT AVAILABLE

c:\python27\lib\site-packages\numpy\distutils\system_info.py:1559: UserWarning:
    Lapack (http://www.netlib.org/lapack/) libraries not found.
    Directories to search for the libraries can be specified in the
    numpy/distutils/site.cfg file (section [lapack]) or by setting
    the LAPACK environment variable.
  warnings.warn(LapackNotFoundError.__doc__)
lapack_src_info:
  NOT AVAILABLE

c:\python27\lib\site-packages\numpy\distutils\system_info.py:1562: UserWarning:
    Lapack (http://www.netlib.org/lapack/) sources not found.
    Directories to search for the sources can be specified in the
    numpy/distutils/site.cfg file (section [lapack_src]) or by setting
    the LAPACK_SRC environment variable.
  warnings.warn(LapackSrcNotFoundError.__doc__)
  NOT AVAILABLE

Running from scipy source directory.
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "c:\users\florinb\appdata\local\temp\pip-build-gyfha4\scipy\setup.py", line 415, in <module>
    setup_package()
  File "c:\users\florinb\appdata\local\temp\pip-build-gyfha4\scipy\setup.py", line 411, in setup_package
    setup(**metadata)
  File "c:\python27\lib\site-packages\numpy\distutils\core.py", line 135, in setup
    config = configuration()
  File "c:\users\florinb\appdata\local\temp\pip-build-gyfha4\scipy\setup.py", line 335, in configuration
    config.add_subpackage('scipy')
  File "c:\python27\lib\site-packages\numpy\distutils\misc_util.py", line 1003, in add_subpackage
    caller_level = 2)
  File "c:\python27\lib\site-packages\numpy\distutils\misc_util.py", line 972, in get_subpackage
    caller_level = caller_level + 1)
  File "c:\python27\lib\site-packages\numpy\distutils\misc_util.py", line 909, in _get_configuration_from_setup_py
    config = setup_module.configuration(*args)
  File "scipy\setup.py", line 15, in configuration
    config.add_subpackage('linalg')
  File "c:\python27\lib\site-packages\numpy\distutils\misc_util.py", line 1003, in add_subpackage
    caller_level = 2)
  File "c:\python27\lib\site-packages\numpy\distutils\misc_util.py", line 972, in get_subpackage
    caller_level = caller_level + 1)
  File "c:\python27\lib\site-packages\numpy\distutils\misc_util.py", line 909, in _get_configuration_from_setup_py
    config = setup_module.configuration(*args)
  File "scipy\linalg\setup.py", line 20, in configuration
    raise NotFoundError('no lapack/blas resources found')
numpy.distutils.system_info.NotFoundError: no lapack/blas resources found

----------------------------------------

Command "c:\python27\python.exe -u -c "import setuptools, tokenize;file='c:\users\florinb\appdata\local\temp\pip-
-managed --compile" failed with error code 1 in c:\users\florinb\appdata\local\temp\pip-build-gyfha4\scipy\

ImportError: No module named sklearn

Hi Siraj,

I'm a beginner in machine learning, I'm started learning from your Youtube videos, that way I here, I got

Traceback (most recent call last):
  File "demo.py", line 1, in <module>
    from sklearn import tree
ImportError: No module named sklearn

error when to execute demo.py here is env

screen shot 2018-03-08 at 5 37 49 pm

screen shot 2018-03-08 at 5 41 00 pm

please help me thod fix this

Thanks,
Pavinthan Sivakumar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.