Giter Site home page Giter Site logo

letubert / resumeparser-1 Goto Github PK

View Code? Open in Web Editor NEW

This project forked from omkarpathak/resumeparser

0.0 3.0 0.0 1.45 MB

A simple resume parser used for extracting information from resumes

License: MIT License

Python 83.43% CSS 0.17% HTML 16.41%

resumeparser-1's Introduction

ResumeParser

A simple resume parser used for extracting information from resumes

Installation

  • For extracting text from various documents we use pdfminer and doc2text modules. Install them using:
pip install pdfminer        # python 2
pip install pdfminer.six    # python 3
pip install doc2text
  • For NLP operations we use spacy and nltk. Install them using:
# spaCy
pip install spacy
python -m spacy download en_core_web_sm

# nltk
pip install nltk
python -m nltk nltk.download('words')
  • For extracting other supporting dependencies, execute:
pip install -r requirements.txt
  • Modify skills.csv as per your requirements

  • Modify Education Degrees as per you requirements in constants.py

  • Place all the resumes that you want to parse in resumes/ directory

  • Run resume_parser.py

CLI

For running the resume extractor you can also use the cli provided

usage: cli.py [-h] [-f FILE] [-d DIRECTORY]

optional arguments:
  -h, --help                            show this help message and exit
  -f FILE, --file FILE                  resume file to be extracted
  -d DIRECTORY, --directory DIRECTORY   directory containing all the resumes to be extracted

For extracting data from a single resume file, use

python cli.py -f <resume_file_path>

For extracting data from several resumes, place them in a directory and then execute

python cli.py -d <resume_directory_path>

GUI

  • Django used
  • Easy extraction and interpretation using GUI
  • For running GUI execute:
python manage.py makemigrations
python manage.py migrate
python manage.py runserver
  • Visit 127.0.0.1 to view the GUI

Working:

Working

Result

The module would return a list of dictionary objects with result as follows:

[
    {
        'education': [('BE', '2014')],
        'email': '[email protected]',
        'mobile_number': '8087996634',
        'name': 'Omkar Pathak',
        'skills': [
            'Flask',
            'Django',
            'Mysql',
            'C',
            'Css',
            'Html',
            'Js',
            'Machine learning',
            'C++',
            'Algorithms',
            'Github',
            'Php',
            'Python',
            'Opencv'
        ]
    }
]

To DO

  • Extracting Experience
  • Extracting Projects
  • Extracting hobbies
  • Extracting universities
  • Extracting month of passing
  • Extracting Awards/ Achievements/ Recognition

References that helped me get here

resumeparser-1's People

Contributors

omkarpathak avatar dazzag24 avatar

Watchers

James Cloos avatar Leon avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.