taiyipan / job_finder

A complete deep-learning-powered program that sends you daily job recommendations

License: GNU General Public License v3.0

Python 99.41% Shell 0.59%

job_finder's Introduction

Job Finder AI

Powered by: data mining, web automation, deep learning, TensorFlow, Google's Universal Sentence Encoder

Do you find job hunting in the modern world as tedious as I do? Fear not: here is an AI job finder for you. The AI learns your job preferences, browses Indeed automatically every day, and sends you personalized recommendations for new jobs. Say goodbye to hours of your life lost to chores, and spend only the minimal time needed to send out resumes. Focus on what's important in life. Automate away the chores.

In development, this project is divided into 3 parts.

  1. Data mining and labeling
    1. Uses the Python Selenium API to scrape Indeed.com for data
    2. Labels positive data. This dataset is small.
    3. Mines large amounts of negative data. This dataset is huge.
  2. Train, evaluate, and export deep learning model
    1. Uses the tf.keras API to construct a sequential model
    2. Model stacks Google's Universal Sentence Encoder module with a DNN top-level classifier
    3. Uses class weights to compensate for a highly unbalanced dataset
    4. Trains to maximize val_auc (AUC: area under the curve)
    5. Further optimizes the neural net architecture using Keras Tuner and the Hyperband algorithm to fine-tune hyperparameters
    6. Evaluates based on the confusion matrix (true/false positives/negatives), precision, and recall
    7. Exports the trained model to the SavedModel format
    8. Uses a dedicated computer as the host tf_server
  3. Indeed crawler
    1. Uses Selenium headless mode for background automation tasks
    2. Runs daily to find new jobs in the area around the human user
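
The class-weight step in part 2 can be illustrated with a minimal sketch. It assumes the common "balanced" heuristic, where each class is weighted by n_samples / (n_classes * class_count); the README does not say which formula the project actually uses, and the label counts below are hypothetical.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Return {class: weight} with weight = n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


# Hypothetical label set: 1 = job the user liked, 0 = mined negative example.
labels = [1] * 50 + [0] * 950
class_weight = balanced_class_weights(labels)
print(class_weight)
```

A dictionary of this shape can be passed to tf.keras through `model.fit(..., class_weight=class_weight)`, so that errors on the rare positive class cost more during training.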

In production, the task is shared between 2 computers or virtual machines.

  1. Computer A performs daily web crawling on Indeed.com and collects new jobs in the area.
  2. Computer B acts as the TensorFlow model server.
  3. A collects the input data and sends it to B.
  4. B performs one batched feedforward pass on its neural net and sends the output vector to A.
  5. A interprets the output vector, compiles a report of its findings, and sends it to the human user.
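
The exchange between A and B might look like the sketch below. It assumes B runs TensorFlow Serving and exposes its REST predict endpoint, which the README does not confirm; the host name `computer-b`, the model name `job_finder`, and the helper names are all hypothetical.

```python
import json
import urllib.request

# Hypothetical address of computer B and model name; adjust to the real deployment.
PREDICT_URL = "http://computer-b:8501/v1/models/job_finder:predict"


def build_request_body(descriptions):
    """Encode a batch of job descriptions as one predict request body."""
    return json.dumps({"instances": list(descriptions)}).encode("utf-8")


def parse_probabilities(response_body):
    """Flatten {"predictions": [[p], ...]} into a list of floats, one per job."""
    predictions = json.loads(response_body)["predictions"]
    return [float(row[0]) if isinstance(row, list) else float(row) for row in predictions]


def predict(descriptions, url=PREDICT_URL):
    """Send the whole batch in a single HTTP call and return the probabilities."""
    request = urllib.request.Request(
        url,
        data=build_request_body(descriptions),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return parse_probabilities(response.read())
```

Sending all of the day's job descriptions in one request is what lets B score them in a single batched pass instead of one call per job.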

Figure: A sample email report. Links are provided for easier access to each job page, and jobs are ranked by predicted probability.
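
A report like the one described above could be assembled roughly as follows. This is a sketch, not the project's code: the `build_report` helper, the job dictionary fields, the 0.5 cutoff for inclusion, and the example URLs are all assumptions.

```python
from email.message import EmailMessage


def build_report(jobs, probabilities, threshold=0.5):
    """Return an email listing jobs at or above threshold, highest probability first."""
    ranked = sorted(zip(probabilities, jobs), key=lambda pair: pair[0], reverse=True)
    lines = [
        f"{prob:.0%}  {job['title']}  {job['url']}"
        for prob, job in ranked
        if prob >= threshold
    ]
    message = EmailMessage()
    message["Subject"] = f"Job Finder AI: {len(lines)} new recommendations"
    message.set_content("\n".join(lines) or "No new recommendations today.")
    return message


# Hypothetical crawler output and model scores.
jobs = [
    {"title": "Data Analyst", "url": "https://example.com/job/1"},
    {"title": "ML Engineer", "url": "https://example.com/job/2"},
    {"title": "Cashier", "url": "https://example.com/job/3"},
]
report = build_report(jobs, [0.72, 0.95, 0.08])
print(report.get_content())
```

The resulting `EmailMessage` can be handed to `smtplib.SMTP.send_message` once the sender and recipient headers are filled in.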

Figure: Model metrics evaluated at probability_threshold = 0.5.
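
For reference, thresholded metrics of this kind can be computed as in the sketch below. The labels and probabilities are made-up illustration data, not results from this project, and treating a score of exactly 0.5 as positive is an assumption.

```python
def confusion_counts(labels, probabilities, threshold=0.5):
    """Count true/false positives/negatives after thresholding the probabilities."""
    tp = fp = tn = fn = 0
    for label, prob in zip(labels, probabilities):
        predicted = prob >= threshold
        if predicted and label == 1:
            tp += 1
        elif predicted:
            fp += 1
        elif label == 1:
            fn += 1
        else:
            tn += 1
    return tp, fp, tn, fn


def precision_recall(tp, fp, fn):
    """Precision = TP / (TP + FP); recall = TP / (TP + FN); 0.0 when undefined."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Hypothetical labels and model outputs, not results from this project.
labels = [1, 1, 0, 0, 0, 1]
probabilities = [0.9, 0.4, 0.6, 0.1, 0.2, 0.8]
tp, fp, tn, fn = confusion_counts(labels, probabilities)
precision, recall = precision_recall(tp, fp, fn)
print(tp, fp, tn, fn, precision, recall)
```

With a dataset as unbalanced as this one, precision and recall on the positive class are more informative than plain accuracy, which a model could maximize by rejecting every job.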

Figure: Model architecture. Most of the model is frozen: the Universal Sentence Encoder module comes from tf_hub and is reused through transfer learning.
