Giter Site home page Giter Site logo

gogsbread / resumeparser Goto Github PK

View Code? Open in Web Editor NEW
282.0 282.0 156.0 63.39 MB

Resume Parser using rule based approach. Developed using framework provided by GATE

License: GNU Lesser General Public License v3.0

Shell 0.86% Perl 0.62% CSS 0.45% Groovy 0.30% Java 30.76% JavaScript 0.03% Batchfile 0.01% HTML 66.97%

resumeparser's People

Contributors

gogsbread avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

resumeparser's Issues

How to recompile and make changes

I am pretty new to java and working on NLP project using python mostly.

I need your help on recompiling the project and make some changes

Any help from the community will be appreciated

Opening project in Eclipse

Hi,
I am pretty new to Java.
I am trying to open the project in Eclipse by using the option to Import Existing Projects into Workspaces.
I select the root folder of this project and I get 4 project to choose. I choose all of them and I get this error:
Project 'GATE-plugin-Crowd_Sourcing' is missing required Java project: 'GATE' GATE-plugin-Crowd_Sourcing Build path Build Path Problem

I have even tried setting the variables in the environment and project (GATE to the GateFiles folder).
It isn't working.

How to change the ANNIEGazetterFiles

I want to edit the files in ANNIEGazetterFiles i.e. files that contains all the compiled lists for common resume section titles. I tried doing the same using notepad++ but the json output didn't change.

Issue..

java.io.FileNotFoundException: D:\ANNIEResumeParser.gapp (The system cannot find the file specified)
at java.io.FileInputStream.open0(Native Method)
at java.io.FileInputStream.open(FileInputStream.java:195)
at java.io.FileInputStream.(FileInputStream.java:138)
at java.io.FileInputStream.(FileInputStream.java:93)
at sun.net.www.protocol.file.FileURLConnection.connect(FileURLConnection.java:90)
at sun.net.www.protocol.file.FileURLConnection.getInputStream(FileURLConnection.java:188)
at java.net.URL.openStream(URL.java:1038)
at gate.util.persistence.PersistenceManager.isXmlApplicationFile(PersistenceManager.java:1013)
at gate.util.persistence.PersistenceManager.loadObjectFromUrl(PersistenceManager.java:857)
at gate.util.persistence.PersistenceManager.loadObjectFromFile(PersistenceManager.java:831)
at ResumeParser.ResumeTransducer.src.code4goal.antony.resumeparser.Annie.initAnnie(Annie.java:40)
at ResumeParser.ResumeTransducer.src.code4goal.antony.resumeparser.ResumeParserProgram.loadGateAndAnnie(ResumeParserProgram.java:95)
at ResumeParser.ResumeTransducer.src.code4goal.antony.resumeparser.ResumeParserProgram.main(ResumeParserProgram.java:274)

SectionBody question

Hey, I was debugging and looking through the code all day,
And I have a question when find the section header, then how the section body offsets (start and end) are calculated (start offset is the end offset of header section), but what about the end offset is it the start offset of the next section or no and where I can see the logic.
Also another question if we have more than 1 work exp can this logic be extended?
I.e.

  1. 2015-2016 .... doing something
  2. 2013-2015 .. doing something
    Jape rules will catch only of these

[Errno 2] No such file or directory

I'm having issues to use this program in django/python.
I have installed ResumeParser in the Django Project like:

-- Django Project
-- app1
-- app2
-- ResumerParser

Here is my code but it says "No such file or directory".

if form.is_valid():
f = form.save(commit=False)

resume = form.cleaned_data['resume']
f.resume = resume
f.save()

response = subprocess.call(["java -cp 'bin/*:../GATEFiles/lib/*:../GATEFiles/bin/gate.jar:lib/*' code4goal.antony.resumeparser.ResumeParserProgram f.resume.url cv.json"])

Parse Chinese cvs

This is a good project, while it can not deal with Chinese resumes. do you have any ideas how to make full use of Chinese parser to parse Chinese resumes?

i am not able to parse name from resume

Hi ,

i could not able to parse name from application rest all fields are working but name i am not getting.
how can i add extra fields to extract from resume
how can i do with bulk resumes i am new to programing ca you show me that

machine learning

I need to know what exactly and where the machine leaning concept used here in this project by @antonydeepak . Can JAPE and the GATE be called as the machine learning concept??

Error: Could not find or load main class code4goal.antony.resumeparser.ResumeParserProgram

Using Python------------------------------

import os
import subprocess
print(os.getcwd())
cmd = "java -cp 'bin/:../GATEFiles/lib/:../GATEFiles/bin/gate.jar:lib/*' code4goal.antony.resumeparser.ResumeParserProgram %s %s.json" % (r'C:\Data Science\ResumeParser-master\ResumeParser\Data\Input\Synechron_Candidate_Sandip Shinde.docx', r'C:\Data Science\ResumeParser-master\ResumeParser\Data\Output\Sandip.json')
print(cmd)
os.chdir(r"C:\Data Science\ResumeParser-master\ResumeParser\ResumeTransducer")

process = subprocess.Popen(cmd)
out, err = process.communicate()
exitcode = process.returncode
print(out, exitcode)

print ('Ok')

Error: Could not find or load main class code4goal.antony.resumeparser.ResumeParserProgram

gate.util.GateException: couldn't open creole.xml

Hi Antony,
Thanks for the parser, it was working great whenever i process resume directly in the powershell. But
when i import the project(Resume Transducer) in eclipse, and tried to run the project am getting the following error at

Could not reload creole directory file:/D:/ResumeParser/GATEFiles/plugins/ANNIE/
gate.util.GateException: couldn't open creole.xml
at gate.creole.CreoleRegisterImpl.registerDirectories(CreoleRegisterImpl.java:299)
at gate.util.persistence.PersistenceManager.loadObjectFromUrl(PersistenceManager.java:921)
at gate.util.persistence.PersistenceManager.loadObjectFromFile(PersistenceManager.java:841)
at code4goal.antony.resumeparser.Annie.initAnnie(Annie.java:38)
at code4goal.antony.resumeparser.ResumeParserProgram.loadGateAndAnnie(ResumeParserProgram.java:105)
at code4goal.antony.resumeparser.ResumeParserProgram.main(ResumeParserProgram.java:286)

@ the line of the code in Annie.java annieController = (CorpusController) PersistenceManager.loadObjectFromFile(annieGapp);
I don't know much about how to resolve this above error, since i don't have much experience in using external JAR like Annie.

I'm getting this error when run on Linux. Please help

Exception in thread "main" java.lang.NoClassDefFoundError: gate/SimpleAnnotation
at java.lang.Class.getDeclaredMethods0(Native Method)
at java.lang.Class.privateGetDeclaredMethods(Class.java:2625)
at java.lang.Class.getMethod0(Class.java:2866)
at java.lang.Class.getMethod(Class.java:1676)
at sun.launcher.LauncherHelper.getMainMethod(LauncherHelper.java:494)
at sun.launcher.LauncherHelper.checkAndLoadMain(LauncherHelper.java:486)
Caused by: java.lang.ClassNotFoundException: gate.SimpleAnnotation
at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
at java.lang.ClassLoader.loadClass(ClassLoader.java:358)

Output

I'd like to change the output format of the .json file. I cannot seem to locate the filewriter though. Can someone point me in the right direction?

Convert multiple resumes simultaneously

Hello Antony. Thanks for the great script. I wonder if there is way to convert multiple (say 300) resumes at the same time in CLI, rather than one by one? Also, for each resume, the parsed field names do not align (one has education, name, gender and the next has gender, skills, education). Is there a way to solve this or do I have to do it in excel VBA when converting JSON to csv?
Thanks!

Support Other langages (French, German, Spanish, etc.)

Hello,
I've been using the solution to parse English CVs and it's working pretty well.
I'm wondering how is it possible to add other langages support ?
I saw that GATE has plugins.
Can you describe where and how to add these plugins for someone who has no idea about GATE ?

Issue running project

Hi,
I have downloaded and installed the project as stated. On running it am getting below error
C:\ResumeParser\ResumeParser-master\ResumeTransducer>java -cp '.\bin*;..\GATEFi
les\lib*;..\GATEFILES\bin\gate.jar;.\lib*' code4goal.antony.resumeparser.Resum
eParserProgram .\UnitTests\AntonyDeepakThomas.pdf antony_thomas.json

Error: Could not find or load main class code4goal.antony.resumeparser.ResumePar
serProgram

Unable to detect correct Employee Name if in CAPS

Hi,
In the attached resume, the parser is not able to find the correct name if in CAPS. However it works fine, if it is in standard casing.

Earlier, it was detecting VITAE as first name. Added it to stop words list. It stopped detecting it as first name.

Added the full name to FULL Name _CAPS list. Didn't work

Nitin.docx

Resource to write good JAPE grammar/java

Can somebody refer me a good resource/tutorial from where I can learn about writing JAPE grammar and the JAVA associated with it.

I am fairly new to java but I have worked on NLP so I have some idea about writing the grammar but I am stuck on the JAVA part.

Also is there any way I can debug the grammar and JAVA.

Name, educational details, technical skills

I tried this scripts with my resume but it not parsed exactly name, educational details and technical skills, how should I added user defined grammar ? How to write to find out 10 digit mobile number something like this 1234567890

Problem trying to get the program working?

Hi tried running the testing programm and type the following statement into the powershell window

java -cp '.\bin\*;..\GATEFiles\lib\*;..\GATEFILES\bin\gate.jar;.\lib\*'
code4goal.antony.resumeparser.ResumeParserProgram he.txt out.json

However, its not working gave such error statement what went wrong? Please help

Initialising basic system...
log4j:WARN No appenders could be found for logger (gate.Gate).
log4j:WARN Please initialize the log4j system properly.
log4j:WARN See http://logging.apache.org/log4j/1.2/faq.html#noconfig for more info.
...basic system initialised
Initialising processing engine...
...processing engine loaded
Creating doc for file:/C:/Users/User/ResumeParser/ResumeTransducer/he.txt
Running processing engine...
Sad Face :( .Something went wrong.
java.lang.NullPointerException
        at EmailFinder.EmailRule(file:/C:/Users/User/ResumeParser/JAPEGrammars/EmailFinder.jape:16)
        at gate.jape.RightHandSide.transduce(RightHandSide.java:344)
        at gate.jape.SinglePhaseTransducer.fireRule(SinglePhaseTransducer.java:743)
        at gate.jape.SinglePhaseTransducer.transduce(SinglePhaseTransducer.java:354)
        at gate.jape.MultiPhaseTransducer.transduce(MultiPhaseTransducer.java:188)
        at gate.jape.Batch.transduce(Batch.java:204)
        at gate.creole.Transducer.execute(Transducer.java:166)
        at gate.util.Benchmark.executeWithBenchmarking(Benchmark.java:291)
        at gate.creole.ConditionalSerialController.runComponent(ConditionalSerialController.java:163)
        at gate.creole.SerialController.executeImpl(SerialController.java:157)
        at gate.creole.ConditionalSerialAnalyserController.executeImpl(ConditionalSerialAnalyserController.java:244)
        at gate.creole.ConditionalSerialAnalyserController.execute(ConditionalSerialAnalyserController.java:139)
        at code4goal.antony.resumeparser.Annie.execute(Annie.java:53)
        at code4goal.antony.resumeparser.ResumeParserProgram.loadGateAndAnnie(ResumeParserProgram.java:113)
        at code4goal.antony.resumeparser.ResumeParserProgram.main(ResumeParserProgram.java:274)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.