Giter Site home page Giter Site logo

eth-library-lab / indexda Goto Github PK

View Code? Open in Web Editor NEW
11.0 2.0 2.0 17.18 MB

Natural Language Processing of academic papers for dataset indexing

License: MIT License

Python 99.32% Shell 0.68%
sciencedirect academic-papers natural-language-processing scraper python bert bert-model classification classifier-model

indexda's Introduction


Welcome ๐Ÿ‘‹

Our mission is to help students, researchers and educators unleash their full potential by boosting ideas that support discovering, accessing, using or sharing scientific information and knowledge. See more on our website.



Projects on Github ๐Ÿ’ป ๐Ÿ“š

We are focussed on bringing projects to a prototype or MVP level at which point they can be shared, utilised and seek support for further development.

Some repos will be ready to reproduce and run end-to-end, others may be in the early experimental development stages. If you are interested in discussing any of our projects please get in touch via our contact form.

Not all of our projects are listed here on Github, so please check out our website for a list of all Library Lab projects

Filsat

repo eth-library-lab/filsat
about A transition platform for open source code and online coding tutorials.
tech Stencil, GraphQL, NodeJS, Django
status finished ๐Ÿ

inDexDa

repo eth-library-lab/inDexDa
about Automated identification and indexing of datesets in academic papers using NLP
tech NLP, webscraping, Tensorflow, BERT,
status finished ๐Ÿ

BioDex

repo eth-library-lab/biodex--mobile-app
about Automated species identification of butterflies with a mobile app designed for Natural History Museums
tech React Native, Django
status finished ๐Ÿ

Image Search

repo eth-library-lab/open-image-search
about Image retrieval tool used to search for similar images in an archive and find relevant metadata
tech Vue, Django, Tensorflow, Docker
status active ๐Ÿƒ

herbaria--plant-labeling

repo eth-library-lab/herbaria--plant-labeling
about Image segmentation model for labelling plant structures in herbarium samples in the family Brassicaceae
tech Tensorflow, MaskRCNN
status exploratory ๐Ÿ”ฌ

indexda's People

Contributors

barrysunderland avatar parkerewen5441 avatar peterpeterparker avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

iamlaom

indexda's Issues

Not able to run InDexDa - "FileNotFoundError"

Hi Parker,

Installation was ok, I was able to follow the documentation and checkout/install the project locally, all good ๐Ÿ‘

On the other side, when I try to run the project, with or without tweaking the args.json, I face the following error when I run the command python3 run.py --first_time --scrape --train:

Traceback (most recent call last):
File "run.py", line 48, in
preprocess.processForTrainingBert()
File "/Users/daviddalbusco/projects/lab/datadex/NLP/utils/preprocess.py", line 209, in > processForTrainingBert
self.makeDirs()
File "/Users/daviddalbusco/projects/lab/datadex/NLP/utils/preprocess.py", line 127, in makeDirs
os.mkdir(os.path.join(self.current_dir, '../../data/bert_datatrain/0'))
FileNotFoundError: [Errno 2] No such file or directory: '/Users/daviddalbusco/projects/lab/datadex/NLP/utils/../../data/bert_datatrain/0'

Not rush, definitely something we could try out to solve, if needed, together in January next time we met.

Have a good one
David

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.