Giter Site home page Giter Site logo

NNLP-IL (National Natural Language Processing plan of Israel)

NNLP-IL is a national initiative for the creation of infrastructure, research and development of advanced capabilities for the advancement of the field of NLP in Hebrew and Arabic.

We know what you're thinking.. (Why in english? 🤦‍♀️) - as for now we have decided english will work best for the NNLP-IL open source community, for more information see NNLP-IL Homepage.

Why Is There a Need for a National Plan?

NLP in Hebrew (and to a lesser extent also in Arabic) is left behind. The major breakthrough that will allow significant use has not yet been made, the cost of fitting and customizing each use case on its own is very high.

The Core Reasons

  • Hebrew and Arabic are difficult languages (rich in morphology), most of the technological development is with morphologically thin languages.
  • Modern language models require vast datasets. The accessible data in Hebrew is very limited.
  • The industry's economic interest in investing in NLP in Hebrew (and to some extent also in Arabic) is limited compared to other common languages, since it is a relatively small market.

Guiding Prinicples

  • Generic framework that will allow fitting and customizing solutions to various applications (without focusing on specific use cases).
  • Open sourced (as much as possible) - Everyone can take part, contribute and use.
  • Break through the data barrier - creating tagged and untagged datasets and make them accessible to the general public.
  • Usability - distributing capabilities through manuals, convenient packaging of code and more.

Who's taking part?

  • You!
  • The Israeli Ministry of Defence Directorate of Defense Research and Development (DDR&D).
  • Israel Innovation Authority.
  • The Ministry of Innovation, Science & Technology.

Active Projects

⭐ Contributing

The main purpose of this repository is to increase the development in Hebrew and Arabic NLP, Making it relevant and easier to use. Read below to learn how you can take part in improving NNLP-IL.

Code of Conduct

Read our Code of Conduct that we expect project participants to adhere to. Please read the full text so that you can understand what actions will and will not be tolerated.

Contributing Guide

Read our Contributing Guide to learn about our development process, how to propose bugfixes and improvements, and how to build and test your changes to NNLP-IL.

License

NNLP-IL is Apache 2.0 licensed.

NNLP-IL's Projects

blog icon blog

Blog of Hebrew NLP news future events and much more.

hebnli icon hebnli

Dataset for NLI tasks in Hebrew.

nnlp-il icon nnlp-il

A national initiative for the creation of infrastructure, research and development of advanced capabilities for the advancement of the field of NLP in Hebrew and Arabic.

parashoot-tagging icon parashoot-tagging

A web-based annotator for closed-domain question answering datasets with SQuAD format.

stop-words-hebrew icon stop-words-hebrew

List of stop words in Hebrew produced by using Universal Dependencies of the The Israeli Association of Human Language Technologies (IAHLT)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.