Topic: nlp-datasets Goto Github
Some thing interesting about nlp-datasets
Some thing interesting about nlp-datasets
nlp-datasets,Open Finnish NLP datasets
User: aajanki
Home Page: https://aajanki.github.io/finnish-nlp-datasets/
nlp-datasets,AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/
Organization: afrisenti-semeval
nlp-datasets,A list of Romanian NLP Datasets
User: andythefactory
nlp-datasets,1st Place solution for the SAS | GIM Bitathon, an annual Data Science Hackathon organized by SAS and Goa Institute of Management. The dataset worked on is the subset of the consumer complaints database provided by www.consumerfinance.gov
User: aryashah2k
nlp-datasets,NERO-nlp is a PyPI package for biomedical Named Entity (Recognition) Ontology
User: bohdan-khomtchouk
Home Page: https://pypi.org/project/NERO-nlp
nlp-datasets,Bothub is an open platform for predicting, training and sharing NLP datasets in multiple languages
Organization: bothub-it
Home Page: https://bothub.it
nlp-datasets,Implementation of Very Deep Convolutional Neural Network for Text Classification
User: cjiang2
nlp-datasets, Library for generation of russian names
User: cybermatt
nlp-datasets,📚 A small collection of Russian literature 📚
User: d0rj
nlp-datasets,This repository contains the official code for the paper : Realistic Data Augmentation Framework for Enhancing Tabular Reasoning.
User: dibyakanti
Home Page: https://autotnli.github.io/
nlp-datasets,Generate large textual corpora for almost any language by crawling the web
User: divkakwani
nlp-datasets,Открытые лингвистические датасеты: тональный словарь русского языка КартаСловСент, датасет по семантике, ассоциативный граф и датасет по орфографическим ошибкам и опечаткам.
User: dkulagin
nlp-datasets,A collection of datasets for Ukrainian language
Organization: fido-ai
Home Page: https://fido-ai.github.io/ua-datasets/
nlp-datasets,Extracts Transcript and Summary (Abstractive and Extractive) from the AMI Meeting Corpus
User: gcunhase
nlp-datasets,Comprehensive evaluation framework for Open Information Extraction.
User: gkiril
Home Page: https://aclanthology.org/2022.acl-long.307/
nlp-datasets,a small test dataset for use with OpenAI's ChatGPT
User: gpt-tester
nlp-datasets,UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
Organization: grammarly
Home Page: https://ua-gec-dataset.grammarly.ai/
nlp-datasets,chinese NLP corpus of chinese science fiction,chinese science fiction corpus : About 4675 Chinese science fiction novels 大约有4675本科幻小说,中文科幻小说自然语言处理语料库,中文科幻小说文本语料库,中文科幻小说文本数据库,科幻小说语料
User: guhhhhaa
nlp-datasets,chinese NLP corpus of chinese science fiction, chinese science fiction corpus: Archive of the Ark Plan of Ula Science Fiction Website 乌拉科幻小说网方舟计划存档,中文科幻小说自然语言处理语料库,中文科幻小说文本语料库,中文科幻小说文本数据库,科幻小说语料
User: guhhhhaa
nlp-datasets,multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.
Organization: hellohaptik
Home Page: https://multi-task-nlp.readthedocs.io/en/latest/
nlp-datasets,A Constrained Text Generation Challenge Towards Generative Commonsense Reasoning
Organization: ink-usc
Home Page: http://inklab.usc.edu/CommonGen/
nlp-datasets,RiddleSense: Reasoning about Riddle Questions Featuring Linguistic Creativity and Commonsense Knowledge
Organization: ink-usc
Home Page: https://inklab.usc.edu/RiddleSense/
nlp-datasets,TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition (ACL 2020)
Organization: ink-usc
Home Page: https://arxiv.org/abs/2004.07493
nlp-datasets,Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"
Organization: ink-usc
Home Page: https://inklab.usc.edu/XCSR/
nlp-datasets,Resource NLP & Bahasa
User: irfnrdh
nlp-datasets,A Python library designed for scraping data from the SCP wiki.
User: jadynhax
Home Page: https://pypi.org/project/scpscraper/
nlp-datasets,English loanwords in Japanese
User: jamesohortle
nlp-datasets,An annotated Chinese metaphor dataset
User: jasonshao55
nlp-datasets,The release of the FreebaseQA data set (NAACL 2019).
User: kelvin-jiang
nlp-datasets,a Fine-tuned LLaMA that is Good at Arithmetic Tasks
User: liutiedong
nlp-datasets,The E2E Dataset, packed as a PyTorch DataSet subclass
User: marco-roberti
nlp-datasets,WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000+ "why" question-answer-rationale triplets.
User: matt-seb-ho
nlp-datasets,datasets with text data for use in NLP, Text analysis, information extraction, ML research.
Organization: maxent-ai
nlp-datasets,Official github repository: Battle of the Wordsmiths: Comparing ChatGPT, GPT-4, Claude, and Bard (dataset)
User: mehrdad-dev
nlp-datasets,curated collection of papers for the nlp practitioner 📖👩🔬
User: mihail911
nlp-datasets,Loads OpenSubtitles v2018 dataset without having to load everything into memory at once. Works well with pytorch.
User: minixc
nlp-datasets,A Typed Event-Focused Lexical Inference Benchmark for Evaluating Natural Language Inference
User: mnschmit
nlp-datasets,This project is submitted as python implementation in the contest of Analytics Vidhya called "Identify the Sentiments". I enjoyed the joining of this competition and all its process. This submited solution got the rank 118 in the public leaderboard.
User: mtala3t
Home Page: https://datahack.analyticsvidhya.com/contest/linguipedia-codefest-natural-language-processing-1/
nlp-datasets,Yorùbá language training text for NLP, ASR and TTS tasks
Organization: niger-volta-lti
nlp-datasets,Arabic Dictionaries
User: osintai
nlp-datasets,Code and data for "Summarising Historical Text in Modern Languages" (EACL 2021)
User: pzoom522
nlp-datasets,Chinese, English NER, English-Chinese machine translation dataset. 中英文实体识别数据集,中英文机器翻译数据集, 中文分词数据集
User: quincyliang
nlp-datasets,汉字数据集,包括汉字的相关信息,例如笔画数、部首、拼音、英文释义/同义词等。
User: secsilm
nlp-datasets,Turkish writings dataset that promotes creativity, content, composition, grammar, spelling and punctuation.
User: selimfirat
Home Page: https://stars.bilkent.edu.tr/turkce/
nlp-datasets,The Mueller Report Corpus V 0.1
Organization: semiringinc
nlp-datasets,Simplifying parsing of large jsonline files in NLP Workflows
User: trisongz
Home Page: https://pypi.org/project/pylines/
nlp-datasets,Reading the data from OPIEC - an Open Information Extraction corpus
Organization: uma-pi1
Home Page: https://www.uni-mannheim.de/dws/research/resources/opiec/
nlp-datasets,Implementation of the semi-structured inference model in our ACL 2020 paper, INFOTABS: Inference on Tables as Semi-structured Data.
Organization: utahnlp
Home Page: https://infotabs.github.io/
nlp-datasets,手工整理医疗行业词汇、术语等语料。可用于语音识别、对话系统等各类nlp模型训练。
User: xtea
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.