albert_zh

A Lite BERT for Self-supervised Learning of Language Representations; large-scale Chinese pre-trained ALBERT models

amazon-weak-ner-needle

Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data

bpemb

Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)

c2c-da

Code for the AAAI-2021 paper: C2C-GenDA: Cluster-to-Cluster Generation for Data Augmentation of Slot Filling

chinese-text-classification-pytorch

Chinese text classification with TextCNN, TextRNN, FastText, TextRCNN, BiLSTM_Attention, DPCNN, and Transformer, implemented in PyTorch and ready to use out of the box.

daga

Data Augmentation with a Generation Approach for Low-resource Tagging Tasks

data-augmentation-coling2020

Code accompanying Coling2020 publication on data augmentation for named entity recognition

hanlp

Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification

Simple, Keras-powered multilingual NLP framework, allows you to build your models in 5 minutes for named entity recognition (NER), part-of-speech tagging (PoS) and text classification tasks. Includes BERT and word2vec embedding.

knover

Large-scale open domain KNOwledge grounded conVERsation system based on PaddlePaddle

latticelstm

Chinese NER using Lattice LSTM. Code for ACL 2018 paper.

mixtext

MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification

mmsa

CH-SIMS: A Chinese Multimodal Sentiment Analysis Dataset with Fine-grained Annotations of Modality (ACL2020)

negsampling-ner

Negative sampling for solving the unlabeled entity problem in NER. ICLR-2021 paper: Empirical Analysis of Unlabeled Entity Problem in Named Entity Recognition.

neuralnlp-neuralclassifier

An Open-source Neural Hierarchical Multi-label Text Classification Toolkit

nlpaug

Data augmentation for NLP

polyencoder

An unofficial implementation of Poly-encoder (Poly-encoders: Transformer Architectures and Pre-training Strategies for Fast and Accurate Multi-sentence Scoring)

qa_match

A simple, effective toolkit for short text matching

revisit-bert-finetuning

For the code release of our arXiv paper "Revisiting Few-sample BERT Fine-tuning" (https://arxiv.org/abs/2006.05987).

roberta_zh

Chinese pre-trained RoBERTa models: RoBERTa for Chinese

sentence-transformers

Sentence Embeddings with BERT & XLNet

simbert_pytorch

sogoumrctoolkit

This toolkit was designed for the fast and efficient development of modern machine comprehension models, including both published models and original prototypes.

stopwords

Commonly used Chinese stopword lists (Harbin Institute of Technology stopword list, Baidu stopword list, etc.)

tensorflow-examples

TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)

helloML's Projects
