Giter Site home page Giter Site logo

helloML's Projects

albert_zh icon albert_zh

A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型

bpemb icon bpemb

Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)

c2c-da icon c2c-da

Code for the AAAI-2021 paper: C2C-GenDA: Cluster-to-Cluster Generation for Data Augmentation of Slot Filling

china_area icon china_area

2021年**全国5级行政区划(省、市、县、镇、村)

daga icon daga

Data Augmentation with a Generation Approach for Low-resource Tagging Tasks

hanlp icon hanlp

Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification

kashgari icon kashgari

Simple, Keras-powered multilingual NLP framework, allows you to build your models in 5 minutes for named entity recognition (NER), part-of-speech tagging (PoS) and text classification tasks. Includes BERT and word2vec embedding.

knover icon knover

Large-scale open domain KNOwledge grounded conVERsation system based on PaddlePaddle

latticelstm icon latticelstm

Chinese NER using Lattice LSTM. Code for ACL 2018 paper.

mixtext icon mixtext

MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification

mmsa icon mmsa

CH-SIMS: A Chinese Multimodal Sentiment Analysis Dataset with Fine-grained Annotations of Modality (ACL2020)

negsampling-ner icon negsampling-ner

Negative sampling for solving the unlabeled entity problem in NER. ICLR-2021 paper: Empirical Analysis of Unlabeled Entity Problem in Named Entity Recognition.

polyencoder icon polyencoder

An unofficial implementation of Poly-encoder (Poly-encoders: Transformer Architectures and Pre-training Strategies for Fast and Accurate Multi-sentence Scoring)

qa_match icon qa_match

A simple effective ToolKit for short text matching

revisit-bert-finetuning icon revisit-bert-finetuning

For the code release of our arXiv paper "Revisiting Few-sample BERT Fine-tuning" (https://arxiv.org/abs/2006.05987).

roberta_zh icon roberta_zh

RoBERTa中文预训练模型: RoBERTa for Chinese

sogoumrctoolkit icon sogoumrctoolkit

This toolkit was designed for the fast and efficient development of modern machine comprehension models, including both published models and original prototypes.

stopwords icon stopwords

中文常用停用词表(哈工大停用词表、百度停用词表等)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.