
evgeneus / label-smoothing-in-text-classification


Soft Target and Label Smoothing in Text Classification for Probability Calibration of Output Distributions.

Python 61.59% Jupyter Notebook 38.41%
label-smoothing soft-targets crowd-votes nlp transformer label-distribution machine-learning text-classification document-management calibration

label-smoothing-in-text-classification's Introduction

An Empirical Study on Soft Target and Label Smoothing in Text Classification with Crowd Annotations

Abstract

Recent research on label smoothing has shown that using a label distribution as the training target can enhance both model performance and probability calibration. In the context of crowdsourcing, the empirical distribution over crowd labels, namely the soft target, offers a potentially better distribution than that obtained by label smoothing; however, the effect of soft targets on text classification remains unknown, and how best to leverage crowd labels for creating label distributions remains largely unexplored. This paper introduces two generic soft target methods that can incorporate any label fusion method for noise reduction and presents a systematic evaluation on 13 real-world datasets to understand the effect of both soft targets and label smoothing on text classification. We show that soft targets are a better approach than label smoothing, especially for training well-calibrated models, and that our proposed methods substantially improve model performance and probability calibration across datasets of different noise levels.
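To make the two kinds of training target concrete, here is a minimal sketch that builds both for a single item. It assumes the soft target is the plain normalized vote count and that label smoothing follows the common uniform formulation; the function names and the `epsilon` value are illustrative and are not taken from this repository, and the paper's two proposed soft target methods are not reproduced here.

```python
from collections import Counter


def soft_target(votes, num_classes):
    """Empirical distribution over the crowd labels of one item."""
    counts = Counter(votes)
    total = len(votes)
    return [counts.get(k, 0) / total for k in range(num_classes)]


def label_smoothing(hard_label, num_classes, epsilon=0.1):
    """Uniform label smoothing: (1 - epsilon) * one_hot + epsilon / K."""
    return [
        (1.0 - epsilon) * (1.0 if k == hard_label else 0.0) + epsilon / num_classes
        for k in range(num_classes)
    ]


# Five hypothetical crowd votes on a binary task: four for class 1, one for class 0.
votes = [1, 1, 0, 1, 1]
print(soft_target(votes, num_classes=2))   # [0.2, 0.8]
print(label_smoothing(1, num_classes=2))   # most of the mass stays on class 1
```

The soft target reflects how much the annotators actually disagreed on this item, whereas label smoothing moves the same fixed amount of mass away from the hard label for every item.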

/data/

This folder contains the datasets used in this paper:

  1. Datasets from Figure Eight (from-figure-eight), which come with predefined label distributions and gold test sets annotated by our team.
  2. Datasets with the actual crowd votes (datasets-with-crowd-votes).

More detailed descriptions of the datasets can be found in the README.md of each corresponding folder.

/res/

This folder contains the details of our hyperparameter search and a summary of results, including ECE figures and the F_1, F_01, F_10, ECE, Precision, and Recall scores.
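For readers unfamiliar with ECE (expected calibration error), the following is a minimal sketch of the commonly used binned estimate: predictions are grouped by confidence, and the gap between accuracy and average confidence in each bin is averaged with weights proportional to bin size. The function name, the equal-width binning, and the default of 10 bins are assumptions for illustration; the scripts that produced the results in this folder may compute it differently.

```python
def expected_calibration_error(confidences, correct, n_bins=10):
    """Binned ECE using equal-width confidence bins over [0, 1].

    confidences: predicted probability of the chosen class, one per example.
    correct: truthy if the prediction was right, one per example.
    """
    n = len(confidences)
    bin_conf = [0.0] * n_bins   # sum of confidences per bin
    bin_hits = [0] * n_bins     # number of correct predictions per bin
    bin_size = [0] * n_bins     # number of examples per bin
    for conf, hit in zip(confidences, correct):
        # Bins are [lo, hi); a confidence of exactly 1.0 goes into the last bin.
        b = min(int(conf * n_bins), n_bins - 1)
        bin_conf[b] += conf
        bin_hits[b] += 1 if hit else 0
        bin_size[b] += 1
    ece = 0.0
    for b in range(n_bins):
        if bin_size[b] == 0:
            continue  # empty bins contribute nothing
        avg_conf = bin_conf[b] / bin_size[b]
        accuracy = bin_hits[b] / bin_size[b]
        ece += (bin_size[b] / n) * abs(accuracy - avg_conf)
    return ece


# Four hypothetical predictions made with 90% confidence, three of them correct.
print(expected_calibration_error([0.9, 0.9, 0.9, 0.9], [1, 1, 1, 0]))
```

A lower value means the predicted confidences track the observed accuracy more closely; a perfectly calibrated model would score 0.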

/scr/transformers/

This folder contains the implementation of the DistilBERT model for Soft/Hard/sHard/Label smoothing training.

/scr/nnets/

This folder contains the implementation of a simple neural network for Soft/Hard/sHard/Label smoothing training.
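What these training modes have in common is that the loss compares the model's output with a target distribution rather than a single class index. The framework-free sketch below shows cross-entropy against an arbitrary target distribution; the function names are hypothetical, and the repository's own training code may be structured differently.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def soft_cross_entropy(logits, target):
    """Cross-entropy between a target distribution and softmax(logits)."""
    log_probs = log_softmax(logits)
    return -sum(t * lp for t, lp in zip(target, log_probs))


logits = [0.2, 1.5]  # hypothetical model outputs for a binary task
print(soft_cross_entropy(logits, [0.0, 1.0]))    # hard (one-hot) target
print(soft_cross_entropy(logits, [0.2, 0.8]))    # soft target from crowd votes
print(soft_cross_entropy(logits, [0.05, 0.95]))  # label-smoothed target
```

With a one-hot target the expression reduces to the usual cross-entropy on the labeled class, so the same loss function can serve hard, soft, and smoothed targets; only the target vector changes.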

label-smoothing-in-text-classification's People

Contributors

evgeneus, shuaixiaoqiang
