Giter Site home page Giter Site logo

Comments (2)

formiel avatar formiel commented on August 20, 2024

Hi @keloemma,

Question 1, 2, and 3: The goal of the script extract_split_cls.py is to prepare data in the proper format for the fine-tuning script. In particular, it extracts the text, get the labels, and split the training set into train and dev sets since there is no dev set provided separately for this task. So you should modify or create your own script to process your data if your data is different from the CLS dataset.

The format of the output files depend on the task. If you use Hugging Face's transformer library for finetuning, you can check out the details of the output format here if your task is similar to the ones in the GLUE benchmark and prepare your data accordingly.

There is also a "parse.py" script , At which level is it used in the flow of the architecture ?

It is included in the downloaded data but we do not use it in our code.

About the finetuning, what is the utility of the config file ? How is it obtained ?

The configuration file is used to save different training parameters. You can run your experiments with different configurations using the same running command. For more details, you can check out the configuration files in the examples and see the parameters. These parameters are used in the running command.

from flaubert.

schwabdidier avatar schwabdidier commented on August 20, 2024

I assume that you got your answer @keloemma ?

from flaubert.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.