Giter Site home page Giter Site logo

Comments (4)

zRzRzRzRzRzRzR avatar zRzRzRzRzRzRzR commented on August 10, 2024

是,所有的output都是计算loss的,但是关于对话排列中,您的图好像不对
例如,对于输入 ,模型生成 ,计算生成的 与实际的 的损失。
再比如,输入 ... ,模型生成新的 ,计算生成的新的 与实际新的 的损失。而之前所有的内容都是input

from glm-4.

RyanOvO avatar RyanOvO commented on August 10, 2024

是,所有的output都是计算loss的,但是关于对话排列中,您的图好像不对 例如,对于输入 ,模型生成 ,计算生成的 与实际的 的损失。 再比如,输入 ... ,模型生成新的 ,计算生成的新的 与实际新的 的损失。而之前所有的内容都是input

是这样的么?第一种:

image

还是这样的?第二种:

image

看代码真没看出来,前面的Input+output,怎么又变成了新的Input;即你说的之前所有的内容都是Input。
@zRzRzRzRzRzRzR

from glm-4.

zRzRzRzRzRzRzR avatar zRzRzRzRzRzRzR commented on August 10, 2024

是第一种,第二种是没办法训练多轮对话的呀

from glm-4.

RyanOvO avatar RyanOvO commented on August 10, 2024

好的,清楚了。第一种的弊端就是比较冗余且性能低一些。
不过对于第二张Loss计算方式是可以的。在Firefly与Xtuner两个微调框架中都支持了。可参考文档:
https://github.com/InternLM/xtuner/blob/main/docs/zh_cn/user_guides/dataset_format.md
@zRzRzRzRzRzRzR

from glm-4.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.