Comments (4)
Thank you for open-sourcing this work. I am replicating your fine-tuning process using the code on GitHub. Do my results of train_loss=0.16 and eval_loss=0.21 on the 75k dataset match yours? I will continue training on the 110k dataset.
I trained for 4 epochs and indeed started overfitting after the second epoch.
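Since eval loss stopped improving after the second epoch, one simple way to decide which checkpoint to keep is to take the one with the lowest eval loss. A minimal sketch (the loss values below are hypothetical, not from the actual run):

```python
def best_checkpoint(eval_losses):
    """Return the 1-based epoch whose eval loss is lowest."""
    return min(range(len(eval_losses)), key=lambda i: eval_losses[i]) + 1

# Hypothetical per-epoch eval losses: improvement stops after epoch 2.
losses = [0.25, 0.21, 0.23, 0.27]
print(best_checkpoint(losses))  # -> 2
```

With the HuggingFace Trainer this is usually automated via `load_best_model_at_end=True` plus `metric_for_best_model="eval_loss"`, so the manual scan is only needed when inspecting runs after the fact.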
from magicoder.
Magicoder-S-CL.json
Magicoder-CL.json
Magicoder-S-DS.json
Magicoder-DS.json
Hi, here are the trainer states. Hope they can help!
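For anyone comparing their run against these files: assuming they are standard HuggingFace `trainer_state.json` dumps, the train and eval loss curves can be pulled out of `log_history` with stdlib JSON alone. A minimal sketch (the sample state dict below is hypothetical, shaped like a real trainer state):

```python
import json  # needed when loading a real trainer_state.json from disk

def loss_curves(state):
    """Split a Trainer state's log_history into (step, train_loss) and (step, eval_loss) series."""
    history = state["log_history"]
    train = [(e["step"], e["loss"]) for e in history if "loss" in e]
    evals = [(e["step"], e["eval_loss"]) for e in history if "eval_loss" in e]
    return train, evals

# Tiny hypothetical state dict in the same shape as trainer_state.json:
state = {"log_history": [
    {"step": 100, "loss": 0.30},
    {"step": 200, "loss": 0.16},
    {"step": 200, "eval_loss": 0.21},
]}
train, evals = loss_curves(state)
print(train)  # -> [(100, 0.3), (200, 0.16)]
print(evals)  # -> [(200, 0.21)]
```

For the attached files, replace the inline dict with `state = json.load(open("Magicoder-S-DS.json"))` and plot the two series to compare against your own run.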
Thank you very much. My training loss curve is basically the same as yours, and the test results on the HumanEval dataset are basically consistent.
But I have a question: the model was fully fine-tuned on instruction data, so why did its infilling capability increase?
I look forward to hearing from you.
Good to hear you could reproduce it. Yes, we observed that the infilling capability was at least not decreasing. We believe this is because the model learned some general alignment during instruction tuning, and infilling is itself a kind of alignment to the surrounding context. Further study of this phenomenon would be interesting.
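For context on what "infilling" means here: the base models are evaluated with fill-in-the-middle (FIM) prompts, where the model completes a masked middle span given a prefix and suffix. A minimal sketch of assembling such a prompt, assuming Code Llama-style sentinel tokens (`<PRE>`/`<SUF>`/`<MID>`; the exact token names and spacing vary by base model, so treat this format as an assumption to check against the model's tokenizer):

```python
def fim_prompt(prefix, suffix):
    """Assemble a fill-in-the-middle prompt from a code prefix and suffix.

    Sentinel tokens follow the Code Llama convention; other base models
    (e.g. DeepSeek-Coder) use different sentinels, so adjust as needed.
    """
    return f"<PRE> {prefix} <SUF>{suffix} <MID>"

prompt = fim_prompt("def add(a, b):\n    return ", "\n")
print(prompt)
```

The model is then asked to generate the missing middle after `<MID>`; the observation above is that this capability survived instruction tuning even though the tuning data contains no FIM-formatted examples.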