How to train mpscl from scratch (CLOSED)

tfboys-lzz commented on May 24, 2024
How to train from scratch?

from mpscl.

Comments (9)

TFboys-lzz commented on May 24, 2024

Yes, you are right, but don't forget to compute the initial category prototypes based on the pretrained model.

xiro2 commented on May 24, 2024

> Yes, you are right, but don't forget to compute the initial category prototypes based on the pretrained model.

I used the mr_ct prototypes provided with your pretrained model. In the first iteration I set the category prototype coefficient m to zero, and in the following iterations I set it according to your yml file. Is that OK?
My experiments have 5 classes, the same as your MMWHS dataset.

TFboys-lzz commented on May 24, 2024

This is not recommended. Datasets differ in distribution, so prototypes built from my pretrained model may not suit your dataset. The best approach is to pretrain the model on your own dataset, which won't take much time.

xiro2 commented on May 24, 2024

> This is not recommended. Datasets differ in distribution, so prototypes built from my pretrained model may not suit your dataset. The best approach is to pretrain the model on your own dataset, which won't take much time.

Even if I set the coefficient m=0 in the first epoch? I thought that setting CLASS_CENTER_M=0 amounts to an initialization step.

CLASS_CENTER_M is on line 19 of scripts/configs/MPSCL_MR2CT.yml

My English is not good, so I may not have expressed this accurately. What I mean is that I set CLASS_CENTER_M to 0 in the first iteration, and only from the second iteration onward does CLASS_CENTER_M follow the value in your config file.

TFboys-lzz commented on May 24, 2024

Sorry, I'm not quite sure what you mean by "the first iteration" here. Do you mean the first iteration of the warmup stage?

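xiro2 commented on May 24, 2024

No. What I mean is that I first train a model with the warmup_mr2ct.yml config, and then load that model when training with the mpscl_mr2ct.yml config.
What I changed: I modified your train_uda.py so that CLASS_CENTER_M is 0 in the first iteration, and only from the second iteration onward does it take the value set in your mpscl_mr2ct.yml. CLASS_CENTER_M is the m parameter on line 125 of your train_uda.py.
Does this count as initializing the prototypes?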
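
For readers following along, here is a minimal sketch of what that change does, in plain Python. It assumes the prototype update is a moving average of the form `m * old + (1 - m) * batch_mean`, which matches the description in this thread but is not copied from train_uda.py. The names `ema_update` and `momentum_for_iteration` and the value 0.9 are hypothetical placeholders, not identifiers or settings from the repository.

```python
def ema_update(prototype, batch_mean, m):
    """Blend the stored prototype with the current batch mean.

    Assumed form: m * old + (1 - m) * batch_mean.
    """
    return [m * p + (1.0 - m) * b for p, b in zip(prototype, batch_mean)]


def momentum_for_iteration(i_iter, configured_m):
    """Hypothetical schedule: m = 0 on the first iteration, configured value afterwards."""
    return 0.0 if i_iter == 0 else configured_m


# Stale prototype for one class, e.g. loaded from a model trained on other data.
prototype = [5.0, -3.0]
# Per-batch mean feature of that class (made-up numbers for illustration).
batch_means = [[1.0, 1.0], [2.0, 0.0]]
configured_m = 0.9  # placeholder, not the value from the yml file

history = []
for i_iter, batch_mean in enumerate(batch_means):
    m = momentum_for_iteration(i_iter, configured_m)
    prototype = ema_update(prototype, batch_mean, m)
    history.append(prototype)
```

Under this assumption, m = 0 discards the stored prototype and replaces it with the mean of the first batch, so the starting point depends entirely on that one batch.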

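TFboys-lzz commented on May 24, 2024

Oh, I see. That does count as initialization, but I wouldn't recommend it. Prototypes initialized this way come from a single batch of data, so they may be fairly noisy and inaccurate. I suggest running one forward pass over the source domain and computing the corresponding prototypes.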
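
The suggestion above could be sketched roughly as follows, in plain Python. This is an illustration under stated assumptions rather than the repository's code: `compute_class_prototypes` is a hypothetical helper, and the `(feature, label)` pairs stand in for features that would in practice come from a forward pass of the warmup model over the whole source set.

```python
def compute_class_prototypes(samples, num_classes, feat_dim):
    """Average the feature vectors of each class over all given samples.

    `samples` is an iterable of (feature_vector, label) pairs. In a real
    pipeline these would be produced by the pretrained model on source data.
    """
    sums = [[0.0] * feat_dim for _ in range(num_classes)]
    counts = [0] * num_classes
    for feature, label in samples:
        counts[label] += 1
        for d in range(feat_dim):
            sums[label][d] += feature[d]
    # A class that never appears keeps a zero vector instead of dividing by zero.
    return [
        [s / counts[c] for s in sums[c]] if counts[c] > 0 else sums[c]
        for c in range(num_classes)
    ]


# Toy source-domain features (made-up numbers for illustration).
source_features = [
    ([1.0, 0.0], 0),
    ([3.0, 2.0], 0),
    ([0.0, 4.0], 1),
]
prototypes = compute_class_prototypes(source_features, num_classes=3, feat_dim=2)
```

Because every source sample contributes to its class mean, the resulting prototypes are less sensitive to the contents of any single batch.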

xiro2 commented on May 24, 2024

OK, I know what to do now. Thank you so much. I'm very glad to have come across an author like you. Respect!

TFboys-lzz commented on May 24, 2024

Glad I could help! Best wishes!
