Giter Site home page Giter Site logo

关于voxceleb dino about 3d-speaker HOT 14 CLOSED

JINzezhong7 avatar JINzezhong7 commented on July 30, 2024
关于voxceleb dino

from 3d-speaker.

Comments (14)

yfchenlucky avatar yfchenlucky commented on July 30, 2024

rdino的训练初期eer是超过10%的,继续训练就好。

from 3d-speaker.

JINzezhong7 avatar JINzezhong7 commented on July 30, 2024

但是我第一轮是19%,然后到第9轮还有14%。这个正常吗
image

from 3d-speaker.

JINzezhong7 avatar JINzezhong7 commented on July 30, 2024

我用的是dino,没有用rdino

from 3d-speaker.

yfchenlucky avatar yfchenlucky commented on July 30, 2024

看趋势是正常范围,可以继续训练,一般25epochs之后,EER会降低到5%以下。

from 3d-speaker.

JINzezhong7 avatar JINzezhong7 commented on July 30, 2024

感谢您的回答。我还有两个问题,在mlp那里三层的fc每一层后面都加了bn,这个我没有加影响大吗。还有
image
这里我也没有加,请问这里是在做什么。

from 3d-speaker.

yfchenlucky avatar yfchenlucky commented on July 30, 2024
  1. 前两个fc后最好加入bn,没有严格对比过缺失对训练影响。
  2. 图中支持多卡batchnorm。

from 3d-speaker.

JINzezhong7 avatar JINzezhong7 commented on July 30, 2024

好的谢谢,我发现经过mlp之后得到的是65536dim的向量,然后放入softmax里算分布。为什么最后要选择这么大的一个向量,这是有什么实验证明吗

from 3d-speaker.

yfchenlucky avatar yfchenlucky commented on July 30, 2024

参照DINO原文中实验配置,我们实验发现如果大幅缩小dim维度,性能会大幅降低。

from 3d-speaker.

JINzezhong7 avatar JINzezhong7 commented on July 30, 2024

非常感谢您的回答

from 3d-speaker.

yfchenlucky avatar yfchenlucky commented on July 30, 2024

不客气,期待您进一步的研究。

from 3d-speaker.

JINzezhong7 avatar JINzezhong7 commented on July 30, 2024
image 您好,我目前训到19轮 趋势EER 是在降低,但是很慢,并且我发现loss从14 epoch开始升高,这是因为没有加batchnorm吗在head层

from 3d-speaker.

yfchenlucky avatar yfchenlucky commented on July 30, 2024

是存在可能的,或者是因为你修改代码失误,建议可以先跑完整的源码。

from 3d-speaker.

JINzezhong7 avatar JINzezhong7 commented on July 30, 2024

我刚刚踏入这个领域,还想问一个简单的问题,dino在保存模型的时候是保存student model还是teacher model。最后在测试的时候是用teacher model还是student model进行测试呢

from 3d-speaker.

yfchenlucky avatar yfchenlucky commented on July 30, 2024

都保存,测试使用teacher model,详情见代码哈~

from 3d-speaker.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.