Giter Site home page Giter Site logo

xjtushujun / meta-weight-net Goto Github PK

View Code? Open in Web Editor NEW
274.0 7.0 66.0 2.91 MB

NeurIPS'19: Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting (Pytorch implementation for noisy labels).

License: MIT License

Python 100.00%
meta-learning sample-reweighting noisy-labels class-imbalance

meta-weight-net's People

Contributors

xjtushujun avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

meta-weight-net's Issues

有关GPU的问题

老师你好,请问这个Meta-resnet和Meta-vnet可以用多块GPU吗

how to draw the accuracy curve

Hi,I have a question When I repeated the experiment in your paper. I was not clear how to draw the accuracy curve of the training set.
Thanks.

About the effectiveness

Thank you for your excellent work!

Here, we raised some questions about the effectiveness (time & memory) of Meta-Weight-Net:

Compared to Learn-to-Reweight (Ren, 2017), how about the cost of running time and GPU memory per training step?

Can the training process's time efficiency be improved by updating Meta-Weight-Net every several steps (rather than updating every step)? Will this affect model's performance?

Is it possible to achieve multi-GPU parallelism (based on Pytorch)?

Thanks very much~

About the details of learning rate

There is a sentence in the appendix: "With batch normalization, we effectively cancel the learning rate of Meta-Weight-Net, and it works well with a fixed learning rate. "

I'm not sure what it is about. Would you please give an explanation in detail? Does it mean we don't need to fine-tune the learning rate of meta networks because of BN?

tabular data/ noisy instances

Hi,
thanks for sharing your implementation. I have two questions about it:

  1. Does it also work on tabular data?
  2. Is it possible to identify the noisy instances (return the noisy IDs or the clean set)?

Thanks!

关于模型的问题

首先,非常感谢你们开源这么优秀的工作!

我有一个疑惑,为什么作者重写了整个模型。

基于MetaModule重写了所有卷积,线性和批量归一化的目的是什么呢?为什么不直接使用torch.nn模块里的模型(如,nn.Conv2d),就如同pytorch官方实现的resnet等。

数据集划分

请问实验中用到的cifar-10和-100,是怎么划分训练集或者验证集的呢?文中的meta-learning是用的episodic方式将数据组织成n-way-k-shot的吗?文中的实验结果是多少n,多少k呢?能告知一下吗?

The accuracy of BaseModel is 88.5 when the noise rate is 0.4?

I trained a WRN-28-10 network on cifar-10 with noise rate of 0.4 under uniform noise following the setting in the paper for a total of 40 epochs, but the accuracy of BaseModel is 88.5, which is really high compared with the results in Table 2. I don't know what the problem is.

请问本文的方法和L2RW的比较

文章提到L2RW "might lead to unstable weighting behavior during training and unavailability for generalization"
请问这是为什么?依据是什么?

关于baseline精度问题

请问baseline 的参数设置和其他方法的设置一样吗,为什么baseline在噪声40%的结果偏高呢,大约0.8左右

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.