Giter Site home page Giter Site logo

Running on Office31 about adda HOT 6 OPEN

RayWangWR avatar RayWangWR commented on July 1, 2024
Running on Office31

from adda.

Comments (6)

RayWangWR avatar RayWangWR commented on July 1, 2024

I am adaption from amazon to webcam. During training, the validation accuracy on webcam first increased a little bit, then it began to drop very fast after about 20 iterations.

from adda.

LamForest avatar LamForest commented on July 1, 2024

One possible solution is to use a small lr for target encoder(e.g. 1e-5) and reduce training epoch.
My setting:
ResNet-50 for source and target encoder.
lr for source encoder : 1e-3 this doesnot matter
lr for discriminator: 1e-3
lr for target encoder: 1e-5
trainin epoch : 6

from adda.

LamForest avatar LamForest commented on July 1, 2024

BTW, I use https://github.com/corenel/pytorch-adda not this offical Tensorflow. My result (A->W) is 81% using above setting.

from adda.

RayWangWR avatar RayWangWR commented on July 1, 2024

Hi Tianlin-Gao,

Thanks for your reply. May I know which layer are you matching and do you fix some layers in the source/target encoder to avoid overfitting? Also, can you tell me the number/size of the hidden layer of your discriminator and which optimizer are you using, parameters for the optimizer? Or would you mind sharing your code for adda on Office?

I am sorry that this has been killing me for days. I just want to get adda work and build my model based on it. I would be really thankful if you can provide more information on that implementation.

Thanks,
Rui
[email protected]

from adda.

RayWangWR avatar RayWangWR commented on July 1, 2024

from adda.

LamForest avatar LamForest commented on July 1, 2024

Hi Tianlin-Gao,

Thanks for your reply. May I know which layer are you matching and do you fix some layers in the source/target encoder to avoid overfitting? Also, can you tell me the number/size of the hidden layer of your discriminator and which optimizer are you using, parameters for the optimizer? Or would you mind sharing your code for adda on Office?

I am sorry that this has been killing me for days. I just want to get adda work and build my model based on it. I would be really thankful if you can provide more information on that implementation.

Thanks,
Rui
[email protected]

Sorry for the delay.
I use ResNet50 for source encoder and target encoder. For both of them, layers prior to Res4a are freezed. Target encode is initialized from source encoder. The output of avgpooling is used as the input for discriminator. The discriminator has 2 hidden layer. Each of them has 500 units.
The pytorch code for discriminator is as follow:

input_dims = 2048
hidden_dims = 500
output_dims = 2
         self.layer = nn.Sequential( 
             nn.Linear(input_dims, hidden_dims),
             nn.ReLU(),
             nn.Linear(hidden_dims, hidden_dims),
             nn.ReLU(),
             nn.Linear(hidden_dims, output_dims),
             nn.LogSoftmax()
         )

Discriminator is from scratch. Source encoder is initialized from ImageNet ResNet50

from adda.

Related Issues (18)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.