Comments (6)
I am adaption from amazon to webcam. During training, the validation accuracy on webcam first increased a little bit, then it began to drop very fast after about 20 iterations.
from adda.
One possible solution is to use a small lr for target encoder(e.g. 1e-5) and reduce training epoch.
My setting:
ResNet-50 for source and target encoder.
lr for source encoder : 1e-3 this doesnot matter
lr for discriminator: 1e-3
lr for target encoder: 1e-5
trainin epoch : 6
from adda.
BTW, I use https://github.com/corenel/pytorch-adda not this offical Tensorflow. My result (A->W) is 81% using above setting.
from adda.
Hi Tianlin-Gao,
Thanks for your reply. May I know which layer are you matching and do you fix some layers in the source/target encoder to avoid overfitting? Also, can you tell me the number/size of the hidden layer of your discriminator and which optimizer are you using, parameters for the optimizer? Or would you mind sharing your code for adda on Office?
I am sorry that this has been killing me for days. I just want to get adda work and build my model based on it. I would be really thankful if you can provide more information on that implementation.
Thanks,
Rui
[email protected]
from adda.
from adda.
Hi Tianlin-Gao,
Thanks for your reply. May I know which layer are you matching and do you fix some layers in the source/target encoder to avoid overfitting? Also, can you tell me the number/size of the hidden layer of your discriminator and which optimizer are you using, parameters for the optimizer? Or would you mind sharing your code for adda on Office?
I am sorry that this has been killing me for days. I just want to get adda work and build my model based on it. I would be really thankful if you can provide more information on that implementation.
Thanks,
Rui
[email protected]
Sorry for the delay.
I use ResNet50 for source encoder and target encoder. For both of them, layers prior to Res4a are freezed. Target encode is initialized from source encoder. The output of avgpooling is used as the input for discriminator. The discriminator has 2 hidden layer. Each of them has 500 units.
The pytorch code for discriminator is as follow:
input_dims = 2048
hidden_dims = 500
output_dims = 2
self.layer = nn.Sequential(
nn.Linear(input_dims, hidden_dims),
nn.ReLU(),
nn.Linear(hidden_dims, hidden_dims),
nn.ReLU(),
nn.Linear(hidden_dims, output_dims),
nn.LogSoftmax()
)
Discriminator is from scratch. Source encoder is initialized from ImageNet ResNet50
from adda.
Related Issues (18)
- Adversarial loss HOT 4
- Results not so stable in SVHN to MNIST HOT 2
- we can't download usps datasets HOT 2
- Running code out-of-the-box gives low accuracy HOT 5
- Appropriate layer to adapt HOT 3
- Error HOT 1
- ImportError: cannot import name ExitStack HOT 1
- Custom Dataset
- No module named 'tensorflow.contrib.learn.python.learn.dataframe.queues.feeding_queue_runner HOT 3
- Error: Missing argument "SOURCE".
- Code for the paper "SPLAT: Semantic Pixel-Level Adaptation Transforms for Detection"
- Why my script would stop running without any error report? HOT 1
- vgg16 did not run
- Why put the results of the entire network into ADDA instead of the feature extraction output?
- wish you can share resnet-50 code
- AttributeError: 'MNIST' object has no attribute 'ignore_labels'
- Hyperparameters used when adapting usps1800 to mnist 2000 and vice versa HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from adda.