Comments (4)
Yes, I have observed this as well. That's why I rolled back to cuDNN v2.
cuDNN v2 was also unstable until rc3, so I tend to wait a few months for NVIDIA's engineers to optimize a new release.
from caffe-windows.
Actually, cuDNN v3 provides several algorithms for convolution. During layer setup, the convolution layer automatically selects one algorithm based on the GPU memory. However, this feature is not exploited well. You could try forcing Caffe to use specific algorithms for the forward and backward passes and measure how much time each one costs.
I have already run some experiments on my GTX 780. Unfortunately, no combination of algorithms performs better than cuDNN v2. I guess cuDNN v3 is tuned only for Maxwell-architecture GPUs. You may try it yourself. I look forward to your results, to check whether this is a Windows problem or simply due to the immature cuDNN.
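For anyone who wants to repeat this kind of comparison, here is a minimal sketch of a timing harness that tries every forward/backward algorithm combination and ranks them. It is not code from caffe-windows: `benchmark` and `run_iteration` are hypothetical names, and the algorithm labels are only illustrative strings modelled on cuDNN enum names, so check them against the headers of your cuDNN version. You would have to supply `run_iteration` yourself, for example a function that runs one forward/backward pass of a Caffe build patched to force the given algorithms.

```python
import itertools
import time

# Illustrative labels only (assumption): verify against your cudnn.h.
FWD_ALGOS = ["IMPLICIT_GEMM", "IMPLICIT_PRECOMP_GEMM", "GEMM", "FFT"]
BWD_FILTER_ALGOS = ["ALGO_0", "ALGO_1", "FFT"]
BWD_DATA_ALGOS = ["ALGO_0", "ALGO_1", "FFT"]


def benchmark(run_iteration, iterations=10, timer=time.perf_counter):
    """Time every algorithm combination and return them sorted, fastest first.

    run_iteration(fwd, bwd_filter, bwd_data) is a user-supplied (hypothetical)
    callable that performs one forward/backward pass with the given algorithms.
    It must block until the GPU work has finished, otherwise only the kernel
    launch time is measured.
    """
    results = []
    for combo in itertools.product(FWD_ALGOS, BWD_FILTER_ALGOS, BWD_DATA_ALGOS):
        run_iteration(*combo)  # warm-up pass, not timed
        start = timer()
        for _ in range(iterations):
            run_iteration(*combo)
        elapsed = timer() - start
        results.append((elapsed / iterations, combo))
    # Sort by mean time per iteration, fastest combination first.
    results.sort(key=lambda item: item[0])
    return results
```

Calling `benchmark(my_runner)[0]` would then give the fastest combination found, which can be compared against the same network timed under cuDNN v2.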
Thanks for your reply. I just tried it on my GTX 980, which is a Maxwell-architecture GPU, and found it much slower than using the GPU alone (Windows 7). I have already switched back to cuDNN v2, which is marginally faster than v1 on my data.
This problem has been solved in the most recent version of cuDNN. I have updated the 3rdparty library package and the code. Please check.