Comments (12)
I switched PyTorch to version 1.0.0 with pip install torch==1.0, and I still get a segmentation fault.
from fastfcn.
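It seems I used c++ when compiling the external extension, while PyTorch was compiled with g++. How can I compile the external extension with g++?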
$ c++ -v
Using built-in specs.
COLLECT_GCC=c++
COLLECT_LTO_WRAPPER=/usr/libexec/gcc/x86_64-redhat-linux/4.8.2/lto-wrapper
Target: x86_64-redhat-linux
Configured with: ../configure --prefix=/usr --mandir=/usr/share/man --infodir=/usr/share/info --with-bugurl=http://bugzilla.redhat.com/bugzilla --enable-bootstrap --enable-shared --enable-threads=posix --enable-checking=release --with-system-zlib --enable-__cxa_atexit --disable-libunwind-exceptions --enable-gnu-unique-object --enable-linker-build-id --with-linker-hash-style=gnu --enable-languages=c,c++,objc,obj-c++,fortran,lto --enable-plugin --enable-initfini-array --without-isl --without-cloog --with-tune=generic --with-arch_32=i686 --build=x86_64-redhat-linux
Thread model: posix
gcc version 4.8.2 20131212 (Red Hat 4.8.2-8) (GCC)
I do not think it's caused by g++. Can you install PyTorch with conda?
@wuhuikai Sorry to bother you again, but after installing PyTorch with conda I still have this problem, and I can't figure out why it happens. I have actually run this project successfully on another machine and got better scores than the paper without any modification. FastFCN is a good project, and I want to build more good work on it.
/home/yulu/anaconda3/envs/fastfcn/lib/python3.6/site-packages/torch/nn/_reduction.py:49: UserWarning: size_average and reduce args will be deprecated, please use reduction='mean' instead.
warnings.warn(warning.format(ret))
Using poly LR Scheduler!
Starting Epoch: 0
Total Epoches: 80
0%| | 0/312 [00:00<?, ?it/s]
=>Epoches 0, learning rate = 0.0010, previous best = 0.0000
Segmentation fault
Can you compare the environment (versions of gcc, python, pytorch, cuda, cudnn, GPU, GPU driver) between the two machines?
On my three machines, everything works well, except that sometimes I need to run it twice to get rid of the segmentation fault.
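One way to make the comparison above concrete is to print the same report on both machines and diff the output. The following is a minimal sketch, not part of FastFCN: the helper names `tool_version` and `collect_env` are made up for illustration, and the script assumes the compilers and `nvidia-smi` are on `PATH` (missing tools are reported as `None`).

```python
import platform
import shutil
import subprocess


def tool_version(cmd, args=("--version",)):
    """Return the first output line of `cmd <args>`, or None if cmd is unavailable."""
    exe = shutil.which(cmd)
    if exe is None:
        return None
    try:
        proc = subprocess.run(
            [exe] + list(args),
            stdout=subprocess.PIPE,
            stderr=subprocess.STDOUT,
            universal_newlines=True,
            timeout=10,
        )
    except (OSError, subprocess.SubprocessError):
        return None
    lines = proc.stdout.strip().splitlines()
    return lines[0] if lines else None


def collect_env():
    """Gather the versions asked about in the thread into a dict."""
    info = {
        "os": platform.platform(),
        "python": platform.python_version(),
        "gcc": tool_version("gcc"),
        "g++": tool_version("g++"),
        "c++": tool_version("c++"),
        # Driver version as reported by nvidia-smi, if it is installed.
        "gpu driver": tool_version(
            "nvidia-smi", ("--query-gpu=driver_version", "--format=csv,noheader")
        ),
    }
    try:
        import torch
    except ImportError:
        info["pytorch"] = None
        return info
    info["pytorch"] = torch.__version__
    info["cuda (torch build)"] = torch.version.cuda
    info["cudnn"] = torch.backends.cudnn.version()
    if torch.cuda.is_available():
        info["gpu"] = torch.cuda.get_device_name(0)
    return info


if __name__ == "__main__":
    for key, value in collect_env().items():
        print("{}: {}".format(key, value))
```

Saving the output to a file on each machine and running `diff` on the two files should show at a glance whether gcc, CUDA, or cuDNN differ. PyTorch also ships a similar report via `python -m torch.utils.collect_env`.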
I got past the compiler mismatch by setting export CXX=g++, but the segmentation fault still occurs.
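For reference, the same `CXX` override can be applied from inside Python before any extension is built. This is a hedged sketch rather than the project's own code: `select_cxx` is a hypothetical helper, and it assumes the extension build honors the `CXX` environment variable (as the `export CXX=g++` workaround in this comment suggests) and that the variable is set before the build is triggered.

```python
import os
import shutil


def select_cxx(preferred="g++"):
    """Point CXX at `preferred` if it exists on PATH.

    Returns the resolved compiler path, or None (leaving CXX untouched)
    when the compiler cannot be found.
    """
    path = shutil.which(preferred)
    if path is None:
        return None
    os.environ["CXX"] = path
    return path


if __name__ == "__main__":
    chosen = select_cxx("g++")
    if chosen is None:
        print("g++ not found on PATH; CXX left unchanged")
    else:
        print("CXX set to", chosen)
    # Import or build the compiled extension only after this point,
    # so that the build sees the updated CXX.
```

As the comment above reports, matching the compiler alone did not remove the segmentation fault here, so this only rules out one possible cause.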
One possible reason is the wrong gcc version. On my machines, gcc==5.5.x.
You mentioned that you successfully ran the program on another machine; is it Ubuntu or RedHat?
Yes, it is Ubuntu with gcc=5.5. Maybe the cause of the segmentation fault is the gcc version.
@Anikily The code in branch latest is now ready, which uses the official SyncBatchNorm and can run on all OS with PyTorch>=1.1.0.
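Before switching branches, it may help to confirm that the installed PyTorch meets the PyTorch>=1.1.0 requirement. The sketch below is an illustration only: `version_tuple` and `supports_official_syncbn` are hypothetical helpers, and the small demo model is not taken from the `latest` branch, whose actual use of SyncBatchNorm is not shown in this thread.

```python
import re


def version_tuple(version):
    """Parse the leading 'major.minor[.patch]' of a version string into a tuple."""
    match = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?", version)
    if match is None:
        raise ValueError("unrecognized version string: {!r}".format(version))
    return tuple(int(part) if part is not None else 0 for part in match.groups())


def supports_official_syncbn(version):
    """True if `version` satisfies the PyTorch>=1.1.0 requirement from the thread."""
    return version_tuple(version) >= (1, 1, 0)


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        print("PyTorch is not installed")
    else:
        ok = supports_official_syncbn(torch.__version__)
        print("PyTorch", torch.__version__, "meets >=1.1.0:", ok)
        if ok:
            # Demo model (an assumption for illustration): replace its
            # BatchNorm layers with torch.nn.SyncBatchNorm.
            model = torch.nn.Sequential(
                torch.nn.Conv2d(3, 8, kernel_size=3),
                torch.nn.BatchNorm2d(8),
            )
            model = torch.nn.SyncBatchNorm.convert_sync_batchnorm(model)
            print("converted layer:", type(model[1]).__name__)
```

The version check is plain string parsing, so it also handles suffixed builds such as `1.0.1.post2`.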