Giter Site home page Giter Site logo

Comments (17)

RuijieJ avatar RuijieJ commented on September 26, 2024 11

Hi, I get the same problem but the #7 issue is missing now. Can anyone please explain again how to solve the "Segmentation fault (core dumped)" error?

from deformable-convolution-v2-pytorch.

Simpatech-app avatar Simpatech-app commented on September 26, 2024 8

Hello
I have the same issue , I tried with 512, 256, ... even with 1 and I am receiving the same error. I have GeForce GTX1080. Any idea why I have still this problem?

from deformable-convolution-v2-pytorch.

cjnjuwhy avatar cjnjuwhy commented on September 26, 2024 8

Tips: You should change the codes before make.
so, first git clone the repo,
then make sure your gcc version >=4.9,
and change the code as mentioned @xvjiarui ,
last bash ./make.sh.
If Backward is not reentrant after python test, refer to issue #16

from deformable-convolution-v2-pytorch.

heartInsert avatar heartInsert commented on September 26, 2024 5

@Simpatech-app have you rebuiled the code?

Do you mean by running again sh make.sh???? if so, I already did it and still get this error. Any idea how to fix it? After making the project, should do any thing else?

Gentle , the make.sh will create a Directory named bulid in your project , after you change 1024 to 256 with @xvjiarui , you need to delete this Directory bulid , and run make.sh , then it will work.

And after that , you will meet another exception named Backward is not reentrant, I just delete those check_function

Hello,world

from deformable-convolution-v2-pytorch.

aaronpetok avatar aaronpetok commented on September 26, 2024 3

@cjnjuwhy Thank you very much,I solved the problem following your tips!

from deformable-convolution-v2-pytorch.

gzhcv avatar gzhcv commented on September 26, 2024

I get the same problem. ( torch1.0.0 cuda9.0)

from deformable-convolution-v2-pytorch.

xvjiarui avatar xvjiarui commented on September 26, 2024

#7 Please try this solution.

from deformable-convolution-v2-pytorch.

gzhcv avatar gzhcv commented on September 26, 2024

@xvjiarui thanks a lot for your help! I'll work on it

from deformable-convolution-v2-pytorch.

gzhcv avatar gzhcv commented on September 26, 2024

@xvjiarui Hi, another problem occured when running the function check_gradient_dconv() in test.py. (others function for checking in test.py seems running correctly )

error in deformable_col2im_cuda: too many resources requested for launch
Traceback (most recent call last):
File "test.py", line 624, in
check_gradient_dconv()
File "test.py", line 400, in check_gradient_dconv
eps=1e-3, atol=1e-3, rtol=1e-2, raise_exception=True))
File "/home/gzh/SoftWare/tf1.10/anaconda2/envs/python36/lib/python3.6/site-packages/torch/autograd/gradcheck.py", line 205, in gradcheck
'numerical:%s\nanalytical:%s\n' % (i, j, n, a))
File "/home/gzh/SoftWare/tf1.10/anaconda2/envs/python36/lib/python3.6/site-packages/torch/autograd/gradcheck.py", line 185, in fail_test
raise RuntimeError(msg)
RuntimeError: Jacobian mismatch for output 0 with respect to input 0,
numerical:tensor([[ 0.0000, -0.0243, -0.1477, ..., 0.0000, 0.0000, 0.0000],
[ 0.0000, -0.0236, -0.0276, ..., 0.0000, 0.0000, 0.0000],
[ 0.0000, 0.0000, 0.0000, ..., 0.0000, 0.0000, 0.0000],
...,
[ 0.0000, 0.0000, 0.0000, ..., 0.0000, -0.2121, 0.0206],
[ 0.0000, 0.0000, 0.0000, ..., 0.0058, -0.2543, 0.0000],
[ 0.0000, 0.0000, 0.0000, ..., 0.1695, 0.0015, 0.0480]],
dtype=torch.float64)
analytical:tensor([[0., 0., 0., ..., 0., 0., 0.],
[0., 0., 0., ..., 0., 0., 0.],
[0., 0., 0., ..., 0., 0., 0.],
...,
[0., 0., 0., ..., 0., 0., 0.],
[0., 0., 0., ..., 0., 0., 0.],
[0., 0., 0., ..., 0., 0., 0.]], dtype=torch.float64)

The environment is

  • Titan Black (6G memory)
  • cuda8.0 cudnn6.0
  • torch 1.0
  • python 3.6

Any help?

from deformable-convolution-v2-pytorch.

xvjiarui avatar xvjiarui commented on September 26, 2024

I believe it is caused by different hardware.
Try to change

const int CUDA_NUM_THREADS = 1024;
and to some smaller number like 512 or 256.
This should help.

from deformable-convolution-v2-pytorch.

gzhcv avatar gzhcv commented on September 26, 2024

@xvjiarui It's okay! Thanks a lot~

from deformable-convolution-v2-pytorch.

xvjiarui avatar xvjiarui commented on September 26, 2024

Hello
I have the same issue , I tried with 512, 256, ... even with 1 and I am receiving the same error. I have GeForce GTX1080. Any idea why I have still this problem?

Segmentation fault could be solve by this #7.
CUDA_NUM_THREADS is responsible for too many resources requested for launch.

from deformable-convolution-v2-pytorch.

gzhcv avatar gzhcv commented on September 26, 2024

@Simpatech-app have you rebuiled the code?

from deformable-convolution-v2-pytorch.

Simpatech-app avatar Simpatech-app commented on September 26, 2024

@xvjiarui @gzhcv I have gcc version 5.4.0 and according to issue#7 the gcc >=4.9 should work well. However, I have still the same problem.

from deformable-convolution-v2-pytorch.

Simpatech-app avatar Simpatech-app commented on September 26, 2024

@Simpatech-app have you rebuiled the code?

Do you mean by running again sh make.sh???? if so, I already did it and still get this error. Any idea how to fix it? After making the project, should do any thing else?

from deformable-convolution-v2-pytorch.

ae86zhizhi avatar ae86zhizhi commented on September 26, 2024

Hello
I have the same issue , I tried with 512, 256, ... even with 1 and I am receiving the same error. I have GeForce GTX1080. Any idea why I have still this problem?

Segmentation fault could be solve by this #7.
CUDA_NUM_THREADS is responsible for too many resources requested for launch.

I have the same issue too many resources requested for launch on v100 sxm3

from deformable-convolution-v2-pytorch.

dtn97 avatar dtn97 commented on September 26, 2024

I have the same issue with rtx2070 :(
I make sure my gcc version is 5.4 and i changed CUDA_NUM_THREADS = 512, 256, 1

from deformable-convolution-v2-pytorch.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.