Comments (11)
How many threads and MPI ranks?
Do you have a stack trace for that?
from dbcsr.
@dithillobothrium Could you look at this problem?
from dbcsr.
BTW, which compiler are you using?
from dbcsr.
How many threads and MPI ranks?
compiled without OpenMP, single MPI rank (purely serial).
Do you have a stack trace for that?
after dbcsr.h:17
it's
(lldb) bt
* thread #1, queue = 'com.apple.main-thread', stop reason = signal SIGSTOP
* frame #0: 0x00000001002317c7 libgfortran.5.dylib`_gfortrani_internal_unpack_8 + 295
frame #1: 0x000000011d4c267f libdbcsr.dylib`__dbcsr_mpiwrap_MOD_mp_sum_lv at dbcsr_mpiwrap.F:9277
frame #2: 0x000000011d4714de libdbcsr.dylib`__dbcsr_mm_sched_MOD_dbcsr_mm_sched_lib_finalize at dbcsr_mm_sched.F:632
frame #3: 0x000000011d4698fc libdbcsr.dylib`__dbcsr_mm_MOD_dbcsr_multiply_lib_finalize at dbcsr_mm.F:186
frame #4: 0x000000011d40fa0d libdbcsr.dylib`__dbcsr_lib_MOD_dbcsr_finalize_lib at dbcsr_lib.F:129
with frame #0
: being:
(lldb) f 0
frame #0: 0x00000001002317c7 libgfortran.5.dylib`_gfortrani_internal_unpack_8 + 295
libgfortran.5.dylib`_gfortrani_internal_unpack_8:
-> 0x1002317c7 <+295>: cmpq %r13, %rax
0x1002317ca <+298>: jne 0x100231858 ; <+440>
0x1002317d0 <+304>: movq $0x0, -0x68(%rsp)
0x1002317d9 <+313>: cmpq $0x1, %r10
which compiler are you using?
from dbcsr.
@dithillobothrium Could you look at this problem?
I can look
from dbcsr.
OK, it is hanging on an allreduce for a single rank, which is pretty weird...
I had a similar problem time ago with OpenMPI 3.1.2 on a linux machine and it turned our that it is something broken in OpenMPI 3.1.x. Indeed 3.0.2 works. Could you confirm that?
from dbcsr.
in my case it works with gcc 7.3.1 and openmpi 3.1.2 on a linux machine
from dbcsr.
Indeed 3.0.2 works. Could you confirm that?
unfortunately not, 3.0.2
gives the same issue.
from dbcsr.
@alazzaro just installed gfortran and openmpi on our tcopt1 (macos high sierra) via brew and building DBCSR now (using cmake -DUSE_OPENMP=OFF
, just in case you want to try by yourself)
from dbcsr.
I can't reproduce it on macOS High Sierra with OpenMPI-3.1.3, clang-902.0.39.2 and gfortran-8.2.0
@davydden you can still reproduce it with openmpi-3.1.3?
from dbcsr.
you can still reproduce it with openmpi-3.1.3?
no, i can not. 3.1.3 works. Looks like it was indeed some issue with openmpi
from dbcsr.
Related Issues (20)
- Build fails on macOS with fypp errors HOT 1
- Two tests fail on macOS PPC: dbcsr_unittest2, dbcsr_unittest3 HOT 32
- Configure fails on macOS 10.7 and 10.10–10.13 due to Test f2008-norm2 failure HOT 5
- turn off and remove codecoverage checks HOT 3
- Error with mpich 4.1: Type mismatch in argument 'baseptr' at (1); passed TYPE(c_ptr) to INTEGER(4) HOT 26
- cuda tests are broken HOT 1
- Evaluate USE_ACCEL=opencl
- Test MPI_F08 HOT 1
- Default initializers
- mpich test failure on s390x HOT 2
- Thread number has changed error if OMP_DYNAMIC=TRUE HOT 3
- `TEST_MPI_RANKS=auto` does not account for ctest parallelization HOT 1
- Program received signal SIGFPE: Floating-point exception - erroneous arithmetic operation HOT 1
- OpenMP detection with Clang is broken in 2.6.0 HOT 10
- Compilation fails on Archer2 UK system HOT 7
- What levels of sparsity is this useful for? HOT 4
- Removal of dbcsr-data HOT 2
- CUDA RUNTIME API error: DeviceSetLimit failed with error cudaErrorInvalidValue HOT 6
- Consider to drop -Werror for tests/configs pulling external APIs/frameworks HOT 12
- DBCSR performs very poorly on GH200, when there are large blocks HOT 14
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from dbcsr.