Comments (4)
Just to be clear: you have a DVM running that you started with just prte
, and you have a set of procs that are being started via some separate method (and so they will appear as singletons) - correct? So the problem is to tell the singletons how to connect to the DVM?
The singletons will automatically look for a system server, so all you should need to do is start the DVM with the "system server" flag: prte --system-server
.
I see you configured --with-slurm
for some reason. If you are using Slurm to start the procs, that could be a problem as the procs may automatically connect to the Slurm daemon and not the DVM. Your only sure bet would be to add an option to your program that tries to circumvent that behavior, but I'd have to think about it for awhile and probably experiment a bit.
from ompi.
I'm starting my DVM with prte as follows:
prte --hostfile $HOME/gsotodos/conf/machines_mpi --report-uri $PRTEFILE --no-ready-msg &
I configured --with-slurm for other purposes, but as my application is running with Spark, I am starting my jobs with spark-submit. I tried to start my server with the flag --system-server but the clients still cannot connect with the server.
Is there any way to specify this configuration with environment variables or any way to launch my clients without mpiexec?
from ompi.
When you say "it cannot connect", what are you seeing that tells you this? I just tested it and the clients connect just fine. The issue may be in what data they expected to be able to access.
from ompi.
Related Issues (20)
- when i run mpi program using ASAN, asan reports some memory leaks HOT 1
- Error when using MPI_Comm_spawn with ULFM enabled HOT 6
- MPI_Status_f082f not part of the mpi_f08 interface HOT 13
- coll_tuned_dynamic_rules_filename option no way to set alltoall_algorithm_max_requests from the rules file
- coll_tuned_use_dynamic_rules wrong scoping for tools interface
- Fflush(stdout) doesn't work as expected. HOT 6
- small array of derived data type(in Fortran) can be sent by MPI_Isend and MPI_Irecv but it ran into errors when I augment the array HOT 4
- Error while building from source openmpi 5.0.3 HOT 2
- Fault tolerant error when re spawn process in mpiexec in remote node
- fortran .mod files installed in libdir instead of includedir HOT 34
- PMIX_ERROR when MPI_Comm_spawn in multiple nodes HOT 13
- coll tuned alltoall algorithm ignored after initialization
- Build fail on Mac M3 with macOS clang 15 HOT 1
- Mystery error on exit HOT 4
- pkgconfig files not installed with `--enable-script-wrapper-compilers` HOT 3
- mpirun nccl-test hang HOT 6
- cannot MPI_File_open a one-character filename, deletes external file anyways HOT 5
- Reduce_local Segmentation fault when Running with IMB-MPI1 built for GPU HOT 3
- mpirun 5.0.3 has bug on parse shell args while 4.1.6 works well. HOT 13
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ompi.