Mystery error on exit (ompi issue, closed)

PhilipDeegan commented on July 19, 2024
Mystery error on exit

from ompi.

Comments (4)

rhc54 commented on July 19, 2024

I'm unaware of any limitation on the number of concurrent mpiruns, but I don't really understand what you are trying to do. A far cleaner way of doing this would be to start the PRRTE DVM (just prte) and then use prun to launch the individual jobs. That avoids all the overhead of starting the RTE over and over again, and of loading the file system by creating and removing the session directories for each of those mpirun instances.
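A minimal sketch of the DVM workflow suggested above, written in Python purely for illustration. It assumes `prte`, `prun`, and `pterm` are on the PATH, that `prte --daemonize` is used to start the DVM in the background, and that jobs are launched one after another; the job name `./my_test` and the helper names `dvm_commands` and `run_session` are hypothetical and not from the thread.

```python
import subprocess


def dvm_commands(jobs, procs_per_job=2):
    """Return the argv lists for one DVM session: start, launch each job, shut down."""
    # Start the PRRTE DVM once, in the background.
    commands = [["prte", "--daemonize"]]
    for job in jobs:
        # Each job is an argv list, e.g. ["./my_test"] (hypothetical executable).
        commands.append(["prun", "-n", str(procs_per_job), *job])
    # Tear the DVM down once every job has been submitted and has finished.
    commands.append(["pterm"])
    return commands


def run_session(jobs, procs_per_job=2):
    """Run the session commands in order and return the exit code of each prun."""
    start, *launches, stop = dvm_commands(jobs, procs_per_job)
    subprocess.run(start, check=True)
    try:
        # A non-zero code here points at the job that did not exit cleanly.
        return [subprocess.run(cmd).returncode for cmd in launches]
    finally:
        # Always shut the DVM down, even if a launch raised.
        subprocess.run(stop)


if __name__ == "__main__":
    # Print the commands instead of running them, so the sketch is safe to try.
    for cmd in dvm_commands([["./my_test"], ["./my_test", "--case", "2"]]):
        print(" ".join(cmd))
```

The runtime is started and stopped once per session rather than once per job, which is the saving described above. Collecting the per-job exit codes may also help narrow down which job triggers the intermittent error.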

Setting that aside, all the output is telling you is that one of your processes didn't exit properly - likely failed to call MPI_Finalize before terminating. You'd get a different error message if it had segfaulted, so I suspect that isn't what happened. Probably just something that triggered an error escape in your job.

PhilipDeegan commented on July 19, 2024

> to start the PRRTE DVM (just prte) and then use prun to launch

I am not experienced in any of this, so it's not something that I know much about, but I can look into it.

> all the output is telling you is that one of your processes didn't exit properly

Yes, I can see that. The issue here is that, from my point of view, it shouldn't be happening, and it only happens sometimes, with no clear indication of what's happening or why.

Anyway, I haven't seen it since upgrading from Fedora 39 to 40, so hopefully it's transient.

hominhquan commented on July 19, 2024

This may be related to #10117?

rhc54 commented on July 19, 2024

No - totally unrelated unless you see your procs crashing, which isn't what you report. It sounds to me like the issue is something in your integration with the OS if upgrading Fedora solves the problem. I very much doubt it is something in OMPI causing you to exit improperly - that would almost always show as a segfault.
