Comments (6)
To summarize the current status after switching from MPICH 3.3 to OpenMPI 2.1:
- All diagnostic collections off: timing info incomplete due to early termination (see geoschem/GCHP#6)
- Any diagnostic collections on: diagnostic write hang at end of run (this issue)
I suggest checking whether this issue goes away after upgrading to OpenMPI 3. On the Odyssey cluster, switching from OpenMPI 2 to OpenMPI 3 fixed this same issue after the operating system was changed from CentOS6 to CentOS7.
from gchp_legacy.
With OpenMPI 3.1.3:
- With no diagnostics, the run no longer crashes at 00:10 (fixes error no. 1).
- With any diagnostics, the run is able to write output files (fixes error no. 2).
- The timing info is still incomplete (error no. 3 still exists; see the comment in geoschem/GCHP#6).
from gchp_legacy.
Because OpenMPI 3 seems to be the most robust configuration right now, I suggest focusing on fixing its timing issue and putting aside MPICH3 and OpenMPI2 for now.
I made another AMI, ami-01074a30392daa0f9, with OpenMPI 3.1.3:
cd ~/tutorial/gchp_standard
mpirun -np 6 -oversubscribe ./geos
The -oversubscribe flag is needed by the OpenMPI 3 runtime when the number of physical cores is less than the number of MPI processes.
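The rule above (add the flag only when there are fewer cores than MPI processes) could be sketched roughly as follows. This is only an illustration: `oversubscribe_flag` is a hypothetical helper, not part of GCHP or OpenMPI, and `nproc` reports logical CPUs, which can be more than the physical cores on hyperthreaded machines, so the core count here is an assumption that may need adjusting. The script only prints the command it would run.

```shell
#!/bin/sh
# Hypothetical helper (not part of GCHP or OpenMPI): print the extra
# mpirun flag when the core count is smaller than the process count.
oversubscribe_flag() {
    nprocs=$1
    ncores=$2
    if [ "$ncores" -lt "$nprocs" ]; then
        printf '%s\n' "-oversubscribe"
    fi
}

NPROCS=6
# Assumption: nproc (logical CPUs) is used as a stand-in for the core count;
# on hyperthreaded machines this may overestimate the physical cores.
NCORES=$(nproc 2>/dev/null || getconf _NPROCESSORS_ONLN)
FLAG=$(oversubscribe_flag "$NPROCS" "$NCORES")

# Print the command instead of running it, so the sketch is safe to try.
echo "mpirun -np $NPROCS $FLAG ./geos"
```

On a machine with at least 6 cores the printed command has no extra flag; with fewer, it includes -oversubscribe as in the command above.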
from gchp_legacy.
Thanks Jiawei, I am closing this issue since switching to OpenMPI 3 fixed the file write hang issue. The issue for incomplete timing info at the end of the run is still open and will be tracked separately in geoschem/GCHP#6.
from gchp_legacy.
I think we should put a warning on the wiki regarding the issues with MPICH3 and OpenMPI2. On a lot of shared systems, unlike on the cloud, users cannot install whatever MPI they want.
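Until such a warning exists, a user on a shared system could check which MPI their environment provides with something like the sketch below. `classify_mpi` is a hypothetical helper, and the patterns assume that Open MPI's `mpirun --version` prints a first line such as "mpirun (Open MPI) 3.1.3" and that MPICH's Hydra launcher prints a line containing "HYDRA"; other MPI builds may format this differently. The labels only reflect what was reported in this thread.

```shell
#!/bin/sh
# Hypothetical helper: classify the first line of `mpirun --version`
# against the results reported in this thread.
classify_mpi() {
    case "$1" in
        *"(Open MPI) 2."*) echo "warn" ;;    # diagnostic write hang reported here
        *"(Open MPI) 3."*) echo "ok" ;;      # output written; timing info still incomplete
        *HYDRA*)           echo "warn" ;;    # MPICH launcher; issues reported with MPICH 3.3
        *)                 echo "unknown" ;; # not covered in this thread
    esac
}

# Assumption: the version appears on the first line of the output.
VERSION_LINE=$(mpirun --version 2>/dev/null | head -n 1)
echo "MPI check: $(classify_mpi "$VERSION_LINE")"
```

If `mpirun` is not on the PATH, the version line is empty and the check reports "unknown".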
from gchp_legacy.