Giter Site home page Giter Site logo

Comments (5)

JiaweiZhuang avatar JiaweiZhuang commented on June 12, 2024

I am trying to fix this issue by Increasing The Maximum Number Of Open Files.

On CentOS, the default number is:

$ ulimit -n
10000

Changing the number with ulimit -n 16384 leads to permission error. But you can edit /etc/security/limits.conf such that

centos soft nofile 16384
centos hard nofile 16384

where centos is the user name. Re-login, and ulimit -n should show the new number.

from gchp_legacy.

JiaweiZhuang avatar JiaweiZhuang commented on June 12, 2024

Problem solved by raising ulimit -n as above and reducing the output collections (only keep SpeciesConc). Complete log: run_c180_7days_N8n288_pass_hdf5_issue.log

However, the simulation finished with a very long trace of HDF5 error (the start and end of the trace are shown below). Hope it doesn't affect anything...

...
 NOT using buffer I/O for file: cap_restart
forrtl: error (78): process killed (SIGTERM)
Image              PC                Routine            Line        Source             
libifcoremt.so.5   00002AE2723BF555  for__signal_handl     Unknown  Unknown
libpthread-2.17.s  00002AE2743215D0  Unknown               Unknown  Unknown
libpthread-2.17.s  00002AE27432075D  __close               Unknown  Unknown
libhdf5.so.103.1.  00002AE271120B5B  Unknown               Unknown  Unknown
libhdf5.so.103     00002AE27110F572  H5FD_close            Unknown  Unknown
libhdf5.so.103     00002AE2710FADC4  H5F__dest             Unknown  Unknown
libhdf5.so.103     00002AE2710FC164  H5F_try_close         Unknown  Unknown
libhdf5.so.103     00002AE2710FBDDC  H5F__close_cb         Unknown  Unknown
libhdf5.so.103.1.  00002AE27118326E  Unknown               Unknown  Unknown
libhdf5.so.103     00002AE271261800  H5SL_try_free_saf     Unknown  Unknown
libhdf5.so.103     00002AE271183169  H5I_clear_type        Unknown  Unknown
libhdf5.so.103     00002AE2710EAA9E  H5F_term_package      Unknown  Unknown
libhdf5.so.103     00002AE27102D08A  H5_term_library       Unknown  Unknown
libc-2.17.so       00002AE27476BC29  Unknown               Unknown  Unknown
libc-2.17.so       00002AE27476BC77  Unknown               Unknown  Unknown
libifcoremt.so.5   00002AE2723B2BEF  for_exit              Unknown  Unknown
geos               00000000006FC3F6  MAIN__                     49  GEOSChem.F90
geos               000000000040FE42  Unknown               Unknown  Unknown
libc-2.17.so       00002AE274754495  __libc_start_main     Unknown  Unknown
geos               000000000040FD49  Unknown               Unknown  Unknown
forrtl: error (78): process killed (SIGTERM)
Image              PC                Routine            Line        Source             
libifcoremt.so.5   00002B6B76EB4555  for__signal_handl     Unknown  Unknown
libpthread-2.17.s  00002B6B78E165D0  Unknown               Unknown  Unknown

...

forrtl: error (78): process killed (SIGTERM)
Image              PC                Routine            Line        Source             
libifcoremt.so.5   00002B9210C0A555  for__signal_handl     Unknown  Unknown
libpthread-2.17.s  00002B9212B6C5D0  Unknown               Unknown  Unknown
libpthread-2.17.s  00002B9212B6B75D  __close               Unknown  Unknown
libhdf5.so.103.1.  00002B920F96BB5B  Unknown               Unknown  Unknown
libhdf5.so.103     00002B920F95A572  H5FD_close            Unknown  Unknown
libhdf5.so.103     00002B920F945DC4  H5F__dest             Unknown  Unknown
libhdf5.so.103     00002B920F947164  H5F_try_close         Unknown  Unknown
libhdf5.so.103     00002B920F946DDC  H5F__close_cb         Unknown  Unknown
libhdf5.so.103.1.  00002B920F9CE26E  Unknown               Unknown  Unknown
libhdf5.so.103     00002B920FAAC800  H5SL_try_free_saf     Unknown  Unknown
libhdf5.so.103     00002B920F9CE169  H5I_clear_type        Unknown  Unknown
libhdf5.so.103     00002B920F935A9E  H5F_term_package      Unknown  Unknown
libhdf5.so.103     00002B920F87808A  H5_term_library       Unknown  Unknown
libc-2.17.so       00002B9212FB6C29  Unknown               Unknown  Unknown
libc-2.17.so       00002B9212FB6C77  Unknown               Unknown  Unknown
libifcoremt.so.5   00002B9210BFDBEF  for_exit              Unknown  Unknown
geos               00000000006FC3F6  MAIN__                     49  GEOSChem.F90
geos               000000000040FE42  Unknown               Unknown  Unknown
libc-2.17.so       00002B9212F9F495  __libc_start_main     Unknown  Unknown
geos               000000000040FD49  Unknown               Unknown  Unknown
--------------------------------------------------------------------------

from gchp_legacy.

lizziel avatar lizziel commented on June 12, 2024

Thanks for reporting the fix!

from gchp_legacy.

lizziel avatar lizziel commented on June 12, 2024

Ah, now I see there were still problems after this fix. @JiaweiZhuang were you able to resolve this? I have been using ifort19 with OpenMPI4 and at high resolution without issue, but using more recent versions (both GCHP and MAPL History).

from gchp_legacy.

lizziel avatar lizziel commented on June 12, 2024

I am closing out this issue due to inactivity. If there are further problems related to this issue please open a new issue at GCHPctm.

from gchp_legacy.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.