
Comments (3)

bbucior commented on July 17, 2024

Thanks for reporting and trying it out on another platform!

I'm not too familiar with Google Colab but may have found a workaround. The source of the crash appears to be parsing the install_requires option in setup.py, which sets up a dependency in older Python 2.x configurations. Everything seemed to work for me after commenting out that line (or adding a step like sed -i -e 's/install_requires/#install_requires/' setup.py to the install process).
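For anyone who would rather patch the file from a Colab cell than shell out to sed, here is a rough Python equivalent of that one-liner. It is only a sketch: the helper names are made up for illustration, and it assumes, like the sed command, that install_requires sits on a single line of setup.py.

```python
from pathlib import Path


def comment_out_install_requires(setup_text: str) -> str:
    """Mimic `sed 's/install_requires/#install_requires/'`.

    Prefixes the first `install_requires` on each line with `#`.
    Assumption: the option fits on one line; a multi-line list would
    be left half-commented, exactly as with the sed command.
    """
    patched_lines = [
        line.replace("install_requires", "#install_requires", 1)
        for line in setup_text.splitlines(keepends=True)
    ]
    return "".join(patched_lines)


def patch_setup_py(path: str = "setup.py") -> None:
    """Rewrite setup.py in place (hypothetical helper, like `sed -i`)."""
    setup_file = Path(path)
    setup_file.write_text(comment_out_install_requires(setup_file.read_text()))
```

Calling patch_setup_py() from the repository root before pip install . should have the same effect as the sed step. Like sed, it is not idempotent: running it twice adds a second #.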

Does it fix the error for you, too?

from mofid.

ngkayjay commented on July 17, 2024

Yes it does, the fix works! Thanks!
Right now I'm on Google Colab as my HPC resource allocation has been approved, but not yet implemented. I suspect other users who would want to play around with ML on MOFs without institutional resources would appreciate your advice as well.

For other users on Colab: run !chmod -R 755 <YOUR_DIR> after you run pip install . to set the proper permissions, otherwise you'll get an Errno 13 permissions error.
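If you prefer to stay in Python instead of using the ! shell escape, something like the following should be roughly equivalent to chmod -R 755. This is a sketch using only the standard library; chmod_recursive is a made-up name, and it skips symbolic links rather than following them.

```python
import os


def chmod_recursive(root: str, mode: int = 0o755) -> int:
    """Apply `mode` to `root` and everything beneath it.

    Returns the number of paths changed. Symlinks are skipped so that
    targets outside `root` are not modified.
    """
    os.chmod(root, mode)
    changed = 1
    # Top-down walk: each directory is made accessible before os.walk descends into it.
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames + filenames:
            path = os.path.join(dirpath, name)
            if os.path.islink(path):
                continue
            os.chmod(path, mode)
            changed += 1
    return changed
```

Usage would be chmod_recursive("<YOUR_DIR>") with the same directory you would pass to chmod.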

One more question: how long does it take to construct a mofid for a given .cif file on your end? The authors whose work I'm reproducing had constructed the mofids for a dataset of 400k+ .cifs, but it takes me ~6s to construct a single mofid. I'm wondering where I should start my optimization.


bbucior commented on July 17, 2024

Awesome, glad everything's working now!

For the ML training set, unfortunately calculating the MOFids is going to take a while for a large folder of CIFs. Your calculation times are consistent with what I'm seeing on my laptop (make test runs through 28 CIFs in 1-2 minutes). If memory serves correctly, I processed the MOF databases by splitting the CIFs into a few folders and running them as parallel jobs on HPC resources (see Scripts/HPC/).
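To put rough numbers on that: at ~6 s per CIF, 400,000 CIFs would take about 667 hours (roughly 28 days) on a single core, which is why splitting the work matters. The sketch below shows the arithmetic and one simple way to divide a list of CIF paths into batches. It is not taken from Scripts/HPC/; the function names and the job count of 32 are assumptions for illustration only.

```python
from typing import List, Sequence


def split_into_batches(items: Sequence[str], n_batches: int) -> List[List[str]]:
    """Distribute items round-robin into n_batches lists of near-equal size."""
    if n_batches < 1:
        raise ValueError("n_batches must be at least 1")
    return [list(items[i::n_batches]) for i in range(n_batches)]


def estimated_hours(n_cifs: int, seconds_per_cif: float, n_parallel_jobs: int = 1) -> float:
    """Wall-clock estimate, assuming perfect scaling across parallel jobs."""
    return n_cifs * seconds_per_cif / n_parallel_jobs / 3600


if __name__ == "__main__":
    # Figures from this thread: 400k+ CIFs at ~6 s each.
    print(f"serial:  {estimated_hours(400_000, 6):.0f} h")
    # 32 parallel jobs is an arbitrary example value, not a recommendation.
    print(f"32 jobs: {estimated_hours(400_000, 6, 32):.1f} h")
```

Each batch could then be written to its own folder and submitted as a separate job. Real speedups will be lower than the ideal estimate if jobs contend for disk or memory.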

TBH, while you're waiting on HPC resources, your best bet to get started would probably be a precomputed MOFid.smi or similar structural information, if it's available in the SI of that paper or another compatible one. For example, our SmVAE paper includes a training set with RFcodes, which are slightly different from MOFid but similar in intent. Maybe something like that could help get things off the ground until you get the compute resources for reproducing the original 400k+ dataset?

