Giter Site home page Giter Site logo

judftteam / best-of-atomistic-machine-learning Goto Github PK

View Code? Open in Web Editor NEW
249.0 6.0 19.0 4.54 MB

πŸ† A ranked list of awesome atomistic machine learning projects βš›οΈπŸ§¬πŸ’Ž.

License: Creative Commons Attribution Share Alike 4.0 International

atomistic-machine-learning awesome-list best-of-list computational-chemistry computational-materials-science condensed-matter density-functional-theory drug-discovery electronic-structure interatomic-potentials

best-of-atomistic-machine-learning's People

Contributors

blaiszik avatar irratzo avatar janosh avatar ml-evs avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

best-of-atomistic-machine-learning's Issues

Add project: OpenMM ML projects

Community request, moved here from #10.

Project details:

  • Project Name: OpenMM-ML
  • Github URL: openmm/openmm-ml
  • Category: md
  • Labels: md, mliap
  • License: MIT
  • Package Managers: conda:conda-forge/openmm-ml

  • Project Name: SPICE
  • Github URL: openmm/spice-dataset
  • Category: datasets
  • Labels: datasets, mliap, md
  • License: MIT
  • Package Managers: None

  • Project Name: NNPOps
  • Github URL: openmm/NNPOps
  • Category: mliap
  • Labels: mliap, md, lang-cpp
  • License: MIT
  • Package Managers: conda:conda-forge/nnpops

Add project: Materials repositories popular in AML research

Project details:

See below.

Additional context:

This is a tricky subject. Materials repositories. Should they be part of this list, or rather be in a separate "materials informatics" list or similar?

Yes: Other "static" datasets like QM9 are also listed here. Why? Because they are part of atomistic ML (AML) publications as much as the models on which they were trained. Same goes for publications that used materials repos like Materials Project, then.

No. This is a bigger undertaking. There are a lot of materials repositories. Which one should be added? Add only the homepagess as resources, or homepages and APIs as separate entries, or homepages and APIs and associated tools as separate entries?

A balance should be struck here. Adding all kinds of materials informatics and computational materials science tools in this AML list would dilute it. For example, pymatgen and spglib should not be listed here, but in such a dedicated list. So, strike a balance: Only add the minimal set of tools to access materials repo data via browser or via API. Two entries at most per repo as guideline.

List of projects to research

Note. Add only those that actually have a record of being used in published AML research.

  • Materials Project
  • AFLOWlib

List of projects to add to the list

Improve labels

Configuration Change:

Right now, labels have some issues.

Problem 1). Labels with emoji name don't show up in Explanations section.

Custom labels without an image but just a name (emoji or string) are not shown in README section Explanations. That makes some emoji labels confusing. For example, ⚑ for electrostatics can only be guessed at without an "Explanation" entry.

Possible solutions.

  • Labels with name emoji: replace with an image emoji (an actual image file to be added to config/images). That makes text less copy & paste-able.
  • Replace all custom labels with name string labels. Then they are all self-explanatory.
  • Read template docs again and see if I have not misunderstood. Otherwise, create a template issue.

Add project: paperswithcode

Project details:

This suggestion is not a single project, but several datasets and benchmark models aggregated via paperswithcode.

  • For some of the already existing list entries (e.g. QM9, OC20 dataset), maybe replace website with paperswithcode entry.
  • The paperswithcode tags could be entered as resource in respective category, if any. Collect here first, then assign.
  • Also add the linked models / packages.

Initial datasets finds.

Initial tags finds.

Add project: EDM

Project details:

  • Project Name: EDM
  • Description: E(3) Equivariant Diffusion Model for Molecule Generation in 3D.
  • Github URL: ehoogeboom/e3_diffusion_for_molecules
  • Category: generative
  • Labels: generative
  • License: MIT
  • Package Managers: None

Additional context:

Original publication: https://arxiv.org/abs/2203.17003v2

Add project: wfl

Project details:

  • Project Name: wfl
  • Description: Workflow is a Python toolkit for building interatomic potential creation and atomistic simulation workflows.
  • Github URL: libAtoms/workflow
  • Category: mliap
  • Labels: workflows, htc
  • License: None
  • Package Managers: None

Additional context:

Original publication: https://arxiv.org/abs/2306.11421v1

Use manual description copied from README.md rather than automatic one, that one is too short.

Add project: matsci.org

Project details:

  • Project Name: matsci.org
  • URL: https://matsci.org/
  • Description: "Computational materials science community, online forum."
  • Category: community-resource

Additional context:

Arguable whether this community resource should be put here, as it is more about general atomistic modeling and simulation, rather than atomistic machine learning. But on the other hand, there is no such dedicated AML community forum, so this is an okay place to put AML question. And AML is a sub- or site field of atomistic modeling will just merge into the general field as time moves on, so why even make a distinction.

Improve labels II

Configuration Change:

Moved here from previous issue.

Problem 2). Classification by categories is not enough.

Change from categorical (one-dimensional) to tagging (multi-dimensional; here, tags are called 'labels') system. In this system, every list item has labels, and the category is only one of them, the 'main' or 'emphasized' one. Example from issue #10: Package equisolve would gain the labels math, mlp, general-tools, and the category general-tools is the category = emphasized label.

TODO:

  • Add labels: One for each category (duplicate structure).
  • Issue #10 add labels for equisolve
  • Issue #24 add labels for flare

Add project: EquiformerV2

Project details:

  • Project Name: EquiformerV2
  • Github URL: atomicarchitects/equiformer_v2
  • Category: rep-learn
  • License: None
  • Package Managers: None

Additional context:

The actual model is implemented in the ocp package, see README.

Anonymous community feedback

Configuration Change:

Anonymous community feedback 2023-06-13.

  • Disable license warnings. Which licenses are risky is a matter of opinion.
  • Add an explanation of the "combined project-quality score".
  • MD category: Add LAMMPS & all the lammps-xxx repo integrating a given potential with lammps; as well as OpenMM and TorchMD, both usable with ML potentials.

Improvement suggestions to the best-of-generator template.

  • Allow projects in multiple categories.
    • For now, the template only allows one category per project. However, one can implement this also by duplicating a project entry for different categories. That should not require a change to the template. The question is, then, where to draw the line of duplication, in order to stay fair to every package. Ultimately, this shows the deficit of the category (tree-based) vs. label (tag-based) classification schemes. Simplest would be to just do this case-by-case, whenever there is a package developer / community request for it. In that case:
    • Put equisolve in multiple categories: math, mlp, general-tools. Developer request.
  • If the way the project score is calculated raises concern, create an issue with improvement suggestions for .

Add project: Artificial Intelligence for Science (AIRS)

Project details:

  • Project Name: Artificial Intelligence for Science (AIRS)
  • Github URL: https://github.com/divelab/AIRS
  • Category: General Tools or Representation Learning
  • License: GPL-3.0 license
  • Package Managers: None

  • Project Name: AI for Science Map
  • URL: https://www.air4.science/map
  • Category: Comunity resource
  • License: GPL-3.0 license
  • Description: Interactive visual treemap of the AI4Science research field, including atomistic machine learning, as of summer 2023, including papers, packages, learning resources.

TODO: add missing packages, resources from there here as well.


Update project: Flare

Update details:

Additional context:

The project is listed under General Tools. I think a better fit would be the Active Learning section since a core functionality is building force fields using AL.

Add project: TorchMD-NET

Community request, moved here from #10.

Project details:

  • Project Name: TorchMD-NET
  • Github URL: torchmd/torchmd-net
  • Category: md
  • Labels: md, mliap, rep-learn
  • License: MIT
  • Package Managers: None

Add project: matbench, MPContribs

Project details:

  • Project Name: MatBench
  • Github URL: materialsproject/matbench
  • Category: community
  • Labels: community, datasets, benchmarking
  • License: MIT
  • Package Managers: pypi:matbench

  • Project Name: MPContribs
  • Github URL: materialsproject/MPContribs
  • Category: datasets
  • Labels: datasets
  • License: MIT
  • Package Managers: pypi:mpcontribs-client

Add project: MEGAN

Project details:

  • Project Name: MEGAN: Multi Explanation Graph Attention Student
  • Github URL: aimat-lab/graph_attention_student
  • Category: xai
  • Labels: xai, rep-learn
  • License: MIT
  • Package Managers: None

Additional context:

Original publication: https://arxiv.org/abs/2305.15961v1

(technically, the original publication is http://arxiv.org/abs/2211.13236, but the two papers link to repos that do not exist anymore or have been updated since. therefore, assume the newer publication is the closest one to current state of code.)

Add project: Visual Graph Datasets

Project details:

  • Project Name: Visual Graph Datasets
  • Github URL: aimat-lab/visual_graph_datasets
  • Category: datasets
  • Tags: datasets, xai, rep-learn
  • License: MIT
  • Package Managers: None

Additional context:

Dataset used in original publication of #69 .

Add project: ML-for-CurieTemp-Predictions

Project details:

  • Project Name: ML-for-CurieTemp-Predictions
  • Description: Machine Learning Predictions of High-Curie-Temperature Materials
  • Github URL: msg-byu/ML-for-CurieTemp-Predictions
  • Category: rep-eng
  • Labels: rep-eng, single-paper, magnetism
  • License: MIT
  • Package Managers: None

Additional context:

Original publication: http://arxiv.org/abs/2307.06879.

Add project: MateriApps

Project details:

Additional context:

What’s MateriApps

A Portal Site of Materials Science Simulation for Computational Materials Science Researchers, Theoreticians, Experimentalists, and Computer Scientists

Add project: Citrine Informatics ERD projects

Project details:

  • Project Name: GlassPy
  • Github URL: drcassar/glasspy
  • Category: rep-eng
  • Labels: rep-eng glass, composition, materials
  • License: GPL-3.0
  • Package Managers: pypi:glasspy

Are compositional feature -- based models atomistic models? In this case, I decide for "Yes".


  • Project Name: SciGlass
  • Github URL: drcassar/SciGlass
  • Category: datasets
  • Labels: datasets, glass
  • License: MIT

  • Project Name: sl_discovery
  • Github URL: CitrineInformatics-ERD-public/sl_discovery
  • Category: materials-discovery
  • License: Apache-2.0
  • Package Managers: None
  • Labels: materials-discovery, single-paper

  • Project Name: Linear vs blackbox
  • Github URL: CitrineInformatics-ERD-public/linear-vs-blackbox
  • Category: xai
  • Labels: xai, single-paper, rep-eng
  • License: None

  • Project Name: closed-loop-acceleration-benchmarks
  • Github URL: aced-differentiate/closed-loop-acceleration-benchmarks
  • Category: materials-discovery
  • Labels: materials-discovery, active-learning, single-paper
  • License: MIT
  • Package Managers:

  • Project Name: EquivariantOperators.jl
  • Github URL: aced-differentiate/EquivariantOperators.jl
  • Category: math
  • Labels: math, symmetrized, lang-julia
  • License: MIT

  • Project Name: Closed-loop acceleration benchmarks
  • Github URL: aced-differentiate/closed-loop-acceleration-benchmarks
  • Category: materials-discovery
  • Labels: materials-discovery, active-learning, single-paper
  • License: MIT
  • Package Managers:


Add project: mlp (Proper orthogonal descriptors, POD)

Project details:

  • Project Name: mlp
  • Description: "Proper orthogonal descriptors for efficient and accurate interatomic potentials. Paper."
  • Github URL: https://github.com/cesmix-mit/mlp
  • Category: mlp
  • Labels: lang-julia
  • License: None
  • Package Managers: None

Additional context:

POD descriptor benchmark, as referenced in the corresponding paper https://doi.org/10.1016/j.jcp.2023.112030.

Original publication not linked in README as of 2023-12-03. Repo description not descriptive. Replace with custom description: Paper title plus link to paper.

Add project: Materials Project Charge Densities & mp-pyrho

Project details:

Additional context:

URL for the MP CD datasets project entry:

https://next-gen.materialsproject.org/ml/charge_densities

The CDs are really part of MP API now. But still worthwhile to emphasize with a separate entry that they exist in the first place.

Add project: Add all LAMMPS ML potentials

Community request, moved here from #10.

MD category: Add LAMMPS & all the lammps-xxx repo integrating a given potential with lammps; as well as OpenMM and TorchMD, both usable with ML potentials.

Subtasks OpenMM, TorchMD moved to issues #77, #78.

Add project: GT4SD

Project details:

  • Project Name: GT4SD
  • Github URL: GT4SD/gt4sd-core
  • Category: generative
  • Labels: generative, pre-trained, drug-discovery, rep-learn
  • License: MIT
  • Package Managers: pypi:gt4sd

Additional context:

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.