judftteam / best-of-atomistic-machine-learning Goto Github PK
View Code? Open in Web Editor NEWπ A ranked list of awesome atomistic machine learning projects βοΈπ§¬π.
License: Creative Commons Attribution Share Alike 4.0 International
π A ranked list of awesome atomistic machine learning projects βοΈπ§¬π.
License: Creative Commons Attribution Share Alike 4.0 International
Community request, moved here from #10.
Project details:
Project details:
Additional context:
OC20, OC22
Project details:
See below.
Additional context:
This is a tricky subject. Materials repositories. Should they be part of this list, or rather be in a separate "materials informatics" list or similar?
Yes: Other "static" datasets like QM9 are also listed here. Why? Because they are part of atomistic ML (AML) publications as much as the models on which they were trained. Same goes for publications that used materials repos like Materials Project, then.
No. This is a bigger undertaking. There are a lot of materials repositories. Which one should be added? Add only the homepagess as resources, or homepages and APIs as separate entries, or homepages and APIs and associated tools as separate entries?
A balance should be struck here. Adding all kinds of materials informatics and computational materials science tools in this AML list would dilute it. For example, pymatgen
and spglib
should not be listed here, but in such a dedicated list. So, strike a balance: Only add the minimal set of tools to access materials repo data via browser or via API. Two entries at most per repo as guideline.
List of projects to research
Note. Add only those that actually have a record of being used in published AML research.
List of projects to add to the list
Project details:
Additional context:
https://arxiv.org/abs/2307.04340
Paper with online demo, but no code. But the online demo can be used like a prompt-based crystal structure file generator.
Configuration Change:
Right now, labels have some issues.
Problem 1). Labels with emoji name
don't show up in Explanations section.
Custom labels without an image
but just a name
(emoji or string) are not shown in README section Explanations. That makes some emoji labels confusing. For example, β‘ for electrostatics
can only be guessed at without an "Explanation" entry.
Possible solutions.
name
emoji: replace with an image
emoji (an actual image file to be added to config/images
). That makes text less copy & paste-able.name
string labels. Then they are all self-explanatory.Update details:
Manually add the project's license info in projects.yaml
.
Project details:
This suggestion is not a single project, but several datasets and benchmark models aggregated via paperswithcode.
Initial datasets finds.
Initial tags finds.
Project details:
Additional context:
Original publication: https://arxiv.org/abs/2203.17003v2
Project details:
Additional context:
via https://twitter.com/MathieuEmile/status/1674809578611982338
Project details:
Additional context:
https://www.nature.com/articles/s41586-019-1335-8#data-availability
Project details:
Additional context:
Project details:
Additional context:
Original publication https://doi.org/10.1038/s41598-017-01251-z
Project details:
Additional context:
Original publication: https://arxiv.org/abs/2306.11421v1
Use manual description copied from README.md rather than automatic one, that one is too short.
Project details:
Additional context:
Arguable whether this community resource should be put here, as it is more about general atomistic modeling and simulation, rather than atomistic machine learning. But on the other hand, there is no such dedicated AML community forum, so this is an okay place to put AML question. And AML is a sub- or site field of atomistic modeling will just merge into the general field as time moves on, so why even make a distinction.
Project details:
Configuration Change:
Moved here from previous issue.
Problem 2). Classification by categories is not enough.
Change from categorical (one-dimensional) to tagging (multi-dimensional; here, tags are called 'labels') system. In this system, every list item has labels, and the category is only one of them, the 'main' or 'emphasized' one. Example from issue #10: Package equisolve would gain the labels math
, mlp
, general-tools
, and the category general-tools
is the category = emphasized label.
TODO:
Project details:
Additional context:
Original publication https://doi.org/10.1039/C7SC04934J
Project details:
Additional context:
Project details:
Additional context:
More general ML, computer vision. But paper also shows application example to SO(3)-equivariant molecules dataset.
Original publication: https://openreview.net/forum?id=WE4qe9xlnQw.
Project details:
Additional context:
https://doi.org/10.1038/s41524-023-01062-z
Live demo.
Project details:
Additional context:
Project details:
AML projects by https://wmd-group.github.io/
Project details:
Additional context:
The actual model is implemented in the ocp package, see README.
Project details:
Additional context:
https://www.pnas.org/doi/10.1073/pnas.1801181115
Note: This seems to be a reimplementation.
Configuration Change:
Anonymous community feedback 2023-06-13.
Improvement suggestions to the best-of-generator template.
math
, mlp
, general-tools
. Developer request.Project details:
Rather use the resource URL.
https://docs.materialsproject.org/services/mpcontribs
Why add this project in this atomistic ML list? Because of "MPContribsML", "Benchmark datasets for Machine Learning".
Project details:
TODO: add missing packages, resources from there here as well.
Update details:
Additional context:
The project is listed under General Tools. I think a better fit would be the Active Learning section since a core functionality is building force fields using AL.
Project details:
Additional context:
A predecessor to e3nn. Only of historical significance, in that respect.
Project details:
Additional context:
https://www.nature.com/articles/s41524-022-00729-3
Comparison of Word2Vec, Atom2Vec, Mat2Vec, SkipAtom.
https://blog.materialis.ai/distributed-representations-of-atoms.html
Project details:
AML projects by https://github.com/materialsintelligence
Community request, moved here from #10.
Project details:
Project details:
Project details:
Additional context:
Original publication: https://arxiv.org/abs/2305.15961v1
(technically, the original publication is http://arxiv.org/abs/2211.13236, but the two papers link to repos that do not exist anymore or have been updated since. therefore, assume the newer publication is the closest one to current state of code.)
Project details:
Additional context:
Dataset used in original publication of #69 .
Project details:
Add all missing projects from Wasmer's research library, go to tags (lower left): library
, with-code
, with-data
, database
, dataset
. For each entry, go to Attachments (upper right) > repository.
Project details:
Additional context:
https://blog.materialis.ai/predicting-thermoelectric-transport-properties.html
Project details:
Additional context:
Original publication: http://arxiv.org/abs/2307.06879.
Project details:
Additional context:
Project details:
footer.md
)Additional context:
Whatβs MateriApps
A Portal Site of Materials Science Simulation for Computational Materials Science Researchers, Theoreticians, Experimentalists, and Computer Scientists
Project details:
Are compositional feature -- based models atomistic models? In this case, I decide for "Yes".
Project details:
Additional context:
POD descriptor benchmark, as referenced in the corresponding paper https://doi.org/10.1016/j.jcp.2023.112030.
Original publication not linked in README as of 2023-12-03. Repo description not descriptive. Replace with custom description: Paper title plus link to paper.
Configuration Change:
See correspondent upstream issue.
Project details:
Additional context:
URL for the MP CD datasets project entry:
https://next-gen.materialsproject.org/ml/charge_densities
The CDs are really part of MP API now. But still worthwhile to emphasize with a separate entry that they exist in the first place.
Additional context:
Projects by DIVE Lab.
Project details:
Dataset.
Package.
Project details:
Project details:
Additional context:
A declarative, efficient, and flexible JavaScript library for building user interfaces.
π Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. πππ
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google β€οΈ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.