Skip to content

Commit 5a053c7

Browse files
committed
merge models.yml
2 parents 76c55d4 + c7bd4ab commit 5a053c7

197 files changed

Lines changed: 15005 additions & 1100 deletions

File tree

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

.pre-commit-config.yaml

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@
22
# pre-commit install
33
repos:
44
- repo: https://github.com/pre-commit/pre-commit-hooks
5-
rev: v5.0.0
5+
rev: v6.0.0
66
hooks:
77
- id: end-of-file-fixer
88
- id: mixed-line-ending
@@ -11,7 +11,7 @@ repos:
1111

1212
- repo: https://github.com/astral-sh/ruff-pre-commit
1313
# Ruff version.
14-
rev: v0.11.13
14+
rev: v0.15.5
1515
hooks:
1616
# Run the linter.
1717
- id: ruff-check
@@ -20,7 +20,7 @@ repos:
2020
- id: ruff-format
2121

2222
- repo: https://github.com/numpy/numpydoc
23-
rev: v1.8.0
23+
rev: v1.10.0
2424
hooks:
2525
- id: numpydoc-validation
2626
files: ^ml_peg/

README.md

Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -40,7 +40,7 @@ python3 -m pip install git+https://github.com/ddmms/ml-peg.git
4040

4141
## Features
4242

43-
Coming soon!
43+
More details coming soon!
4444

4545

4646
## Development
@@ -65,6 +65,12 @@ pytest -v # Discover and run all tests
6565
Please refer to the [online documentation](https://ddmms.github.io/ml-peg/developer_guide/index.html)
6666
for information about contributing new benchmarks and models.
6767

68+
### Tutorials
69+
70+
We encourage developers new to the ML-PEG framework to work through the detailed
71+
step-by-step guides provided by our Jupyter Notebook tutorials:
72+
- [Adding a New Benchmark](docs/source/tutorials/python/adding_benchmark.ipynb) [![badge](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/ddmms/ml-peg/blob/main/docs/source/tutorials/python/adding_benchmark.ipynb)
73+
6874

6975
## Docker/Podman images
7076

docs/source/developer_guide/index.rst

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -6,6 +6,7 @@ Developer guide
66
:maxdepth: 3
77

88
get_started
9+
llm_agent_integration
910
add_benchmarks
1011
add_category
1112
running
Lines changed: 79 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,79 @@
1+
=========================
2+
LLM-Assistant Integration
3+
=========================
4+
5+
When using LLM coding assistants (e.g. `Claude Code <https://code.claude.com>`_) for development or usage of this package, please see some tips and tricks here.
6+
7+
Docs on Context7
8+
----------------
9+
10+
ML-PEG documentation is indexed on `Context7 <https://context7.com>`_, which supplies up-to-date parsed documentation for the package as an MCP server.
11+
12+
**Library ID**: ``/ddmms/ml-peg``
13+
14+
Getting started
15+
+++++++++++++++
16+
17+
Install the MCP server, e.g. for Claude Code:
18+
19+
.. code-block:: text
20+
21+
claude mcp add context7 -- npx -y @upstash/context7-mcp
22+
23+
See `installation instructions <https://github.com/upstash/context7#installation>`_ for other tools as well.
24+
25+
Afterwards can ask the agent to load the library, or you can add this to your agent's standard instructions, e.g. `CLAUDE.md` file.
26+
27+
.. code-block:: text
28+
29+
@context7 load library /ddmms/ml-peg
30+
31+
Claude Code will use the ``/ddmms/ml-peg`` library ID to fetch relevant documentation and code examples.
32+
33+
Common Use Cases
34+
++++++++++++++++
35+
36+
Asking to query the documentation directly:
37+
38+
.. code-block:: text
39+
40+
@context7 How do I add a new benchmark calculation script?
41+
42+
Asking about further specifics:
43+
44+
.. code-block:: text
45+
46+
How do I structure a calculation script using pytest parametrization? use context7
47+
48+
Related Documentation
49+
---------------------
50+
51+
For related libraries on Context7:
52+
53+
- **mlipx** (``/basf/mlipx``): Base calculator abstraction library used by ML-PEG
54+
- **janus-core** (``/stfc/janus-core``: another back-end for ML-PEG
55+
56+
57+
API Keys and Rate Limits
58+
-------------------------
59+
60+
Context7 provides free access with rate limits at the time of writing.
61+
For higher rate limits and being able to submit new documentation sources to index:
62+
63+
1. Visit https://context7.com
64+
2. Create an account
65+
3. Generate an API key from the dashboard
66+
4. Configure your AI assistant with the API key
67+
68+
.. note::
69+
70+
ML-PEG is a public repository and does not require an API key for basic usage through Context7.
71+
72+
Keeping Documentation Updated
73+
-----------------------------
74+
75+
The ML-PEG library on Context7 is automatically updated when changes are pushed to the main branch. Documentation typically updates within 24 hours of changes being merged. If you notice outdated information you can click to refresh on the site, `you need to log in to do this`.
76+
77+
.. tip::
78+
79+
When contributing new benchmarks or features, ensure your docstrings and documentation updates are merged to main before using Context7 to help others understand your additions.

docs/source/index.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
Welcome to ML-PEG's documentation!
2-
========================================
2+
==================================
33

44
ML-PEG is an ML Performance and Extrapolation Guide.
55

Lines changed: 42 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,42 @@
1+
==========
2+
Conformers
3+
==========
4+
5+
ACONFL
6+
======
7+
8+
Summary
9+
-------
10+
11+
Performance in predicting relative conformer energies of 12 C12H26,
12+
16 C16H34 and 20 C20H42 conformers. Reference data from PNO-LCCSD(T)-F12/ AVQZ calculations.
13+
14+
Metrics
15+
-------
16+
17+
1. Conformer energy error
18+
19+
For each complex, the the relative energy is calculated by taking the difference in energy
20+
between the given conformer and the reference (zero-energy) conformer. This is
21+
compared to the reference conformer energy, calculated in the same way.
22+
23+
Computational cost
24+
------------------
25+
26+
Low: tests are likely to take minutes to run on CPU.
27+
28+
Data availability
29+
-----------------
30+
31+
Input structures:
32+
33+
* Conformational Energy Benchmark for Longer n-Alkane Chains
34+
Sebastian Ehlert, Stefan Grimme, and Andreas Hansen
35+
The Journal of Physical Chemistry A 2022 126 (22), 3521-3535
36+
DOI: 10.1021/acs.jpca.2c02439
37+
38+
Reference data:
39+
40+
* Same as input data
41+
* :math:`PNO-LCCSD(T)-F12/ AVQZ` level of theory: a local, explicitly
42+
correlated coupled cluster method.

docs/source/user_guide/benchmarks/index.rst

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,3 +12,7 @@ Benchmarks
1212
molecular_crystal
1313
molecular
1414
bulk_crystal
15+
lanthanides
16+
non_covalent_interactions
17+
tm_complexes
18+
conformers
Lines changed: 45 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,45 @@
1+
===========
2+
Lanthanides
3+
===========
4+
5+
Isomer complexes
6+
================
7+
8+
Summary
9+
-------
10+
11+
Performance in predicting relative isomer energies for lanthanide complexes
12+
compared to r2SCAN-3c DFT reference data.
13+
14+
15+
Metrics
16+
-------
17+
18+
1. Relative isomer energy MAE
19+
20+
Accuracy of relative isomer energy predictions.
21+
22+
For each complex, the relative isomer energies are computed with respect to the
23+
lowest-energy isomer in the r2SCAN-3c reference set and compared to the r2SCAN-3c
24+
relative energies reported in the reference dataset.
25+
26+
27+
Computational cost
28+
------------------
29+
30+
Low: tests are likely to take less than a minute to run on CPU.
31+
32+
33+
Data availability
34+
-----------------
35+
36+
Input structures:
37+
38+
* T. Rose, M. Bursch, J.-M. Mewes, and S. Grimme, Fast and Robust Modeling of
39+
Lanthanide and Actinide Complexes, Biomolecules, and Molecular Crystals with
40+
the Extended GFN-FF Model, Inorganic Chemistry 63 (2024) 19364-19374.
41+
42+
Reference data:
43+
44+
* Relative isomer energies from r2SCAN-3c (see Supporting Information of the
45+
above reference).

docs/source/user_guide/benchmarks/molecular.rst

Lines changed: 50 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -124,3 +124,53 @@ Reference data:
124124

125125
* Same as input data
126126
* DLPNO-CCSD(T)/CBS
127+
128+
129+
BMIM Cl RDF
130+
===========
131+
132+
Summary
133+
-------
134+
135+
Tests whether MLIPs incorrectly predict covalent bond formation between chloride
136+
anions (Cl⁻) and carbon atoms in 1-butyl-3-methylimidazolium (BMIM⁺) cations.
137+
Such Cl-C bonds should NOT form in the ionic liquid under normal conditions.
138+
139+
This benchmark runs NVT molecular dynamics simulations of BMIM Cl at
140+
353.15 K and analyses the Cl-C RDF to detect any unphysical bond formation.
141+
142+
143+
Metrics
144+
-------
145+
146+
1. Cl-C Bonds Formed
147+
148+
Binary metric indicating whether unphysical Cl-C bonds formed during the MD simulation.
149+
150+
The Cl-C RDF is computed from the MD trajectory. If the RDF shows a peak (g(r) > 0.1)
151+
at distances below 2.5 Å, this indicates bond formation and the model fails the test.
152+
153+
* 0 = no bonds formed (correct physical behaviour)
154+
* 1 = bonds formed (unphysical, model failure)
155+
156+
157+
Computational cost
158+
------------------
159+
160+
Medium: tests require running 10,000 steps of Langevin MD for a system of 10 ion
161+
pairs, which may take tens of minutes on GPU.
162+
163+
164+
Data availability
165+
-----------------
166+
167+
Input structures:
168+
169+
* Generated using molify from SMILES representations of BMIM⁺ (CCCCN1C=C[N+](=C1)C)
170+
and Cl⁻ ions, packed to experimental density of 1052 kg/m³ at 353.15 K.
171+
* Zills, F. molify: Molecular Structure Interface. Journal of Open Source Software
172+
10, 8829 (2025). https://doi.org/10.21105/joss.08829
173+
* Density from: Yang, F., Wang, D., Wang, X. & Liu, Z. Volumetric Properties of
174+
Binary and Ternary Mixtures of Bis(2-hydroxyethyl)ammonium Acetate with Methanol,
175+
N,N-Dimethylformamide, and Water at Several Temperatures. J. Chem. Eng. Data 62,
176+
3958-3966 (2017). https://doi.org/10.1021/acs.jced.7b00654

docs/source/user_guide/benchmarks/molecular_crystal.rst

Lines changed: 50 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -86,3 +86,53 @@ Reference data:
8686

8787
* Same as input data
8888
* PBE-D3(BJ)
89+
90+
91+
CPOSS209
92+
=========
93+
94+
Summary
95+
-------
96+
97+
Performance in predicting lattice energies of 209 organic molecular crystals from the CPOSS209 dataset.
98+
99+
100+
Metrics
101+
-------
102+
103+
1. Absolute lattice energy MAE
104+
105+
Accuracy of the absolute lattice energy predictions.
106+
107+
For each molecular crystal, lattice energy is calculated by taking the difference between the energy of the solid molecular crystal divided by the number of molecules it comprises,
108+
and the energy of the isolated molecule. This is compared to the reference lattice energy.
109+
110+
2. Relative lattice energy MAE
111+
112+
Accuracy of the reltive lattice energy predictions.
113+
114+
We compute the lattice energies in the same way as before, but this time we compute the relative error.
115+
The relative error is computed by identifying the most stable polymorphs and, for each crystal, subtracting the lattice energy of the most stable polymorph.
116+
This metric cares more about the ranking of the crystal polymorphs rather than reproducing the reference values of the lattice energies.
117+
118+
119+
Computational cost
120+
------------------
121+
122+
Low: tests should take a few minutes on a CPU
123+
124+
125+
Data availability
126+
-----------------
127+
128+
Input structures:
129+
130+
* Structures are optimized at the PBE+TS level.
131+
* Louise S. Price, Matteo Paloni, Matteo Salvalaglio, and Sarah L. Price
132+
Crystal Growth & Design 2025 25 (9), 3186-3209
133+
DOI: 10.1021/acs.cgd.5c00255
134+
135+
Reference data:
136+
137+
* Same as input data
138+
* wb97md3 with 1b CCSD(T) corrections

0 commit comments

Comments
 (0)