Skip to content

Commit d808684

Browse files
timduignanclaude
andcommitted
README speed bullet: v1 vs v2-teqabfhg table from BENCH
Replaces the standalone v2-teqabfhg ms numbers with a v1-vs-v2 table at 100/1k/5k/10k atoms. v2 is slower at small sizes (electrostatics overhead) and faster at large sizes; crossover around 5k atoms, 1.46x speedup at 10k. v1 runs with backbone-only compile (its best available), v2 with full-model compile thanks to the port of orbital-materials/orb#3074. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
1 parent 6ae0776 commit d808684

1 file changed

Lines changed: 11 additions & 1 deletion

File tree

README.md

Lines changed: 11 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,17 @@ Alternatively, you can use Docker to run orb-models; [see instructions below](#d
2525

2626
Adding learned electrostatics (LES) to OrbMol-v2 costs essentially nothing on speed but slashes the error.
2727

28-
* **Speed.** With full-model `torch.compile`, forward-pass time on a single 80 GB GPU is ~30 ms at 100 atoms, ~42 ms at 1k atoms, ~116 ms at 5k atoms, and ~191 ms at 10k atoms (periodic systems). The no-per-atom-spin-head architecture is 1–12% faster than the previous LatentSpinHead-having v2 development checkpoint at every system size; single-graph inference is memory-capped around 10k atoms on 80 GB.
28+
* **Speed.** Adding learnable electrostatics on top of OrbMol-v1 introduces a Coulomb / PME path with fixed overhead per forward, but `model.compile(...)` now wraps the **full** regressor (heads + Coulomb included) thanks to the changes ported from `orbital-materials/orb#3074`. The net result on a single 80 GB GPU (forward time on periodic random crystals):
29+
30+
| Atoms | OrbMol-v1 (backbone-compile) | OrbMol-v2-teqabfhg (full-compile) |
31+
|---:|---:|---:|
32+
| 100 | 17 ms | 30 ms |
33+
| 1,000 | 37 ms | 42 ms |
34+
| 5,000 | 124 ms | 116 ms |
35+
| 10,000 | 278 ms | **191 ms** |
36+
| 50,000+ | OOM | OOM |
37+
38+
At small system sizes the electrostatics overhead dominates and v2 is slower; the crossover is around 5k atoms, after which v2 pulls ahead, reaching a **1.46× speedup at 10k atoms**. Single-graph inference is memory-capped around 10k atoms on 80 GB.
2939
* **Accuracy on GSCDB138** (5,000+ reaction energies covering noncovalent interactions, thermochemistry, isomerization, transition-metal chemistry, etc.; excluding reactions involving single-atom species). OrbMol-v2's overall Normalized Error Ratio drops from **6.05 → 1.62** (3.7× lower, comparable to a good DFT functional). The improvement concentrates in categories that explicit long-range electrostatics actually fixes:
3040
* Noncovalent interactions (NC): 5.96 → 1.66 (3.6× lower)
3141
* Thermochemistry (TC): 11.68 → 1.83 (6.4× lower)

0 commit comments

Comments
 (0)