Skip to content

Commit b9e8c4b

Browse files
author
OpenClaw
committed
doc: polish DPA3 text and move edge-index clarification
Authored by OpenClaw (model: glm-5)
1 parent 7790cd4 commit b9e8c4b

1 file changed

Lines changed: 18 additions & 17 deletions

File tree

doc/model/dpa3.md

Lines changed: 18 additions & 17 deletions
Original file line numberDiff line numberDiff line change
@@ -4,12 +4,12 @@
44
**Supported backends**: PyTorch {{ pytorch_icon }}, JAX {{ jax_icon }}, DP {{ dpmodel_icon }}
55
:::
66

7-
DPA3 is an advanced interatomic potential leveraging the message passing architecture.
8-
Designed as a large atomic model (LAM), DPA3 is tailored to integrate and simultaneously train on datasets from various disciplines,
9-
encompassing diverse chemical and materials systems across different research domains.
10-
Its model design ensures exceptional fitting accuracy and robust generalization both within and beyond the training domain.
11-
Furthermore, DPA3 maintains energy conservation and respects the physical symmetries of the potential energy surface,
12-
making it a dependable tool for a wide range of scientific applications.
7+
DPA3 is an advanced interatomic potential based on message passing.
8+
As a large atomic model (LAM), it is designed to integrate and jointly train on datasets from different domains,
9+
covering diverse chemical and materials systems.
10+
Its architecture provides high fitting accuracy and robust generalization both within and beyond the training domain.
11+
DPA3 also preserves energy conservation and the physical symmetries of the potential energy surface,
12+
making it a reliable model for a wide range of scientific applications.
1313

1414
Reference: [DPA3 paper](https://arxiv.org/abs/2506.01686).
1515

@@ -44,20 +44,22 @@ The vertex features are updated through self-message and symmetrization:
4444
```
4545

4646
**For $G^{(k)}$ with $k > 1$:**
47-
The vertex feature of $G^{(k)}$ is identical to the edge feature of $G^{(k-1)}$. This identity eliminates redundant storage:
48-
Here $(\alpha,\beta)$ denotes the edge in $G^{(k-1)}$ corresponding to vertex $\alpha$ in $G^{(k)}$.
47+
The vertex feature of $G^{(k)}$ is identical to the edge feature of $G^{(k-1)}$:
4948

5049
```math
5150
\mathbf{v}_\alpha^{(k,l)} = \mathbf{e}_{\alpha\beta}^{(k-1,l)}
5251
```
5352

53+
where $(\alpha,\beta)$ denotes the edge in $G^{(k-1)}$ corresponding to vertex $\alpha$ in $G^{(k)}$. This identity eliminates redundant storage.
54+
5455
The edge features are updated based on messages from connected vertices:
5556

5657
```math
5758
\mathbf{e}_{\alpha\beta}^{(k,l+1)} = \mathbf{e}_{\alpha\beta}^{(k,l)} + \text{Update}^{(k)}\left(\mathbf{e}_{\alpha\beta}^{(k,l)}, \mathbf{v}_\alpha^{(k,l)}, \mathbf{v}_\beta^{(k,l)}\right)
5859
```
60+
The same update mechanism also applies to $G^{(1)}$ edge features $\mathbf{e}_{\alpha\beta}^{(1,l)}$. Therefore, these features evolve across layers and, via the $\mathbf{v}^{(2,l)}$-$\mathbf{e}^{(1,l)}$ identity, drive the updates on $G^{(2)}$.
61+
5962

60-
The same update mechanism applies to $G^{(1)}$ edge features $\mathbf{e}_{\alpha\beta}^{(1,l)}$, so they also evolve across layers and, via the $\mathbf{v}^{(2,l)}$-$\mathbf{e}^{(1,l)}$ identity, drive the updates on $G^{(2)}$.
6163

6264
### Descriptor Construction
6365

@@ -105,13 +107,12 @@ DPA3 uses LiGS order $K=2$ as the default configuration, which was found effecti
105107

106108
## Hyperparameter tests
107109

108-
We systematically conducted DPA3 training on six representative DFT datasets (available at [AIS-Square](https://www.aissquare.com/datasets/detail?pageType=datasets&name=DPA3_hyperparameter_search&id=316)):
109-
metallic systems (`Alloy`, `AlMgCu`, `W`), covalent material (`Boron`), molecular system (`Drug`), and liquid water (`Water`).
110-
Under consistent training conditions (0.5M training steps, batch_size "auto:128"),
111-
we rigorously evaluated the impacts of some critical hyperparameters on validation accuracy.
110+
We systematically trained DPA3 on six representative DFT datasets (available at [AIS-Square](https://www.aissquare.com/datasets/detail?pageType=datasets&name=DPA3_hyperparameter_search&id=316)): metallic systems (`Alloy`, `AlMgCu`, `W`), a covalent material (`Boron`), a molecular system (`Drug`), and liquid water (`Water`).
111+
Under consistent training conditions (0.5M training steps, `batch_size` = `auto:128`),
112+
we evaluated the impact of key hyperparameters on validation accuracy.
112113

113-
The comparative analysis focused on average RMSEs (Root Mean Square Error) for both energy, force and virial predictions across all six systems,
114-
with results tabulated below to guide scenario-specific hyperparameter selection:
114+
The comparative analysis focused on average RMSEs (Root Mean Square Error) for energy, force, and virial predictions across the six systems.
115+
The results are summarized below to guide scenario-specific hyperparameter selection:
115116

116117
| Model | comment | nlayers | n_dim | e_dim | a_dim | e_sel | a_sel | start_lr | stop_lr | loss prefactors | rmse_e (meV/atom) | rmse_f (meV/Å) | rmse_v (meV/atom) | Training wall time (h) |
117118
| ---------------- | --------------- | ------- | ------- | ------ | ----- | ------- | ------ | -------- | -------- | ------------------------- | ----------------- | -------------- | ----------------- | ---------------------- |
@@ -123,10 +124,10 @@ with results tabulated below to guide scenario-specific hyperparameter selection
123124
| | Large sel | 6 | 256 | 128 | 32 | **154** | **48** | 1e-3 | 3e-5 | 0.2\|20, 100\|60, 0.02\|1 | 4.76 | 78.4 | 40.2 | 31.8 |
124125
| DPA2-L6 (medium) | Default | 6 | - | - | - | - | - | 1e-3 | 3.51e-08 | 0.02\|1, 1000\|1, 0.02\|1 | 12.12 | 109.3 | 83.1 | 12.2 |
125126

126-
The loss prefactors (0.2|20, 100|60, 0.02|1) correspond to (`start_pref_e`|`limit_pref_e`, `start_pref_f`|`limit_pref_f`, `start_pref_v`|`limit_pref_v`) respectively.
127+
The loss prefactors (0.2|20, 100|60, 0.02|1) correspond to (`start_pref_e`|`limit_pref_e`, `start_pref_f`|`limit_pref_f`, `start_pref_v`|`limit_pref_v`), respectively.
127128
Virial RMSEs were averaged exclusively for systems containing virial labels (`Alloy`, `AlMgCu`, `W`, and `Boron`).
128129

129-
Note that we set `float32` in all DPA3 models, while `float64` in other models by default.
130+
Note that all DPA3 models use `float32`, while other models use `float64` by default.
130131

131132
## Requirements of installation from source code {{ pytorch_icon }} {{ paddle_icon }}
132133

0 commit comments

Comments
 (0)