You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: doc/model/dpa2.md
+36Lines changed: 36 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,6 +8,42 @@ The DPA-2 model implementation. See [DPA-2 paper](https://doi.org/10.1038/s41524
8
8
9
9
Training example: `examples/water/dpa2/input_torch_medium.json`, see [README](../../examples/water/dpa2/README.md) for inputs in different levels.
10
10
11
+
## Theory
12
+
13
+
DPA-2 is an attention-based descriptor designed to learn expressive local atomic representations while preserving the physical symmetries required by interatomic potentials.
14
+
15
+
### Local environment and representation
16
+
17
+
For each central atom $\alpha$, neighbors $\beta \in \mathcal{N}(\alpha)$ are selected within a cutoff radius. DPA-2 encodes each local environment through geometric features (relative coordinates and derived invariants) and element/type information.
18
+
19
+
The descriptor is built hierarchically:
20
+
21
+
1.**Initial embedding**: geometric and type features are projected into latent channels.
22
+
1.**Attention-based interaction**: stacked attention layers model neighbor-neighbor and center-neighbor correlations in the local environment.
23
+
1.**Output descriptor**: atom-wise latent features after the final layer are used as descriptor outputs for downstream fitting/model components.
24
+
25
+
### Attention-based message passing
26
+
27
+
DPA-2 uses attention to aggregate neighbor information with data-dependent weights. Conceptually, each layer computes:
where $\mathbf{h}$ denotes latent node features and $\mathbf{g}_{\alpha\beta}$ denotes geometry-conditioned pair features. Residual updates enable stable deep stacking.
34
+
35
+
### Physical symmetries
36
+
37
+
DPA-2 is constructed to satisfy key symmetry requirements of atomistic modeling:
38
+
39
+
1.**Translational invariance**: only relative coordinates are used.
40
+
1.**Rotational behavior**: internal geometric constructions are designed so that final scalar descriptor channels used downstream are rotationally invariant.
41
+
1.**Permutational invariance**: atoms of the same species are treated identically under permutation (re-labeling) operations.
42
+
43
+
### Multi-task training context
44
+
45
+
DPA-2 is commonly used in a multi-task setting. The descriptor is shared, while task-specific heads/objectives are handled downstream. See [Multi-task training](../train/multi-task-training.md) for framework details.
46
+
11
47
## Requirements of installation {{ pytorch_icon }}
12
48
13
49
If one wants to run the DPA-2 model on LAMMPS, the customized OP library for the Python interface must be installed when [freezing the model](../freeze/freeze.md).
0 commit comments