You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The same update mechanism also applies to $G^{(1)}$ edge features $\mathbf{e}_{\alpha\beta}^{(1,l)}$. Therefore, these features evolve across layers and, via the $\mathbf{v}^{(2,l)}$-$\mathbf{e}^{(1,l)}$ identity, drive the updates on $G^{(2)}$.
61
+
59
62
60
-
The same update mechanism applies to $G^{(1)}$ edge features $\mathbf{e}_{\alpha\beta}^{(1,l)}$, so they also evolve across layers and, via the $\mathbf{v}^{(2,l)}$-$\mathbf{e}^{(1,l)}$ identity, drive the updates on $G^{(2)}$.
61
63
62
64
### Descriptor Construction
63
65
@@ -105,13 +107,12 @@ DPA3 uses LiGS order $K=2$ as the default configuration, which was found effecti
105
107
106
108
## Hyperparameter tests
107
109
108
-
We systematically conducted DPA3 training on six representative DFT datasets (available at [AIS-Square](https://www.aissquare.com/datasets/detail?pageType=datasets&name=DPA3_hyperparameter_search&id=316)):
109
-
metallic systems (`Alloy`, `AlMgCu`, `W`), covalent material (`Boron`), molecular system (`Drug`), and liquid water (`Water`).
110
-
Under consistent training conditions (0.5M training steps, batch_size "auto:128"),
111
-
we rigorously evaluated the impacts of some critical hyperparameters on validation accuracy.
110
+
We systematically trained DPA3 on six representative DFT datasets (available at [AIS-Square](https://www.aissquare.com/datasets/detail?pageType=datasets&name=DPA3_hyperparameter_search&id=316)): metallic systems (`Alloy`, `AlMgCu`, `W`), a covalent material (`Boron`), a molecular system (`Drug`), and liquid water (`Water`).
111
+
Under consistent training conditions (0.5M training steps, `batch_size` = `auto:128`),
112
+
we evaluated the impact of key hyperparameters on validation accuracy.
112
113
113
-
The comparative analysis focused on average RMSEs (Root Mean Square Error) for both energy, force and virial predictions across all six systems,
114
-
with results tabulated below to guide scenario-specific hyperparameter selection:
114
+
The comparative analysis focused on average RMSEs (Root Mean Square Error) for energy, force, and virial predictions across the six systems.
115
+
The results are summarized below to guide scenario-specific hyperparameter selection:
115
116
116
117
| Model | comment | nlayers | n_dim | e_dim | a_dim | e_sel | a_sel | start_lr | stop_lr | loss prefactors | rmse_e (meV/atom) | rmse_f (meV/Å) | rmse_v (meV/atom) | Training wall time (h) |
The loss prefactors (0.2|20, 100|60, 0.02|1) correspond to (`start_pref_e`|`limit_pref_e`, `start_pref_f`|`limit_pref_f`, `start_pref_v`|`limit_pref_v`) respectively.
127
+
The loss prefactors (0.2|20, 100|60, 0.02|1) correspond to (`start_pref_e`|`limit_pref_e`, `start_pref_f`|`limit_pref_f`, `start_pref_v`|`limit_pref_v`), respectively.
127
128
Virial RMSEs were averaged exclusively for systems containing virial labels (`Alloy`, `AlMgCu`, `W`, and `Boron`).
128
129
129
-
Note that we set `float32` in all DPA3 models, while `float64` in other models by default.
130
+
Note that all DPA3 models use `float32`, while other models use `float64` by default.
130
131
131
132
## Requirements of installation from source code {{ pytorch_icon }} {{ paddle_icon }}
0 commit comments