You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: doc/env.md
+8-6Lines changed: 8 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -95,17 +95,18 @@ List of customized OP plugin libraries to load, such as `/path/to/plugin1.so:/pa
95
95
{{ pytorch_icon }} Enable the built-in PyTorch Kineto profiler for the PyTorch C++ (inference) backend.
96
96
97
97
**Type**: string (output file stem)
98
+
98
99
**Default**: unset (disabled)
99
100
100
-
When set to a non-empty value, profiling is enabled for the lifetime of the loaded PyTorch model (e.g. during LAMMPS runs). A JSON trace file is written on finish. The final file name is constructed as:
101
+
When set to a non-empty value, profiling is enabled for the lifetime of the loaded PyTorch model (e.g. during LAMMPS runs). A JSON trace file is created on finish. The final file name is constructed as:
101
102
102
-
-`<ENV_VALUE>_gpu<ID>.json` if running on GPU (multi-GPU safe: the CUDA device id is appended)
103
+
-`<ENV_VALUE>_gpu<ID>.json` if running on GPU
103
104
-`<ENV_VALUE>.json` if running on CPU
104
105
105
-
The trace is compatible with [Chrome trace viewer](https://ui.perfetto.dev/) (alternatively chrome://tracing) and PyTorch profiler tooling. It includes:
106
+
The trace can be examined with [Chrome trace viewer](https://ui.perfetto.dev/) (alternatively chrome://tracing). It includes:
106
107
107
-
- CPU operator activities (always)
108
-
- CUDA activities (if GPU available)
108
+
- CPU operator activities
109
+
- CUDA activities (if available)
109
110
110
111
Example:
111
112
@@ -119,4 +120,5 @@ Tips:
119
120
120
121
- Large runs can generate sizable JSON files; consider limiting numbers of MD steps, like 20.
121
122
- Currently this feature only supports single process, or multi-process runs where each process uses a distinct GPU on the same node.
0 commit comments