You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+4-4Lines changed: 4 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -103,7 +103,7 @@ and sets this in relation to the GPU's speed of light (SOL), i.e., the fastest p
103
103
104
104
### Inspecting the logs
105
105
After 50 steps, the training will finish, and save the final model to `model.safetensors`. In addition, a log file will be created,
106
-
which contains the training log in JSON format. We can visualize the log using the [plot_training_run.py](scripts/plot_training_run.py) utility script:`
106
+
which contains the training log in JSON format. We can visualize the log using the [plot_training_run.py](scripts/plot_training_run.py) utility script:
107
107
```shell
108
108
uv run scripts/plot_training_run.py log.json
109
109
```
@@ -281,14 +281,14 @@ For the optimizer state, this will slow down the optimizer step drastically (mem
281
281
## Python bindings
282
282
While it is nice to demonstrate training in pure C++/Cuda, there are scenarios where it is desirable to use Python for training, e.g., when using an alternative learning-rate schedule.
283
283
284
-
The Python bindings are provided in the `src/bindings` directory, and can be built using the
284
+
The Python bindings are provided in the `src/binding` directory, and can be built using the
285
285
`_pyllmq` target. The library can be built manually (`-DPYTHON_BINDING=ON`), or directly into a wheel file
286
286
using `uv build --wheel`.
287
287
The [demo.py](scripts/demo.py) script provides an example of how to use the bindings. Running it with `uv run pyllmq-demo` will trigger the wheel build automatically.
288
288
289
289
Pre-built wheels are available from [GitHub Releases](https://github.com/IST-DASLab/llmq/releases) for convenience.
290
-
Download the latest `.whl` file and install it with `uv pip install 'pyllmq-0.2.0-cp312-abi3-linux_x86_64.whl[scripts]'`,
291
-
or run example scripts directly: `uv run --with 'pyllmq-0.2.0+cu128-cp312-abi3-linux_x86_64.whl[scripts]' pyllmq-demo`, replacing the file name as appropriate. The `[scripts]` extra installs additional packages that aren't strictly required for pyllmq, but are used in the utility scripts, such as `datasets` and `matplotlib`.
290
+
Download the latest `.whl` file and install it with `uv pip install 'pyllmq-0.3.2-cp312-abi3-linux_x86_64.whl[scripts]'`,
291
+
or run example scripts directly: `uv run --with 'pyllmq-0.3.2+cu128-cp312-abi3-linux_x86_64.whl[scripts]' pyllmq-demo`, replacing the file name as appropriate. The `[scripts]` extra installs additional packages that aren't strictly required for pyllmq, but are used in the utility scripts, such as `datasets` and `matplotlib`.
292
292
The wheels are built against CUDA 12.8 and 13.0 and support compute capabilities 89, 90, 100f, and 120f.
293
293
294
294
By design, the bindings expose only coarse-grained operations; that is, the minimum unit
0 commit comments