- Support for reading text-generation profiles from the Bitloops daemon config schema, including shared runtimes under `[inference.runtimes.<name>]`.
- Validation for legacy profile fields, unsupported profile keys, missing runtime references, invalid numeric values, and Ollama chat URLs that do not target `/api/chat`.

### Changed

- `bitloops-inference` now reads `task`, `driver`, and `runtime` from daemon-style profile definitions instead of the older per-profile `kind`, `provider_name`, and `timeout_secs` fields.
- Non-`text_generation` inference profiles are ignored during config loading and profile discovery, so CLI commands only expose runnable text-generation profiles.
- Documentation and test fixtures now use the daemon config layout and string-based temperature examples with environment interpolation.

### Fixed

- Request timeouts are now resolved from the referenced runtime’s `request_timeout_secs` value instead of per-profile timeout settings.
- Config validation now reports when a config file does not define any text-generation profiles.

## [0.1.0] - 2026-04-13

### Added

- Initial `bitloops-inference` Rust workspace with a shared protocol crate and a stdio runtime for out-of-process Bitloops inference.
- Protocol v1 request and response types for `describe`, `infer`, and `shutdown`, using line-delimited JSON over `stdin` and `stdout`.
- `run`, `validate-config`, and `describe-profile` CLI commands for running the runtime and inspecting configured inference profiles.
- OpenAI Chat Completions and Ollama Chat providers with normalised text and `json_object` responses, usage reporting, finish reasons, and provider-specific HTTP error handling.
- TOML-based profile configuration with environment-variable interpolation and default inference settings for temperature and output token limits.
- Mocked provider integration tests, child-process protocol-loop tests, hosted-runner CI, and release automation for macOS, Linux, and Windows artefacts.

### Fixed

- Intel macOS release builds now use the correct hosted runner label in the release workflow.
- Release packaging and GitHub Release artefact publication now generate the expected target-specific archives and clean up stale assets.
-Profiles are defined under `[inference.profiles.<name>]`.
+`bitloops-inference` reads the Bitloops daemon inference config. Text-generation profiles live under `[inference.profiles.<name>]` and reference a runtime from `[inference.runtimes.<name>]`.
String fields support `${ENV_VAR}` interpolation. Missing environment variables fail validation immediately. Non-text-generation profiles in the same daemon config are ignored by `bitloops-inference`.
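The interpolation rule described here can be sketched in a few lines of Rust. This is a minimal illustration under stated assumptions, not the project's actual implementation: the function name `interpolate`, the error strings, and the `OLLAMA_HOST` variable are hypothetical, and the lookup is passed in as a closure so the example does not depend on the real process environment.

```rust
use std::collections::HashMap;

/// Expands every `${NAME}` in `input` using `lookup`.
/// Returns an error for the first missing variable or unterminated placeholder,
/// which mirrors the "fail validation immediately" behaviour described above.
fn interpolate<F>(input: &str, lookup: F) -> Result<String, String>
where
    F: Fn(&str) -> Option<String>,
{
    let mut out = String::with_capacity(input.len());
    let mut rest = input;
    while let Some(start) = rest.find("${") {
        // Copy the literal text that precedes the placeholder.
        out.push_str(&rest[..start]);
        let after = &rest[start + 2..];
        let end = after
            .find('}')
            .ok_or_else(|| format!("unterminated placeholder in {input:?}"))?;
        let name = &after[..end];
        let value = lookup(name)
            .ok_or_else(|| format!("environment variable {name} is not set"))?;
        out.push_str(&value);
        rest = &after[end + 1..];
    }
    out.push_str(rest);
    Ok(out)
}

fn main() {
    // A fixed map stands in for the process environment (hypothetical variable name).
    // Real code could pass `|name| std::env::var(name).ok()` instead.
    let mut env = HashMap::new();
    env.insert("OLLAMA_HOST".to_string(), "http://localhost:11434".to_string());
    let lookup = |name: &str| env.get(name).cloned();

    println!("{:?}", interpolate("${OLLAMA_HOST}/api/chat", lookup));
    println!("{:?}", interpolate("${MISSING_KEY}", lookup));
}
```

Taking the lookup as a closure keeps the function deterministic under test, since a missing variable can be simulated without touching the environment of the running process.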

-## Supported provider kinds
+## Supported drivers

- `openai_chat_completions`
- `ollama_chat`
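As an illustration of how the two driver names listed above might be mapped to a closed set of values, here is a small Rust sketch. The `Driver` enum, its variant names, and the error message are assumptions made for this example; only the two string identifiers come from the documentation.

```rust
use std::str::FromStr;

/// Hypothetical enum covering the two documented driver identifiers.
#[derive(Debug, PartialEq, Clone, Copy)]
enum Driver {
    OpenAiChatCompletions,
    OllamaChat,
}

impl FromStr for Driver {
    type Err = String;

    /// Accepts exactly the documented names and rejects everything else.
    fn from_str(s: &str) -> Result<Self, Self::Err> {
        match s {
            "openai_chat_completions" => Ok(Driver::OpenAiChatCompletions),
            "ollama_chat" => Ok(Driver::OllamaChat),
            other => Err(format!("unsupported driver: {other}")),
        }
    }
}

fn main() {
    for name in ["openai_chat_completions", "ollama_chat", "something_else"] {
        println!("{name} -> {:?}", name.parse::<Driver>());
    }
}
```

Parsing into an enum at load time means an unknown driver string is rejected once, during validation, rather than at the point where a request is sent.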
@@ -83,19 +86,19 @@ Example responses:
Run config validation first:

```bash
-cargo run -p bitloops-inference -- validate-config --config ./config.toml
+cargo run -p bitloops-inference -- validate-config --config ./bitloops-daemon-config.toml
```

Describe a profile:

```bash
-cargo run -p bitloops-inference -- describe-profile --config ./config.toml --profile ollama_local
+cargo run -p bitloops-inference -- describe-profile --config ./bitloops-daemon-config.toml --profile ollama_local
```

Start the stdio runtime:

```bash
-cargo run -p bitloops-inference -- run --config ./config.toml --profile ollama_local
+cargo run -p bitloops-inference -- run --config ./bitloops-daemon-config.toml --profile ollama_local
```

You can then write protocol lines to `stdin` manually or from another process.
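For the "from another process" case, the line-delimited framing could look roughly like the Rust sketch below. It only shows the framing (one message per line, flushed after each write); the helper names `write_message` and `read_message` are hypothetical, and the JSON payload is a placeholder because the actual protocol v1 message fields are not shown in this document. In-memory buffers stand in for the child's `stdin` and `stdout`.

```rust
use std::io::{self, BufRead, Write};

/// Writes one message as a single line and flushes so the peer sees it immediately.
/// Rejects payloads containing a raw newline, since that would split the message.
fn write_message<W: Write>(writer: &mut W, payload: &str) -> io::Result<()> {
    if payload.contains('\n') {
        return Err(io::Error::new(
            io::ErrorKind::InvalidInput,
            "payload must be a single line",
        ));
    }
    writer.write_all(payload.as_bytes())?;
    writer.write_all(b"\n")?;
    writer.flush()
}

/// Reads the next non-empty line, or returns None at end of input.
fn read_message<R: BufRead>(reader: &mut R) -> io::Result<Option<String>> {
    let mut line = String::new();
    loop {
        line.clear();
        if reader.read_line(&mut line)? == 0 {
            return Ok(None);
        }
        let trimmed = line.trim_end_matches(|c: char| c == '\r' || c == '\n');
        if !trimmed.is_empty() {
            return Ok(Some(trimmed.to_string()));
        }
    }
}

fn main() -> io::Result<()> {
    // A Vec<u8> plays the role of the pipe to the runtime.
    let mut wire: Vec<u8> = Vec::new();
    // Placeholder payload: not a real protocol v1 message.
    write_message(&mut wire, r#"{"placeholder":"request"}"#)?;

    let mut reader = io::Cursor::new(wire);
    while let Some(line) = read_message(&mut reader)? {
        println!("received: {line}");
    }
    Ok(())
}
```

With a real child process, the same helpers would presumably be given the handles from `std::process::Command` configured with piped `stdin` and `stdout`, wrapping the output side in a `BufReader`.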