13 Apr 17:14

github-actions

d332b77

v3.0.0-beta.16 Pre-release

Pre-release

3.0.0-beta.16 (2024-04-13)

Bug Fixes

fallback to general chat wrapper (#197) (7878c8a)

Features

inspect gpu command: print device names (#198) (5ca33c7)
inspect gpu command: print env info (#202) (d332b77)
download models using the CLI (#191) (b542b53)
interactively select a model from CLI commands (#191) (b542b53)
change the default log level to warn (#191) (b542b53)
token biases (#196) (3ad4494)

Shipped with llama.cpp release b2665

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

Assets 2

04 Apr 20:52

github-actions

v3.0.0-beta.15

6267778

v3.0.0-beta.15 Pre-release

Pre-release

3.0.0-beta.15 (2024-04-04)

Bug Fixes

create a context with no parameters (#188) (6267778)
improve chat wrappers tokenization (#182) (35e6f50)
use the new llama.cpp CUDA flag (#182) (35e6f50)
adapt to breaking llama.cpp changes (#183) (6b012a6)

Features

automatically adapt to current free VRAM state (#182) (35e6f50)
inspect gguf command (#182) (35e6f50)
inspect measure command (#182) (35e6f50)
readGgufFileInfo function (#182) (35e6f50)
GGUF file metadata info on LlamaModel (#182) (35e6f50)
JinjaTemplateChatWrapper (#182) (35e6f50)
use the tokenizer.chat_template header from the gguf file when available - use it to find a better specialized chat wrapper or use JinjaTemplateChatWrapper with it as a fallback (#182) (35e6f50)
simplify generation CLI commands: chat, complete, infill (#182) (35e6f50)
Windows on Arm prebuilt binary (#181) (f3b7f81)

Shipped with llama.cpp release b2608

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

Assets 2

21 Mar 20:18

github-actions

v2.8.9

6b012a6

v2.8.9

2.8.9 (2024-03-21)

Bug Fixes

adapt to breaking llama.cpp changes (#183) (6b012a6)

Assets 2

16 Mar 22:46

github-actions

v3.0.0-beta.14

315a3eb

v3.0.0-beta.14 Pre-release

Pre-release

3.0.0-beta.14 (2024-03-16)

Bug Fixes

DisposedError was thrown when calling .dispose() (#178) (315a3eb)
adapt to breaking llama.cpp changes (#178) (315a3eb)

Features

async model and context loading (#178) (315a3eb)
automatically try to resolve Failed to detect a default CUDA architecture CUDA compilation error (#178) (315a3eb)
detect cmake binary issues and suggest fixes on detection (#178) (315a3eb)

Shipped with llama.cpp release b2440

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

Assets 2

03 Mar 22:24

github-actions

v3.0.0-beta.13

5a70576

v3.0.0-beta.13 Pre-release

Pre-release

3.0.0-beta.13 (2024-03-03)

Bug Fixes

adapt to llama.cpp breaking change (#175) (5a70576)
return user-defined llama tokens (#175) (5a70576)

Features

gguf parser (#168) (bcaab4f)
use the best compute layer available by default (#175) (5a70576)
more guardrails to prevent loading an incompatible prebuilt binary (#175) (5a70576)
inspect command (#175) (5a70576)
GemmaChatWrapper (#175) (5a70576)
TemplateChatWrapper (#175) (5a70576)

Shipped with llama.cpp release b2329

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

Assets 2

24 Feb 22:46

github-actions

v3.0.0-beta.12

fa6cf2e

v3.0.0-beta.12 Pre-release

Pre-release

3.0.0-beta.12 (2024-02-24)

Bug Fixes

adapt to llama.cpp breaking changes (#166) (7450aae)
asset links (#170) (d841fff)

Features

Vulkan support (#171) (d161bcd)

Shipped with llama.cpp release b2254

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

Assets 2

23 Feb 12:26

github-actions

v2.8.8

d841fff

v2.8.8

2.8.8 (2024-02-23)

Bug Fixes

asset links (#170) (d841fff)

Assets 2

18 Feb 20:52

github-actions

v3.0.0-beta.11

624fa30

v3.0.0-beta.11 Pre-release

Pre-release

3.0.0-beta.11 (2024-02-18)

Features

completion and infill (#164) (ede69c1)
support configuring more options for getLlama when using "lastBuild" (#164) (ede69c1)
export resolveChatWrapperBasedOnWrapperTypeName (#165) (624fa30)

Shipped with llama.cpp release b2174

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

Assets 2

18 Feb 23:09

github-actions

v2.8.7

7450aae

v2.8.7

2.8.7 (2024-02-18)

Bug Fixes

adapt to llama.cpp breaking changes (#166) (7450aae)

Assets 2

11 Feb 23:35

github-actions

v3.0.0-beta.10

47b476f

v3.0.0-beta.10 Pre-release

Pre-release

3.0.0-beta.10 (2024-02-11)

Features

get VRAM state (#161) (46235a2)
chatWrapper getter on a LlamaChatSession (#161) (46235a2)
minP support (#162) (47b476f)

Shipped with llama.cpp release b2127

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

Assets 2

Uh oh!

Releases: withcatai/node-llama-cpp

v3.0.0-beta.16

3.0.0-beta.16 (2024-04-13)

Bug Fixes

Features

Uh oh!

v3.0.0-beta.15

3.0.0-beta.15 (2024-04-04)

Bug Fixes

Features

Uh oh!

v2.8.9

2.8.9 (2024-03-21)

Bug Fixes

Uh oh!

v3.0.0-beta.14

3.0.0-beta.14 (2024-03-16)

Bug Fixes

Features

Uh oh!

v3.0.0-beta.13

3.0.0-beta.13 (2024-03-03)

Bug Fixes

Features

Uh oh!

v3.0.0-beta.12

3.0.0-beta.12 (2024-02-24)

Bug Fixes

Features

Uh oh!

v2.8.8

2.8.8 (2024-02-23)

Bug Fixes

Uh oh!

v3.0.0-beta.11

3.0.0-beta.11 (2024-02-18)

Features

Uh oh!

v2.8.7

2.8.7 (2024-02-18)

Bug Fixes

Uh oh!

v3.0.0-beta.10

3.0.0-beta.10 (2024-02-11)

Features

Uh oh!