Skip to content

Releases: withcatai/node-llama-cpp

v3.0.0-beta.16

13 Apr 17:14
d332b77

Choose a tag to compare

v3.0.0-beta.16 Pre-release
Pre-release

3.0.0-beta.16 (2024-04-13)

Bug Fixes

Features


Shipped with llama.cpp release b2665

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

v3.0.0-beta.15

04 Apr 20:52
6267778

Choose a tag to compare

v3.0.0-beta.15 Pre-release
Pre-release

3.0.0-beta.15 (2024-04-04)

Bug Fixes

Features

  • automatically adapt to current free VRAM state (#182) (35e6f50)
  • inspect gguf command (#182) (35e6f50)
  • inspect measure command (#182) (35e6f50)
  • readGgufFileInfo function (#182) (35e6f50)
  • GGUF file metadata info on LlamaModel (#182) (35e6f50)
  • JinjaTemplateChatWrapper (#182) (35e6f50)
  • use the tokenizer.chat_template header from the gguf file when available - use it to find a better specialized chat wrapper or use JinjaTemplateChatWrapper with it as a fallback (#182) (35e6f50)
  • simplify generation CLI commands: chat, complete, infill (#182) (35e6f50)
  • Windows on Arm prebuilt binary (#181) (f3b7f81)

Shipped with llama.cpp release b2608

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

v2.8.9

21 Mar 20:18
6b012a6

Choose a tag to compare

2.8.9 (2024-03-21)

Bug Fixes

v3.0.0-beta.14

16 Mar 22:46
315a3eb

Choose a tag to compare

v3.0.0-beta.14 Pre-release
Pre-release

3.0.0-beta.14 (2024-03-16)

Bug Fixes

  • DisposedError was thrown when calling .dispose() (#178) (315a3eb)
  • adapt to breaking llama.cpp changes (#178) (315a3eb)

Features

  • async model and context loading (#178) (315a3eb)
  • automatically try to resolve Failed to detect a default CUDA architecture CUDA compilation error (#178) (315a3eb)
  • detect cmake binary issues and suggest fixes on detection (#178) (315a3eb)

Shipped with llama.cpp release b2440

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

v3.0.0-beta.13

03 Mar 22:24
5a70576

Choose a tag to compare

v3.0.0-beta.13 Pre-release
Pre-release

3.0.0-beta.13 (2024-03-03)

Bug Fixes

Features


Shipped with llama.cpp release b2329

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

v3.0.0-beta.12

24 Feb 22:46
fa6cf2e

Choose a tag to compare

v3.0.0-beta.12 Pre-release
Pre-release

3.0.0-beta.12 (2024-02-24)

Bug Fixes

Features


Shipped with llama.cpp release b2254

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

v2.8.8

23 Feb 12:26
d841fff

Choose a tag to compare

2.8.8 (2024-02-23)

Bug Fixes

v3.0.0-beta.11

18 Feb 20:52
624fa30

Choose a tag to compare

v3.0.0-beta.11 Pre-release
Pre-release

3.0.0-beta.11 (2024-02-18)

Features

  • completion and infill (#164) (ede69c1)
  • support configuring more options for getLlama when using "lastBuild" (#164) (ede69c1)
  • export resolveChatWrapperBasedOnWrapperTypeName (#165) (624fa30)

Shipped with llama.cpp release b2174

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

v2.8.7

18 Feb 23:09
7450aae

Choose a tag to compare

2.8.7 (2024-02-18)

Bug Fixes

v3.0.0-beta.10

11 Feb 23:35
47b476f

Choose a tag to compare

v3.0.0-beta.10 Pre-release
Pre-release

3.0.0-beta.10 (2024-02-11)

Features


Shipped with llama.cpp release b2127

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)