Skip to content

Tags: withcatai/node-llama-cpp

Tags

v3.18.1

Toggle v3.18.1's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
feat(minor): customize `postinstall` behavior (#582)

* feat: customize `postinstall` behavior
* feat: experimental support for context KV cache type configurations
* feat: support `NVFP4` quants

v3.18.0

Toggle v3.18.0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
feat: automatic checkpoints for models that need it (#573)

* feat: automatic checkpoints for models that need it (such as Qwen 3.5 due to its hybrid architecture)
* feat(`QwenChatWrapper`): Qwen 3.5 support
* feat(`inspect gpu` command): detect and report missing prebuilt binary modules and custom npm registry
* feat: initial disk cache dir option for future optimizations (disabled for now)
* fix: Qwen 3.5 memory estimation
* fix: grammar use with HarmonyChatWrapper
* fix: add mistral think segment detection
* fix: compress excessively long segments from the current response on context shift instead of throwing an error
* fix: default thinking budget to 75% of the context size to prevent low-quality responses
* fix: bugs

v3.17.1

Toggle v3.17.1's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
fix: Electron template (#566)

v3.17.0

Toggle v3.17.0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
feat(`getLlama`): `build: "autoAttempt"` (#564)

* feat(`getLlama`): `build: "autoAttempt"`
* feat: get rid of octokit
* fix(CLI): disable Direct I/O by default
* fix: Bun segmentation fault on process exit with undisposed `Llama`
* fix: detect glibc inside Nix
* fix: stricter CI build flag
* chore: update `simple-git`
* chore: switch off of `tsconfig.json` deprecated configs
* docs: clarify `getLlama`'s `build` option logic

v3.16.2

Toggle v3.16.2's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
fix: macOS 14 prebuilt binaries (#559)

v3.16.1

Toggle v3.16.1's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
fix: export missing types (#557)

v3.16.0

Toggle v3.16.0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
feat: Exclude Top Choices (XTC) (#553)

* feat: Exclude Top Choices (XTC) support
* feat: DRY (Don't Repeat Yourself) repeat penalty support
* feat: Tiny Aya support
* fix: adjust the default VRAM padding config to reserve enough memory for compute buffers
* fix: adapt to breaking `llama.cpp` changes
* fix: support function call syntax with optional whitespace prefix
* fix: find the provided cmake path
* fix: change the default value of `useDirectIo` to `false`
* fix: Vulkan device dedupe

v3.15.1

Toggle v3.15.1's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
fix: adapt to `llama.cpp` changes (#547)

* fix: adapt to `llama.cpp` changes
* fix: change the level of common logs

v3.15.0

Toggle v3.15.0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
feat(`LlamaCompletion`): `stopOnAbortSignal` (#538)

* feat(`LlamaCompletion`): `stopOnAbortSignal`
* feat(`LlamaModel`): `useDirectIo`
* fix: support new CUDA 13.1 archs
* fix: build the prebuilt binaries with CUDA 13.1 instead of 13.0
* docs: stopping a text completion generation

v3.14.5

Toggle v3.14.5's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
docs: fix cmake dependencies link (#534)