copilot-swe-agent[bot]
6a1e44c8ff
Fix markdown parsing to handle multi-line constructs correctly
...
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-02-03 11:42:51 +00:00
copilot-swe-agent[bot]
bda40b266c
Initial plan
2026-02-03 11:40:19 +00:00
dependabot[bot]
c86edf06f2
chore(deps): bump github.com/onsi/gomega from 1.39.0 to 1.39.1 ( #8357 )
...
Bumps [github.com/onsi/gomega](https://github.com/onsi/gomega ) from 1.39.0 to 1.39.1.
- [Release notes](https://github.com/onsi/gomega/releases )
- [Changelog](https://github.com/onsi/gomega/blob/master/CHANGELOG.md )
- [Commits](https://github.com/onsi/gomega/compare/v1.39.0...v1.39.1 )
---
updated-dependencies:
- dependency-name: github.com/onsi/gomega
dependency-version: 1.39.1
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-02-03 09:00:59 +01:00
dependabot[bot]
5d14c2fe4d
chore(deps): bump sentence-transformers from 5.2.0 to 5.2.2 in /backend/python/transformers ( #8358 )
...
chore(deps): bump sentence-transformers in /backend/python/transformers
Bumps [sentence-transformers](https://github.com/huggingface/sentence-transformers ) from 5.2.0 to 5.2.2.
- [Release notes](https://github.com/huggingface/sentence-transformers/releases )
- [Commits](https://github.com/huggingface/sentence-transformers/compare/v5.2.0...v5.2.2 )
---
updated-dependencies:
- dependency-name: sentence-transformers
dependency-version: 5.2.2
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-02-03 08:37:05 +01:00
dependabot[bot]
4601143998
chore(deps): bump go.opentelemetry.io/otel/exporters/prometheus from 0.61.0 to 0.62.0 ( #8359 )
...
chore(deps): bump go.opentelemetry.io/otel/exporters/prometheus
Bumps [go.opentelemetry.io/otel/exporters/prometheus](https://github.com/open-telemetry/opentelemetry-go ) from 0.61.0 to 0.62.0.
- [Release notes](https://github.com/open-telemetry/opentelemetry-go/releases )
- [Changelog](https://github.com/open-telemetry/opentelemetry-go/blob/main/CHANGELOG.md )
- [Commits](https://github.com/open-telemetry/opentelemetry-go/compare/exporters/prometheus/v0.61.0...exporters/prometheus/v0.62.0 )
---
updated-dependencies:
- dependency-name: go.opentelemetry.io/otel/exporters/prometheus
dependency-version: 0.62.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-02-03 08:36:34 +01:00
dependabot[bot]
7913ea2bfb
chore(deps): bump go.opentelemetry.io/otel/sdk/metric from 1.39.0 to 1.40.0 ( #8354 )
...
chore(deps): bump go.opentelemetry.io/otel/sdk/metric
Bumps [go.opentelemetry.io/otel/sdk/metric](https://github.com/open-telemetry/opentelemetry-go ) from 1.39.0 to 1.40.0.
- [Release notes](https://github.com/open-telemetry/opentelemetry-go/releases )
- [Changelog](https://github.com/open-telemetry/opentelemetry-go/blob/main/CHANGELOG.md )
- [Commits](https://github.com/open-telemetry/opentelemetry-go/compare/v1.39.0...v1.40.0 )
---
updated-dependencies:
- dependency-name: go.opentelemetry.io/otel/sdk/metric
dependency-version: 1.40.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-02-03 08:36:10 +01:00
Ettore Di Giacinto
d6409bd2eb
Revert "chore(deps): bump torch from 2.7.0 to 2.7.1+xpu in /backend/python/vllm in the pip group across 1 directory" ( #8367 )
...
Revert "chore(deps): bump torch from 2.7.0 to 2.7.1+xpu in /backend/python/vl…"
This reverts commit 4c0e70086d .
2026-02-03 08:34:54 +01:00
dependabot[bot]
98872791e5
chore(deps): bump protobuf from 6.33.4 to 6.33.5 in /backend/python/transformers ( #8356 )
...
chore(deps): bump protobuf in /backend/python/transformers
Bumps [protobuf](https://github.com/protocolbuffers/protobuf ) from 6.33.4 to 6.33.5.
- [Release notes](https://github.com/protocolbuffers/protobuf/releases )
- [Commits](https://github.com/protocolbuffers/protobuf/commits )
---
updated-dependencies:
- dependency-name: protobuf
dependency-version: 6.33.5
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-02-03 08:33:45 +01:00
dependabot[bot]
c6d47cb4e5
chore(deps): bump github.com/anthropics/anthropic-sdk-go from 1.19.0 to 1.20.0 ( #8355 )
...
chore(deps): bump github.com/anthropics/anthropic-sdk-go
Bumps [github.com/anthropics/anthropic-sdk-go](https://github.com/anthropics/anthropic-sdk-go ) from 1.19.0 to 1.20.0.
- [Release notes](https://github.com/anthropics/anthropic-sdk-go/releases )
- [Changelog](https://github.com/anthropics/anthropic-sdk-go/blob/main/CHANGELOG.md )
- [Commits](https://github.com/anthropics/anthropic-sdk-go/compare/v1.19.0...v1.20.0 )
---
updated-dependencies:
- dependency-name: github.com/anthropics/anthropic-sdk-go
dependency-version: 1.20.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-02-03 08:33:24 +01:00
dependabot[bot]
f754a8edb1
chore(deps): bump go.opentelemetry.io/otel/metric from 1.39.0 to 1.40.0 ( #8353 )
...
Bumps [go.opentelemetry.io/otel/metric](https://github.com/open-telemetry/opentelemetry-go ) from 1.39.0 to 1.40.0.
- [Release notes](https://github.com/open-telemetry/opentelemetry-go/releases )
- [Changelog](https://github.com/open-telemetry/opentelemetry-go/blob/main/CHANGELOG.md )
- [Commits](https://github.com/open-telemetry/opentelemetry-go/compare/v1.39.0...v1.40.0 )
---
updated-dependencies:
- dependency-name: go.opentelemetry.io/otel/metric
dependency-version: 1.40.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-02-03 08:33:04 +01:00
dependabot[bot]
4c0e70086d
chore(deps): bump torch from 2.7.0 to 2.7.1+xpu in /backend/python/vllm in the pip group across 1 directory ( #8360 )
...
chore(deps): bump torch
Bumps the pip group with 1 update in the /backend/python/vllm directory: torch.
Updates `torch` from 2.7.0 to 2.7.1+xpu
---
updated-dependencies:
- dependency-name: torch
dependency-version: 2.7.1+xpu
dependency-type: direct:production
dependency-group: pip
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-02-03 03:07:02 +00:00
dependabot[bot]
f8bd527dfe
chore(deps): bump appleboy/ssh-action from 1.2.4 to 1.2.5 ( #8352 )
...
Bumps [appleboy/ssh-action](https://github.com/appleboy/ssh-action ) from 1.2.4 to 1.2.5.
- [Release notes](https://github.com/appleboy/ssh-action/releases )
- [Commits](https://github.com/appleboy/ssh-action/compare/v1.2.4...v1.2.5 )
---
updated-dependencies:
- dependency-name: appleboy/ssh-action
dependency-version: 1.2.5
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-02-03 01:53:20 +00:00
Ettore Di Giacinto
08b2b8d755
fix(libbackend): do not inject --index-strategy unsafe-best-match to all
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-02-02 17:13:45 +01:00
Dream
10a1e6c74d
feat(whisperx): add whisperx backend for transcription with speaker diarization ( #8299 )
...
* feat(proto): add speaker field to TranscriptSegment for diarization
Add speaker field to the gRPC TranscriptSegment message and map it
through the Go schema, enabling backends to return speaker labels.
Signed-off-by: eureka928 <meobius123@gmail.com >
* feat(whisperx): add whisperx backend for transcription with diarization
Add Python gRPC backend using WhisperX for speech-to-text with
word-level timestamps, forced alignment, and speaker diarization
via pyannote-audio when HF_TOKEN is provided.
Signed-off-by: eureka928 <meobius123@gmail.com >
* feat(whisperx): register whisperx backend in Makefile
Signed-off-by: eureka928 <meobius123@gmail.com >
* feat(whisperx): add whisperx meta and image entries to index.yaml
Signed-off-by: eureka928 <meobius123@gmail.com >
* ci(whisperx): add build matrix entries for CPU, CUDA 12/13, and ROCm
Signed-off-by: eureka928 <meobius123@gmail.com >
* fix(whisperx): unpin torch versions and use CPU index for cpu requirements
Address review feedback:
- Use --extra-index-url for CPU torch wheels to reduce size
- Remove torch version pins, let uv resolve compatible versions
Signed-off-by: eureka928 <meobius123@gmail.com >
* fix(whisperx): pin torch ROCm variant to fix CI build failure
Signed-off-by: eureka928 <meobius123@gmail.com >
* fix(whisperx): pin torch CPU variant to fix uv resolution failure
Pin torch==2.8.0+cpu so uv resolves the CPU wheel from the extra
index instead of picking torch==2.8.0+cu128 from PyPI, which pulls
unresolvable CUDA dependencies.
Signed-off-by: eureka928 <meobius123@gmail.com >
* fix(whisperx): use unsafe-best-match index strategy to fix uv resolution failure
uv's default first-match strategy finds torch on PyPI before checking
the extra index, causing it to pick torch==2.8.0+cu128 instead of the
CPU variant. This makes whisperx's transitive torch dependency
unresolvable. Using unsafe-best-match lets uv consider all indexes.
Signed-off-by: eureka928 <meobius123@gmail.com >
* fix(whisperx): drop +cpu local version suffix to fix uv resolution failure
PEP 440 ==2.8.0 matches 2.8.0+cpu from the extra index, avoiding the
issue where uv cannot locate an explicit +cpu local version specifier.
This aligns with the pattern used by all other CPU backends.
Signed-off-by: eureka928 <meobius123@gmail.com >
* fix(backends): drop +rocm local version suffixes from hipblas requirements to fix uv resolution
uv cannot resolve PEP 440 local version specifiers (e.g. +rocm6.4,
+rocm6.3) in pinned requirements. The --extra-index-url already points
to the correct ROCm wheel index and --index-strategy unsafe-best-match
(set in libbackend.sh) ensures the ROCm variant is preferred.
Applies the same fix as 7f5d72e8 (which resolved this for +cpu) across
all 14 hipblas requirements files.
Signed-off-by: eureka928 <meobius123@gmail.com >
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
Signed-off-by: eureka928 <meobius123@gmail.com >
* revert: scope hipblas suffix fix to whisperx only
Reverts changes to non-whisperx hipblas requirements files per
maintainer review — other backends are building fine with the +rocm
local version suffix.
Signed-off-by: eureka928 <meobius123@gmail.com >
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
Signed-off-by: eureka928 <meobius123@gmail.com >
---------
Signed-off-by: eureka928 <meobius123@gmail.com >
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com >
2026-02-02 16:33:12 +01:00
Alex O'Connell
b7585ca738
fix(api): Add missing field in initial OpenAI streaming response ( #8341 )
...
Add missing field in initial OpenAI streaming response
Signed-off-by: Alex O'Connell <35843486+acon96@users.noreply.github.com >
2026-02-02 08:30:04 +01:00
LocalAI [bot]
8cae99229c
chore: ⬆️ Update ggml-org/llama.cpp to 2634ed207a17db1a54bd8df0555bd8499a6ab691 ( #8336 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-02-01 21:23:57 +00:00
rampa3
04e0f444e1
chore(model gallery): Add Qwen 3 VL 8B thinking & instruct ( #8329 )
...
Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com >
2026-02-01 17:34:31 +01:00
rampa3
6f410f4cbe
chore(model gallery): Rename downloaded filename for Magistral Small mmproj ( #8327 )
...
Rename downloaded filename for Magistral Small mmproj
Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com >
2026-02-01 17:33:48 +01:00
Ettore Di Giacinto
800f749c7b
fix: drop gguf VRAM estimation (now redundant) ( #8325 )
...
fix: drop gguf VRAM estimation
Cleanup. This is now handled directly in llama.cpp, no need to estimate from Go.
VRAM estimation in general is tricky, but llama.cpp ( 41ea26144e/src/llama.cpp (L168) ) lately has added an automatic "fitting" of models to VRAM, so we can drop backend-specific GGUF VRAM estimation from our code instead of trying to guess as we already enable it
397f7f0862/backend/cpp/llama-cpp/grpc-server.cpp (L393)
Fixes: https://github.com/mudler/LocalAI/issues/8302
See: https://github.com/mudler/LocalAI/issues/8302#issuecomment-3830773472
2026-02-01 17:33:28 +01:00
Andres
b6459ddd57
feat(api): Add transcribe response format request parameter & adjust STT backends ( #8318 )
...
* WIP response format implementation for audio transcriptions
(cherry picked from commit e271dd764bbc13846accf3beb8b6522153aa276f)
Signed-off-by: Andres Smith <andressmithdev@pm.me >
* Rework transcript response_format and add more formats
(cherry picked from commit 6a93a8f63e2ee5726bca2980b0c9cf4ef8b7aeb8)
Signed-off-by: Andres Smith <andressmithdev@pm.me >
* Add test and replace go-openai package with official openai go client
(cherry picked from commit f25d1a04e46526429c89db4c739e1e65942ca893)
Signed-off-by: Andres Smith <andressmithdev@pm.me >
* Fix faster-whisper backend and refactor transcription formatting to also work on CLI
Signed-off-by: Andres Smith <andressmithdev@pm.me >
(cherry picked from commit 69a93977d5e113eb7172bd85a0f918592d3d2168)
Signed-off-by: Andres Smith <andressmithdev@pm.me >
---------
Signed-off-by: Andres Smith <andressmithdev@pm.me >
Co-authored-by: nanoandrew4 <nanoandrew4@gmail.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-02-01 17:33:17 +01:00
Ettore Di Giacinto
397f7f0862
fix(ui): take account of reasoning in token count calculation ( #8324 )
...
We were skipping reasoning traces when counting tokens, yielding to a
wrong sum count.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-02-01 10:48:31 +01:00
LocalAI [bot]
234072769c
chore(model gallery): 🤖 add 1 new models via gallery agent ( #8321 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-02-01 09:02:05 +01:00
LocalAI [bot]
3445415b3d
chore: ⬆️ Update ggml-org/llama.cpp to 41ea26144e55d23f37bb765f88c07588d786567f ( #8317 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-31 21:18:31 +00:00
LocalAI [bot]
b05e110aa6
chore: ⬆️ Update ggml-org/llama.cpp to 1488339138d609139c4400d1b80f8a5b1a9a203c ( #8306 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-31 08:59:09 +01:00
LocalAI [bot]
e69cba2444
chore: ⬆️ Update ggml-org/whisper.cpp to aa1bc0d1a6dfd70dbb9f60c11df12441e03a9075 ( #8305 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-31 08:58:54 +01:00
LocalAI [bot]
f7903597ac
chore(model-gallery): ⬆️ update checksum ( #8307 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-30 21:27:48 +01:00
LocalAI [bot]
ee76a0cd1c
feat(swagger): update swagger ( #8304 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-30 21:27:10 +01:00
Ettore Di Giacinto
4ca5b737bf
chore(cuda): target 12.8 for 12 to increase compatibility ( #8297 )
...
Some datacenter setups might be stuck with the 5.x kernel which doesn't
play well with CUDA >=12.9. To incrase compatibility with the CUDA 12.x
branch, downgrade to 12.8. For newer systems, it is still suggested to
use CUDA 13.x wherever compatible.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-30 12:58:44 +01:00
Ettore Di Giacinto
4077aaf978
chore: re-enable e2e tests, fixups anthropic API tools support ( #8296 )
...
* chore(tests): add mock backend e2e tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixup anthropic tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* prepare e2e tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Drop repetitive tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Drop specific CI workflow
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixup anthropic issues, move all e2e tests to use mocked backend
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-30 12:41:50 +01:00
Ettore Di Giacinto
68dd9765a0
feat(tts): add support for streaming mode ( #8291 )
...
* feat(tts): add support for streaming mode
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Send first audio, make sure it's 16
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-30 11:58:01 +01:00
LocalAI [bot]
2c44b06a67
chore: ⬆️ Update ggml-org/llama.cpp to 4fdbc1e4dba428ce0cf9d2ac22232dc170bbca82 ( #8283 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-29 23:43:29 +01:00
LocalAI [bot]
7cc90db3e5
chore(model-gallery): ⬆️ update checksum ( #8285 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-29 21:51:18 +01:00
Ettore Di Giacinto
1e08e02598
feat(qwen-asr): add support to qwen-asr ( #8281 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-29 21:50:35 +01:00
Richard Palethorpe
dd8e74a486
feat(realtime): Add audio conversations ( #6245 )
...
* feat(realtime): Add audio conversations
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* chore(realtime): Vendor the updated API and modify for server side
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* feat(realtime): Update to the GA realtime API
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* chore: Document realtime API and add docs to AGENTS.md
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* feat: Filter reasoning from spoken output
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(realtime): Send delta and done events for tool calls and audio transcripts
Ensure that content is sent in both deltas and done events for function call arguments and audio transcripts. This fixes compatibility with clients that rely on delta events for parsing.
💘 Generated with Crush
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(realtime): Improve tool call handling and error reporting
- Refactor Model interface to accept []types.ToolUnion and *types.ToolChoiceUnion
instead of JSON strings, eliminating unnecessary marshal/unmarshal cycles
- Fix Parameters field handling: support both map[string]any and JSON string formats
- Add PredictConfig() method to Model interface for accessing model configuration
- Add comprehensive debug logging for tool call parsing and function config
- Add missing return statement after prediction error (critical bug fix)
- Add warning logs for NoAction function argument parsing failures
- Improve error visibility throughout generateResponse function
💘 Generated with Crush
Assisted-by: Claude Sonnet 4.5 via Crush <crush@charm.land >
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2026-01-29 08:44:53 +01:00
Ettore Di Giacinto
48e08772f3
chore(llama.cpp): bump to 'f6b533d898ce84bae8d9fa8dfc6697ac087800bf' ( #8275 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-29 00:22:25 +01:00
LocalAI [bot]
c28c0227c6
chore: ⬆️ Update leejet/stable-diffusion.cpp to e411520407663e1ddf8ff2e5ed4ff3a116fbbc97 ( #8274 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-28 21:23:05 +00:00
Richard Palethorpe
856ca2d6b1
fix(qwen3): Be explicit with function calling format ( #8265 )
...
Qwen3 4b was using the wrong function format (i.e. using "function"
instead of "name") within the realtime API.
If we specify the function calling format explicitly then it stops it.
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2026-01-28 14:44:29 +01:00
Ettore Di Giacinto
9b973b79f6
feat: add VoxCPM tts backend ( #8109 )
...
* feat: add VoxCPM tts backend
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Disable voxcpm on arm64 cpu
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-28 14:44:04 +01:00
Ettore Di Giacinto
cba8ef4e38
chore: fix backend icons
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-28 09:09:00 +01:00
Ettore Di Giacinto
f729e300d6
chore: fix backend icons
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-28 09:08:03 +01:00
LocalAI [bot]
9916811a79
chore: ⬆️ Update ggml-org/llama.cpp to 2b4cbd2834e427024bc7f935a1f232aecac6679b ( #8258 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-28 08:50:16 +01:00
Ettore Di Giacinto
2f7c595cd1
chore(model gallery): add z-image and z-image-turbo for diffusers ( #8260 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-27 22:42:10 +01:00
rampa3
73decac746
chore(model gallery): Add mistral-community/pixtral-12b with mmproj ( #8245 )
...
Rebased branch add_pixtral on master
Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com >
2026-01-27 21:43:31 +01:00
Ettore Di Giacinto
ec1598868b
feat(vibevoice): add ASR support ( #8222 )
...
* feat(vibevoice): add ASR support
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(tests): download voice files
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Small fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Small fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Try to run on bigger runner
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* debug
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* CI can't hold vibevoice
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-27 20:19:22 +01:00
rampa3
93d7e5d4b8
chore(model gallery): Add entry for Magistral Small 1.2 with mmproj ( #8248 )
...
Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com >
2026-01-27 16:55:00 +01:00
rampa3
ff5a54b9d1
chore(model gallery): Add entry for Mistral Small 3.1 with mmproj ( #8247 )
...
* chore(model gallery): Add entry for Mistral Small 3.1 with mmproj
Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com >
* Use llama-cpp subfolder structure akin to Qwen 3 VL
Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com >
---------
Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com >
2026-01-27 16:54:14 +01:00
LocalAI [bot]
3c1f823c47
chore: ⬆️ Update ggml-org/llama.cpp to 8f80d1b254aef70a0959e314be368d05debe7294 ( #8229 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-26 21:19:43 +00:00
LocalAI [bot]
4024220d00
chore(model gallery): 🤖 add 1 new models via gallery agent ( #8220 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-26 12:11:24 +01:00
LocalAI [bot]
f76958d761
chore: ⬆️ Update ggml-org/llama.cpp to 0440bfd1605333726ea0fb7a836942660bf2f9a6 ( #8216 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-26 00:50:35 +01:00
LocalAI [bot]
2bd5ca45de
chore: ⬆️ Update leejet/stable-diffusion.cpp to 43e829f21966abb96b08c712bccee872dc820914 ( #8215 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-26 00:50:16 +01:00
Ettore Di Giacinto
6804ce1c39
chore(docs): change MEMORY_FILE_PATH to MEMORY_INDEX_PATH
...
Updated MEMORY_FILE_PATH to MEMORY_INDEX_PATH in memory configuration.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-25 22:14:11 +01:00
Dedy F. Setyawan
d499071bff
fix(ui): correctly display selected image model ( #8208 )
...
Signed-off-by: Dedy F. Setyawan <dedyfajars@gmail.com >
2026-01-25 14:54:40 +01:00
Ettore Di Giacinto
26a374b717
chore: drop bark which is unmaintained ( #8207 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-25 09:26:40 +01:00
rampa3
980de0e25b
chore(model gallery): Add most of not yet present Piper voices from Hugging Face ( #8202 )
...
Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com >
2026-01-25 08:56:53 +01:00
Ettore Di Giacinto
4767371aee
chore(README): Add links
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-24 22:49:27 +01:00
Ettore Di Giacinto
131d247b78
chore(README): Update and simplify links
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-24 22:46:40 +01:00
Ettore Di Giacinto
b2a8a63899
feat(vllm-omni): add new backend ( #8188 )
...
* feat(vllm-omni: add new backend
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* default to py3.12
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-24 22:23:30 +01:00
LocalAI [bot]
05a332cd5f
chore: ⬆️ Update ggml-org/llama.cpp to bb02f74c612064947e51d23269a1cf810b67c9a7 ( #8196 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-24 21:19:43 +00:00
Ettore Di Giacinto
05904c77f5
chore(exllama): drop backend now almost deprecated ( #8186 )
...
exllama2 development has stalled and only old architectures are
supported. exllamav3 is still in development, meanwhile cleaning up
exllama2 from the gallery.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-24 08:57:37 +01:00
LocalAI [bot]
17783fa7d9
chore: ⬆️ Update leejet/stable-diffusion.cpp to fa61ea744d1a87fa26a63f8a86e45587bc9534d6 ( #8184 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-24 08:57:24 +01:00
LocalAI [bot]
4019094111
chore: ⬆️ Update ggml-org/llama.cpp to 557515be1e93ed8939dd8a7c7d08765fdbe8be31 ( #8183 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-24 08:57:08 +01:00
Ettore Di Giacinto
ca65fc751a
chore(model gallery): add qwen3-tts to model gallery ( #8187 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-23 23:06:50 +01:00
LocalAI [bot]
a1e3acc590
docs: ⬆️ update docs version mudler/LocalAI ( #8182 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-23 22:03:47 +01:00
Ettore Di Giacinto
a36960e069
fix(qwen-tts): change icon URL in index.yaml
...
Updated the icon URL for the project.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-23 22:00:14 +01:00
Ettore Di Giacinto
58bb6a29ed
Revert "chore(deps): bump torch from 2.4.1 to 2.7.1+xpu in /backend/python/bark in the pip group across 1 directory" ( #8180 )
...
Revert "chore(deps): bump torch from 2.4.1 to 2.7.1+xpu in /backend/python/ba…"
This reverts commit 5881c82413 .
2026-01-23 17:25:04 +01:00
dependabot[bot]
5881c82413
chore(deps): bump torch from 2.4.1 to 2.7.1+xpu in /backend/python/bark in the pip group across 1 directory ( #8175 )
...
chore(deps): bump torch
Bumps the pip group with 1 update in the /backend/python/bark directory: torch.
Updates `torch` from 2.4.1 to 2.7.1+xpu
---
updated-dependencies:
- dependency-name: torch
dependency-version: 2.7.1+xpu
dependency-type: direct:production
dependency-group: pip
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-23 15:32:15 +00:00
Ettore Di Giacinto
923ebbb344
feat(qwen-tts): add Qwen-tts backend ( #8163 )
...
* feat(qwen-tts): add Qwen-tts backend
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Update intel deps
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Drop flash-attn for cuda13
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-23 15:18:41 +01:00
LocalAI [bot]
ea51567b89
chore(model gallery): 🤖 add 1 new models via gallery agent ( #8170 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-23 08:19:39 +01:00
LocalAI [bot]
552c62a19c
chore: ⬆️ Update leejet/stable-diffusion.cpp to 5e4579c11d0678f9765463582d024e58270faa9c ( #8166 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-23 08:18:05 +01:00
Ettore Di Giacinto
c0b21a921b
feat: detect thinking support from backend automatically if not explicitly set ( #8167 )
...
detect thinking support from backend automatically if not explicitly set
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-23 00:38:28 +01:00
LocalAI [bot]
b10045adc2
chore: ⬆️ Update ggml-org/llama.cpp to a5eaa1d6a3732bc0f460b02b61c95680bba5a012 ( #8165 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-22 23:32:05 +00:00
Ettore Di Giacinto
61b5e3b629
chore: drop test file
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-22 22:19:38 +00:00
Ettore Di Giacinto
e35d7cb3b3
chore: drop test file
...
the function now was removed
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-22 21:47:52 +00:00
Ettore Di Giacinto
0fa0ac4797
fix(videogen): drop incomplete endpoint, add GGUF support for LTX-2 ( #8160 )
...
* Debug
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Drop openai video endpoint (is not complete)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add download button
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-22 14:09:20 +01:00
LocalAI [bot]
be7ed85838
chore(model gallery): 🤖 add 1 new models via gallery agent ( #8157 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-22 08:25:40 +01:00
LocalAI [bot]
c12b310028
chore: ⬆️ Update ggml-org/llama.cpp to c301172f660a1fe0b42023da990bf7385d69adb4 ( #8151 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-22 00:51:22 +01:00
LocalAI [bot]
0447d5564d
chore: ⬆️ Update leejet/stable-diffusion.cpp to 329571131d62d64a4f49e1acbef49ae02544fdcd ( #8152 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-22 00:50:41 +01:00
Ettore Di Giacinto
22c0eb5421
chore(diffusers): add 'av' to requirements.txt ( #8155 )
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-21 22:35:00 +01:00
LocalAI [bot]
a0a00fb937
chore(model-gallery): ⬆️ update checksum ( #8153 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-21 21:45:11 +01:00
LocalAI [bot]
6dd44742ea
feat(swagger): update swagger ( #8150 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-21 21:44:44 +01:00
Richard Palethorpe
00c72e7d3e
fix(tracing): Create trace buffer on first request to enable tracing at runtime ( #8148 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2026-01-21 18:39:39 +01:00
LocalAI [bot]
d01c335cf6
chore: ⬆️ Update ggml-org/whisper.cpp to 7aa8818647303b567c3a21fe4220b2681988e220 ( #8146 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-21 17:44:01 +01:00
LocalAI [bot]
5687df4535
chore: ⬆️ Update ggml-org/llama.cpp to ad8d85bd94cc86e89d23407bdebf98f2e6510c61 ( #8145 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-21 15:41:36 +00:00
Ettore Di Giacinto
f5fade97e6
chore: drop noisy logs ( #8142 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-21 09:52:20 +01:00
Ettore Di Giacinto
b88ae31e4e
chore(model gallery): add flux 2 and flux 2 klein ( #8141 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-21 09:46:33 +01:00
Ettore Di Giacinto
f6daaa7c35
chore(deps): Bump llama.cpp to '1c7cf94b22a9dc6b1d32422f72a627787a4783a3' ( #8136 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-21 00:12:13 +01:00
Ettore Di Giacinto
c491c6ca90
feat(openresponses): Support reasoning blocks ( #8133 )
...
* feat(openresponses): support reasoning blocks
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* allow to disable reasoning, refactor common logic
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add option to only strip reasoning
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add configurations for custom reasoning tokens
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-21 00:11:45 +01:00
Ettore Di Giacinto
34e054f607
fix(reasoning): support models with reasoning without starting thinking tag ( #8132 )
...
* chore: extract reasoning to its own package
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* make sure we detect thinking tokens from template
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Allow to override via config, add tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-20 21:07:59 +01:00
LocalAI [bot]
e886bb291a
chore(model gallery): 🤖 add 1 new models via gallery agent ( #8128 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-20 12:58:29 +01:00
Ettore Di Giacinto
4bf2f8bbd8
chore(docs): update docs with Anthropic API and openresponses
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-20 09:25:24 +01:00
LocalAI [bot]
d3525b7509
chore: ⬆️ Update ggml-org/llama.cpp to 959ecf7f234dc0bc0cd6829b25cb0ee1481aa78a ( #8122 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-19 22:50:47 +01:00
LocalAI [bot]
c8aa821e0e
chore: ⬆️ Update leejet/stable-diffusion.cpp to a48b4a3ade9972faf0adcad47e51c6fc03f0e46d ( #8121 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-19 22:27:46 +01:00
dependabot[bot]
b3191927ae
chore(deps): bump github.com/mudler/cogito from 0.7.2 to 0.8.1 ( #8124 )
...
Bumps [github.com/mudler/cogito](https://github.com/mudler/cogito ) from 0.7.2 to 0.8.1.
- [Release notes](https://github.com/mudler/cogito/releases )
- [Commits](https://github.com/mudler/cogito/compare/v0.7.2...v0.8.1 )
---
updated-dependencies:
- dependency-name: github.com/mudler/cogito
dependency-version: 0.8.1
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-19 22:26:26 +01:00
LocalAI [bot]
54c5a2d9ea
docs: ⬆️ update docs version mudler/LocalAI ( #8120 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-19 21:18:24 +00:00
Ettore Di Giacinto
0279591fec
Enable reranking for Qwen3-VL-Reranker-8B
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-19 15:28:58 +01:00
LocalAI [bot]
8845186955
chore: ⬆️ Update leejet/stable-diffusion.cpp to 2efd19978dd4164e387bf226025c9666b6ef35e2 ( #8099 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-18 22:40:35 +01:00
LocalAI [bot]
ab8ed24358
chore: ⬆️ Update ggml-org/llama.cpp to 287a33017b32600bfc0e81feeb0ad6e81e0dd484 ( #8100 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-18 22:40:14 +01:00
LocalAI [bot]
a021df5a88
feat(swagger): update swagger ( #8098 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-18 22:10:06 +01:00
Ettore Di Giacinto
5f403b1631
chore: drop neutts for l4t ( #8101 )
...
Builds exhausts CI currently, and there are better backends at this
point in time. We will probably deprecate it in the future.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-18 21:55:56 +01:00
rampa3
897ad1729e
chore(model gallery): add qwen3-coder-30b-a3b-instruct based on model request ( #8082 )
...
* chore(model gallery): add qwen3-coder-30b-a3b-instruct based on model request
Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com >
* added missing model config import URL
Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com >
---------
Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com >
2026-01-18 09:23:07 +01:00
LocalAI [bot]
16a18a2e55
chore: ⬆️ Update leejet/stable-diffusion.cpp to 9565c7f6bd5fcff124c589147b2621244f2c4aa1 ( #8086 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-17 22:12:21 +01:00
Ettore Di Giacinto
3387bfaee0
feat(api): add support for open responses specification ( #8063 )
...
* feat: openresponses
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add ttl settings, fix tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: register cors middleware by default
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* satisfy schema
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Logitbias and logprobs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add grammar
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* SSE compliance
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* tool JSON conversion
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* support background mode
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* swagger
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* drop code. This is handled in the handler
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Small refactorings
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* background mode for MCP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-17 22:11:47 +01:00
LocalAI [bot]
1cd33047b4
chore: ⬆️ Update ggml-org/llama.cpp to 2fbde785bc106ae1c4102b0e82b9b41d9c466579 ( #8087 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-17 21:10:18 +00:00
Ettore Di Giacinto
1de045311a
chore(ui): add video generation link ( #8079 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-17 09:49:47 +01:00
LocalAI [bot]
5fe9bf9f84
chore: ⬆️ Update ggml-org/whisper.cpp to f53dc74843e97f19f94a79241357f74ad5b691a6 ( #8074 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-17 08:32:53 +01:00
LocalAI [bot]
d4fd0c0609
chore: ⬆️ Update ggml-org/llama.cpp to 388ce822415f24c60fcf164a321455f1e008cafb ( #8073 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-16 21:22:33 +00:00
Ettore Di Giacinto
d16722ee13
Revert "chore(deps): bump torch from 2.3.1+cxx11.abi to 2.8.0 in /backend/python/rerankers in the pip group across 1 directory" ( #8072 )
...
Revert "chore(deps): bump torch from 2.3.1+cxx11.abi to 2.8.0 in /backend/pyt…"
This reverts commit 1f10ab39a9 .
2026-01-16 20:50:33 +01:00
dependabot[bot]
1f10ab39a9
chore(deps): bump torch from 2.3.1+cxx11.abi to 2.8.0 in /backend/python/rerankers in the pip group across 1 directory ( #8066 )
...
chore(deps): bump torch
Bumps the pip group with 1 update in the /backend/python/rerankers directory: [torch](https://github.com/pytorch/pytorch ).
Updates `torch` from 2.3.1+cxx11.abi to 2.8.0
- [Release notes](https://github.com/pytorch/pytorch/releases )
- [Changelog](https://github.com/pytorch/pytorch/blob/main/RELEASE.md )
- [Commits](https://github.com/pytorch/pytorch/commits/v2.8.0 )
---
updated-dependencies:
- dependency-name: torch
dependency-version: 2.8.0
dependency-type: direct:production
dependency-group: pip
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-16 19:38:12 +00:00
Ettore Di Giacinto
4d36e393d1
fix(ci): use more beefy runner for expensive jobs ( #8065 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-16 19:26:40 +01:00
LocalAI [bot]
cb8616c7d1
chore: ⬆️ Update ggml-org/llama.cpp to 785a71008573e2d84728fb0ba9e851d72d3f8fab ( #8053 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-15 22:53:17 +01:00
LocalAI [bot]
ff31d50488
chore: ⬆️ Update ggml-org/whisper.cpp to 2eeeba56e9edd762b4b38467bab96c2517163158 ( #8052 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-15 22:52:56 +01:00
Divyanshupandey007
1a50717e33
fix: reduce log verbosity for /api/operations polling ( #8050 )
...
* fix: reduce log verbosity for /api/operations polling
Reduces log clutter by changing the log level from INFO to DEBUG for successful (200 OK) /api/operations requests. This endpoint is polled frequently by the Web UI, causing log spam. Fixes #7989 .
* fix: reduce log verbosity for /api/operations polling
Reduces log clutter by changing the log level from INFO to DEBUG for successful (200 OK) /api/operations requests. This endpoint is polled frequently by the Web UI, causing log spam. Fixes #7989 .
2026-01-15 21:13:13 +01:00
LocalAI [bot]
49d6305509
chore: ⬆️ Update ggml-org/llama.cpp to d98b548120eecf98f0f6eaa1ba7e29b3afda9f2e ( #8040 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-15 08:39:46 +01:00
Ettore Di Giacinto
d20a113aef
fix(functions): do not duplicate function when valid JSON is inside XML tags ( #8043 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-14 23:42:00 +01:00
LocalAI [bot]
cbaa793520
chore: ⬆️ Update ggml-org/whisper.cpp to 47af2fb70f7e4ee1ba40c8bed513760fdfe7a704 ( #8039 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-14 22:12:32 +01:00
Ettore Di Giacinto
6fe3fc880f
Update section headers in README.md for clarity
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-14 22:11:58 +01:00
Ettore Di Giacinto
752e641c48
Clarify Docker usage in README
...
Updated Docker section in README to clarify usage.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-14 22:10:59 +01:00
Ettore Di Giacinto
44d78b4d15
chore(doc): put alert on install.sh until is fixed ( #8042 )
...
See: https://github.com/mudler/LocalAI/issues/8032
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-14 22:08:48 +01:00
Ettore Di Giacinto
64d0a96ba3
feat(ui): add video gen UI ( #8020 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-14 11:43:32 +01:00
Ettore Di Giacinto
b19afc9e64
feat(diffusers): add support to LTX-2 ( #8019 )
...
* feat(diffusers): add support to LTX-2
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add to the gallery
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-14 09:07:30 +01:00
LocalAI [bot]
d6e698876b
chore: ⬆️ Update ggml-org/llama.cpp to e4832e3ae4d58ac0ecbdbf4ae055424d6e628c9f ( #8015 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-14 08:09:37 +01:00
LocalAI [bot]
8962205546
chore: ⬆️ Update ggml-org/whisper.cpp to a96310871a3b294f026c3bcad4e715d17b5905fe ( #8014 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-14 08:09:00 +01:00
LocalAI [bot]
eddc460118
chore: ⬆️ Update leejet/stable-diffusion.cpp to 7010bb4dff7bd55b03d35ef9772142c21699eba9 ( #8013 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-14 08:08:31 +01:00
Ettore Di Giacinto
a6ff354c86
feat(tts): add pocket-tts backend ( #8018 )
...
* feat(pocket-tts): add new backend
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add to the gallery
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Update docs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-13 23:35:19 +01:00
dependabot[bot]
3a2be4df48
chore(deps): bump github.com/onsi/ginkgo/v2 from 2.27.3 to 2.27.5 ( #8004 )
...
Bumps [github.com/onsi/ginkgo/v2](https://github.com/onsi/ginkgo ) from 2.27.3 to 2.27.5.
- [Release notes](https://github.com/onsi/ginkgo/releases )
- [Changelog](https://github.com/onsi/ginkgo/blob/master/CHANGELOG.md )
- [Commits](https://github.com/onsi/ginkgo/compare/v2.27.3...v2.27.5 )
---
updated-dependencies:
- dependency-name: github.com/onsi/ginkgo/v2
dependency-version: 2.27.5
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-13 09:06:20 +01:00
dependabot[bot]
4e1f448e86
chore(deps): bump fyne.io/fyne/v2 from 2.7.1 to 2.7.2 ( #8003 )
...
Bumps [fyne.io/fyne/v2](https://github.com/fyne-io/fyne ) from 2.7.1 to 2.7.2.
- [Release notes](https://github.com/fyne-io/fyne/releases )
- [Changelog](https://github.com/fyne-io/fyne/blob/master/CHANGELOG.md )
- [Commits](https://github.com/fyne-io/fyne/compare/v2.7.1...v2.7.2 )
---
updated-dependencies:
- dependency-name: fyne.io/fyne/v2
dependency-version: 2.7.2
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-13 08:45:58 +01:00
dependabot[bot]
3e0168360a
chore(deps): bump github.com/gpustack/gguf-parser-go from 0.22.1 to 0.23.1 ( #8001 )
...
chore(deps): bump github.com/gpustack/gguf-parser-go
Bumps [github.com/gpustack/gguf-parser-go](https://github.com/gpustack/gguf-parser-go ) from 0.22.1 to 0.23.1.
- [Release notes](https://github.com/gpustack/gguf-parser-go/releases )
- [Commits](https://github.com/gpustack/gguf-parser-go/compare/v0.22.1...v0.23.1 )
---
updated-dependencies:
- dependency-name: github.com/gpustack/gguf-parser-go
dependency-version: 0.23.1
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-13 08:45:35 +01:00
dependabot[bot]
ea4157887b
chore(deps): bump github.com/onsi/gomega from 1.38.3 to 1.39.0 ( #8000 )
...
Bumps [github.com/onsi/gomega](https://github.com/onsi/gomega ) from 1.38.3 to 1.39.0.
- [Release notes](https://github.com/onsi/gomega/releases )
- [Changelog](https://github.com/onsi/gomega/blob/master/CHANGELOG.md )
- [Commits](https://github.com/onsi/gomega/compare/v1.38.3...v1.39.0 )
---
updated-dependencies:
- dependency-name: github.com/onsi/gomega
dependency-version: 1.39.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-13 08:45:18 +01:00
dependabot[bot]
699c50be47
chore(deps): bump github.com/mudler/go-processmanager from 0.0.0-20240820160718-8b802d3ecf82 to 0.1.0 ( #7992 )
...
chore(deps): bump github.com/mudler/go-processmanager
Bumps [github.com/mudler/go-processmanager](https://github.com/mudler/go-processmanager ) from 0.0.0-20240820160718-8b802d3ecf82 to 0.1.0.
- [Release notes](https://github.com/mudler/go-processmanager/releases )
- [Commits](https://github.com/mudler/go-processmanager/commits/v0.1.0 )
---
updated-dependencies:
- dependency-name: github.com/mudler/go-processmanager
dependency-version: 0.1.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-13 08:44:53 +01:00
dependabot[bot]
94eecc43a3
chore(deps): bump protobuf from 6.33.2 to 6.33.4 in /backend/python/transformers ( #7993 )
...
chore(deps): bump protobuf in /backend/python/transformers
Bumps [protobuf](https://github.com/protocolbuffers/protobuf ) from 6.33.2 to 6.33.4.
- [Release notes](https://github.com/protocolbuffers/protobuf/releases )
- [Commits](https://github.com/protocolbuffers/protobuf/commits )
---
updated-dependencies:
- dependency-name: protobuf
dependency-version: 6.33.4
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-12 23:46:32 +00:00
LocalAI [bot]
7e35ec6c4f
chore: ⬆️ Update ggml-org/llama.cpp to bcf7546160982f56bc290d2e538544bbc0772f63 ( #7991 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-12 21:14:33 +00:00
Ettore Di Giacinto
7891c33cb1
chore(vulkan): bump vulkan-sdk to 1.4.335.0 ( #7981 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-12 07:51:26 +01:00
Ettore Di Giacinto
271cc79709
chore(backends): do not bundle cuda target directory ( #7982 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-12 07:51:09 +01:00
LocalAI [bot]
3d12d5e70d
chore: ⬆️ Update leejet/stable-diffusion.cpp to 885e62ea822e674c6837a8225d2d75f021b97a6a ( #7979 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-11 22:44:11 +01:00
LocalAI [bot]
bc180c2638
chore: ⬆️ Update ggml-org/llama.cpp to 0c3b7a9efebc73d206421c99b7eb6b6716231322 ( #7978 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-11 22:06:30 +01:00
Ettore Di Giacinto
2de30440fe
fix(l4t-12): use pip to install python deps ( #7967 )
...
* fix: install only torch/torchvision from jetson index
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: use pip for l4t-12
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Revert "fix: install only torch/torchvision from jetson index"
This reverts commit 2d2b020078
* chatterbox needs wheel
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-11 00:21:32 +01:00
Copilot
673a80a578
feat: Filter backend gallery by system capabilities ( #7950 )
...
* Initial plan
* Add backend gallery filtering based on system capabilities
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Refactor L4T backend check to come before NVIDIA check
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Refactor: move capabilities business logic to capabilities.go and use constants
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* feat: display system capability in webui and refactor tests
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* chore: rename System/Capability
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* refactor: use getSystemCapabilities in IsBackendCompatible for consistency
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* refactor: keep unused constants private in capabilities.go
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* fix: skip AMD/ROCm and Intel/SYCL tests on darwin
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-10 23:34:01 +01:00
Jon Roeber
2554e9fabe
fix(model): do not assume success when deleting a model process ( #7963 )
...
* fix(model): do not assume success when deleting a model process
Signed-off-by: Jon Roeber <jon@roeber.dev >
* Update pkg/model/process.go
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
Signed-off-by: Jon Roeber <65431671+jroeber@users.noreply.github.com >
---------
Signed-off-by: Jon Roeber <jon@roeber.dev >
Signed-off-by: Jon Roeber <65431671+jroeber@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-10 23:33:44 +01:00
LocalAI [bot]
5bfc3eebf8
chore: ⬆️ Update ggml-org/llama.cpp to b1377188784f9aea26b8abde56d4aee8c733eec7 ( #7965 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-10 22:24:26 +01:00
LocalAI [bot]
ab893fe302
feat(swagger): update swagger ( #7964 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-10 21:46:23 +01:00
Ettore Di Giacinto
c88074a19e
feat(api): support 'reasoning' api field ( #7959 )
...
This PR adds support to support the 'reasoning' API field of the OpenAI
spec.
LocalAI now will extract automatically thinking tags in both SSE and
non-SSE mode. The changes are adapted as well to the Chat UI now that
will use the reasoning field to extract the thinking process and display
it in the chat.
This fixes https://github.com/mudler/LocalAI/issues/7944
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-10 19:06:12 +01:00
Copilot
5ca8f0aea0
feat: add tool/function calling support to Anthropic Messages API ( #7956 )
...
* Initial plan
* Add tool/function calling schema support to Anthropic Messages API
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Add E2E tests for Anthropic tool calling
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Make tool calling tests require model to use tools
- First test now expects hasToolUse to be true with clear error message
- Third test now expects toolUseID to be non-empty (removed conditional)
- Both tests will now fail if model doesn't call the expected tools
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Add E2E test for tool calling with streaming responses
- Tests that streaming events are properly emitted (content_block_start/delta/stop)
- Verifies tool_use blocks are accumulated correctly in streaming mode
- Ensures model calls tools and stop_reason is set to tool_use
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-10 18:44:22 +01:00
LocalAI [bot]
84234e531f
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7954 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-10 12:34:23 +01:00
Copilot
4cbf9abfef
feat: Add Anthropic Messages API support ( #7948 )
...
* Initial plan
* Add Anthropic Messages API support
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Fix code review comments: add error handling for JSON operations
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Fix test suite to use existing schema test runner
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Add Anthropic e2e tests using anthropic-sdk-go for streaming and non-streaming
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-10 12:33:05 +01:00
LocalAI [bot]
fdc2c0737c
chore: ⬆️ Update ggml-org/llama.cpp to 593da7fa49503b68f9f01700be9f508f1e528992 ( #7946 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-09 21:13:04 +00:00
Ettore Di Giacinto
f4b0a304d7
chore(llama.cpp): propagate errors during model load ( #7937 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-09 07:52:49 +01:00
Ettore Di Giacinto
d16ec7aa9e
chore(deps): Bump llama.cpp to '480160d47297df43b43746294963476fc0a6e10f' ( #7933 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-09 07:52:32 +01:00
Ettore Di Giacinto
d699b7ccdc
Add backend configuration for Granite embedding model
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-09 00:44:10 +01:00
Ettore Di Giacinto
a4d224dd1b
Revert "chore(uv): add --index-strategy=unsafe-first-match to l4t" ( #7936 )
...
Revert "chore(uv): add --index-strategy=unsafe-first-match to l4t (#7934 )"
This reverts commit f5dee90962 .
2026-01-08 23:31:51 +01:00
Ettore Di Giacinto
917c7aa9f3
chore(ci): roll back l4t-cuda12 configurations ( #7935 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-08 23:04:33 +01:00
LocalAI [bot]
5aa66842dd
chore: ⬆️ Update leejet/stable-diffusion.cpp to 0e52afc6513cc2dea9a1a017afc4a008d5acf2b0 ( #7930 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-08 22:48:46 +01:00
Ettore Di Giacinto
f5dee90962
chore(uv): add --index-strategy=unsafe-first-match to l4t ( #7934 )
...
This is because the main index might not contain all the dependencies
for torch
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-08 22:48:03 +01:00
Copilot
06323df457
Optimize GPU library copying to preserve symlinks and avoid duplicates ( #7931 )
...
* Initial plan
* Optimize library copying to preserve symlinks and avoid duplicates
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Address code review feedback: extract get_inode helper, use file type detection for sorting
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Simplify implementation by removing inode tracking
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Add clarifying comment about basename deduplication
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-08 22:26:48 +01:00
Richard Palethorpe
98f28bf583
chore(docs): Add Crush and VoxInput to the integrations ( #7924 )
...
* chore(docs): Add Crush and VoxInput to the integrations
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* Apply suggestion from @mudler
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com >
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-08 21:39:25 +01:00
Ettore Di Giacinto
383312b50e
chore(l4t-12): do not use python 3.12 (wheels are only for 3.10) ( #7928 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-08 19:00:07 +01:00
Ettore Di Giacinto
b736db4bbe
chore(ci): use latest jetpack image for l4t ( #7926 )
...
This image is for HW prior Jetpack 7. Jetpack 7 broke compatibility with
older devices (which are still in use) such as AGX Orin or Jetsons.
While we do have l4t-cuda-13 images with sbsa support for new Nvidia
devices (Thor, DGX, etc). For older HW we are forced to keep old images
around as 24.04 does not seem to be supported.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-08 18:30:59 +01:00
LocalAI [bot]
09bc2e4a00
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7922 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-08 11:06:21 +01:00
LocalAI [bot]
c03e532a18
chore: ⬆️ Update ggml-org/llama.cpp to ae9f8df77882716b1702df2bed8919499e64cc28 ( #7915 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-07 23:24:01 +01:00
Ettore Di Giacinto
fcb58ee243
fix(intel): Add ARG for Ubuntu codename in Dockerfile ( #7917 )
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-07 21:55:18 +01:00
Copilot
b2ff1cea2a
feat: enable Vulkan arm64 image builds ( #7912 )
...
* Initial plan
* Add arm64 support for Vulkan builds in Dockerfiles and workflows
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-07 21:49:50 +01:00
Ettore Di Giacinto
b964b3d53e
feat(backends): add moonshine backend for faster transcription ( #7833 )
...
* feat(backends): add moonshine backend for faster transcription
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add backend to CI, update AGENTS.md from this exercise
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-07 21:44:35 +01:00
LocalAI [bot]
0b26669d0b
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7916 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-07 21:43:39 +01:00
Ettore Di Giacinto
5a9698bc69
chore(Dockerfile): restore GPU vendor specific sections ( #7911 )
...
Until we figure out https://github.com/mudler/LocalAI/issues/7909
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-07 16:34:23 +01:00
Ettore Di Giacinto
1fe0e9f74f
chore(ci): restore building of GPU vendor images ( #7910 )
...
Until we figure out https://github.com/mudler/LocalAI/issues/7909
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-07 16:32:22 +01:00
Ettore Di Giacinto
ffb2dc4666
chore(detection): detect GPU vendor from files present in the system ( #7908 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-07 16:18:27 +01:00
Ettore Di Giacinto
cfc2225fc7
chore(dockerfile): drop driver-requirements section ( #7907 )
...
* chore(dockerfile): drop driver-requirements section
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(ci): drop other builds
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-07 16:18:14 +01:00
Copilot
fd53978a7b
feat: package GPU libraries inside backend containers for unified base image ( #7891 )
...
* Initial plan
* Add GPU library packaging for isolated backend environments
- Create scripts/build/package-gpu-libs.sh for packaging CUDA, ROCm, SYCL, and Vulkan libraries
- Update llama-cpp, whisper, stablediffusion-ggml package.sh to include GPU libraries
- Update Dockerfile.python to package GPU libraries into Python backends
- Update libbackend.sh to set LD_LIBRARY_PATH for GPU library loading
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Address code review feedback: fix variable consistency and quoting
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Fix code review issues: improve glob handling and remove redundant variable
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Simplify main Dockerfile and workflow to use unified base image
- Remove GPU-specific driver installation from Dockerfile (CUDA, ROCm, Vulkan, Intel)
- Simplify image.yml workflow to build single unified base image for linux/amd64 and linux/arm64
- GPU libraries are now packaged in individual backend containers
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-07 15:48:51 +01:00
LocalAI [bot]
7abc0242bb
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7903 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-07 09:46:36 +01:00
LocalAI [bot]
23df29fbd3
chore: ⬆️ Update leejet/stable-diffusion.cpp to 9be0b91927dfa4007d053df72dea7302990226bb ( #7895 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-06 22:18:53 +01:00
LocalAI [bot]
fb9879949c
chore: ⬆️ Update ggml-org/llama.cpp to ccbc84a5374bab7a01f68b129411772ddd8e7c79 ( #7894 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-06 22:18:35 +01:00
Manish Dewangan
1642b39cb8
[gallery] add JSON schema for gallery model specification ( #7890 )
...
Add JSON Schema for gallery model specification
Signed-off-by: devmanishofficial <devmanishofficial@gmail.com >
2026-01-06 22:10:43 +01:00
Richard Palethorpe
e6ba26c3e7
chore: Update to Ubuntu24.04 (cont #7423 ) ( #7769 )
...
* ci(workflows): bump GitHub Actions images to Ubuntu 24.04
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* ci(workflows): remove CUDA 11.x support from GitHub Actions (incompatible with ubuntu:24.04)
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* ci(workflows): bump GitHub Actions CUDA support to 12.9
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* build(docker): bump base image to ubuntu:24.04 and adjust Vulkan SDK/packages
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* fix(backend): correct context paths for Python backends in workflows, Makefile and Dockerfile
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* chore(make): disable parallel backend builds to avoid race conditions
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* chore(make): export CUDA_MAJOR_VERSION and CUDA_MINOR_VERSION for override
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* build(backend): update backend Dockerfiles to Ubuntu 24.04
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* chore(backend): add ROCm env vars and default AMDGPU_TARGETS for hipBLAS builds
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* chore(chatterbox): bump ROCm PyTorch to 2.9.1+rocm6.4 and update index URL; align hipblas requirements
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* chore: add local-ai-launcher to .gitignore
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* ci(workflows): fix backends GitHub Actions workflows after rebase
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* build(docker): use build-time UBUNTU_VERSION variable
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* chore(docker): remove libquadmath0 from requirements-stage base image
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* chore(make): add backends/vllm to .NOTPARALLEL to prevent parallel builds
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* fix(docker): correct CUDA installation steps in backend Dockerfiles
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* chore(backend): update ROCm to 6.4 and align Python hipblas requirements
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* ci(workflows): switch GitHub Actions runners to Ubuntu-24.04 for CUDA on arm64 builds
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* build(docker): update base image and backend Dockerfiles for Ubuntu 24.04 compatibility on arm64
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* build(backend): increase timeout for uv installs behind slow networks on backend/Dockerfile.python
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* ci(workflows): switch GitHub Actions runners to Ubuntu-24.04 for vibevoice backend
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* ci(workflows): fix failing GitHub Actions runners
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
* fix: Allow FROM_SOURCE to be unset, use upstream Intel images etc.
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* chore(build): rm all traces of CUDA 11
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* chore(build): Add Ubuntu codename as an argument
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
Signed-off-by: Richard Palethorpe <io@richiejp.com >
Co-authored-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com >
2026-01-06 15:26:42 +01:00
Ettore Di Giacinto
26c4f80d1b
chore(llama.cpp/flags): simplify conditionals ( #7887 )
...
If ggml handle conditionals correctly we don't need to handle it here.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-06 15:02:20 +01:00
coffeerunhobby
5add7b47f5
fix: BMI2 crash on AVX-only CPUs (Intel Ivy Bridge/Sandy Bridge) ( #7864 )
...
* Fix BMI2 crash on AVX-only CPUs (Intel Ivy Bridge/Sandy Bridge)
Signed-off-by: coffeerunhobby <coffeerunhobby@users.noreply.github.com >
* Address feedback from review
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: coffeerunhobby <coffeerunhobby@users.noreply.github.com >
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Co-authored-by: coffeerunhobby <coffeerunhobby@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@localai.io >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-06 00:13:48 +00:00
Ettore Di Giacinto
3244ccc224
chore(image-ui): simplify interface ( #7882 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-05 23:20:28 +01:00
LocalAI [bot]
4f7b6b0bff
chore: ⬆️ Update ggml-org/llama.cpp to e443fbcfa51a8a27b15f949397ab94b5e87b2450 ( #7881 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-05 22:55:40 +01:00
LocalAI [bot]
3a629cea2f
chore: ⬆️ Update ggml-org/whisper.cpp to 679bdb53dbcbfb3e42685f50c7ff367949fd4d48 ( #7879 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-05 22:55:16 +01:00
LocalAI [bot]
f917feda29
chore: ⬆️ Update leejet/stable-diffusion.cpp to c5602a676caff5fe5a9f3b76b2bc614faf5121a5 ( #7880 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-05 22:54:56 +01:00
dependabot[bot]
e2018cdc8f
chore(deps): bump github.com/labstack/echo/v4 from 4.14.0 to 4.15.0 ( #7875 )
...
Bumps [github.com/labstack/echo/v4](https://github.com/labstack/echo ) from 4.14.0 to 4.15.0.
- [Release notes](https://github.com/labstack/echo/releases )
- [Changelog](https://github.com/labstack/echo/blob/master/CHANGELOG.md )
- [Commits](https://github.com/labstack/echo/compare/v4.14.0...v4.15.0 )
---
updated-dependencies:
- dependency-name: github.com/labstack/echo/v4
dependency-version: 4.15.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-05 22:54:30 +01:00
Manish Dewangan
a3b8a94187
fix(ui): fix 404 on API menu link by pointing to index.html ( #7878 )
...
Signed-off-by: devmanishofficial <devmanishofficial@gmail.com >
2026-01-05 22:54:14 +01:00
dependabot[bot]
41de7d32ad
chore(deps): bump dependabot/fetch-metadata from 2.4.0 to 2.5.0 ( #7876 )
...
Bumps [dependabot/fetch-metadata](https://github.com/dependabot/fetch-metadata ) from 2.4.0 to 2.5.0.
- [Release notes](https://github.com/dependabot/fetch-metadata/releases )
- [Commits](https://github.com/dependabot/fetch-metadata/compare/v2.4.0...v2.5.0 )
---
updated-dependencies:
- dependency-name: dependabot/fetch-metadata
dependency-version: 2.5.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-05 20:10:07 +00:00
Richard Palethorpe
93364df0a8
chore(AGENTS.md): Add section to help with building backends ( #7871 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2026-01-05 18:25:52 +01:00
Ettore Di Giacinto
21c84f432f
feat(function): Add tool streaming, XML Tool Call Parsing Support ( #7865 )
...
* feat(function): Add XML Tool Call Parsing Support
Extend the function parsing system in LocalAI to support XML-style tool calls, similar to how JSON tool calls are currently parsed. This will allow models that return XML format (like <tool_call><function=name><parameter=key>value</parameter></function></tool_call>) to be properly parsed alongside text content.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* thinking before tool calls, more strict support for corner cases with no tools
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Support streaming tools
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Iterative JSON
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Iterative parsing
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Consume JSON marker
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixup
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* add tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fix pending TODOs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Don't run other parsing with ParseRegex
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-05 18:25:40 +01:00
LocalAI [bot]
9d3da0bed5
chore: ⬆️ Update ggml-org/llama.cpp to 4974bf53cf14073c7b66e1151348156aabd42cb8 ( #7861 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-05 00:10:18 +01:00
LocalAI [bot]
1b063b5595
chore: ⬆️ Update leejet/stable-diffusion.cpp to b90b1ee9cf84ea48b478c674dd2ec6a33fd504d6 ( #7862 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-04 23:52:01 +01:00
Ettore Di Giacinto
560bf50299
chore(Makefile): refactor common make targets ( #7858 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-04 21:12:50 +01:00
LocalAI [bot]
a7e155240b
chore: ⬆️ Update ggml-org/llama.cpp to e57f52334b2e8436a94f7e332462dfc63a08f995 ( #7848 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-04 10:27:45 +01:00
LocalAI [bot]
793e4907a2
feat(swagger): update swagger ( #7847 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-03 22:09:39 +01:00
Ettore Di Giacinto
d38811560c
chore(docs): add opencode, GHA, and realtime voice assistant examples
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-03 22:03:43 +01:00
Ettore Di Giacinto
33cc0b8e13
fix(chat/ui): record model name in history for consistency ( #7845 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-03 18:05:33 +01:00
lif
4cd95b8a9d
fix: Highly inconsistent agent response to cogito agent calling MCP server - Body "Invalid http method" ( #7790 )
...
* fix: resolve duplicate MCP route registration causing 50% failure rate
Fixes #7772
The issue was caused by duplicate registration of the MCP endpoint
/mcp/v1/chat/completions in both openai.go and localai.go, leading
to a race condition where requests would randomly hit different
handlers with incompatible behaviors.
Changes:
- Removed duplicate MCP route registration from openai.go
- Kept the localai.MCPStreamEndpoint as the canonical handler
- Added all three MCP route patterns for backward compatibility:
* /v1/mcp/chat/completions
* /mcp/v1/chat/completions
* /mcp/chat/completions
- Added comments to clarify route ownership and prevent future conflicts
- Fixed formatting in ui_api.go
The localai.MCPStreamEndpoint handler is more feature-complete as it
supports both streaming and non-streaming modes, while the removed
openai.MCPCompletionEndpoint only supported synchronous requests.
This eliminates the ~50% failure rate where the cogito library would
receive "Invalid http method" errors when internal HTTP requests were
routed to the wrong handler.
🤖 Generated with [Claude Code](https://claude.com/claude-code )
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com >
Signed-off-by: majiayu000 <1835304752@qq.com >
* Address feedback from review
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: majiayu000 <1835304752@qq.com >
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com >
Co-authored-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-03 15:43:23 +01:00
LocalAI [bot]
8c504113a2
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7840 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-03 08:42:05 +01:00
coffeerunhobby
666d110714
fix: Prevent BMI2 instruction crash on AVX-only CPUs ( #7817 )
...
* Fix: Prevent BMI2 instruction crash on AVX-only CPUs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: apply no-bmi flags on non-darwin
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Co-authored-by: coffeerunhobby <coffeerunhobby@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-03 08:36:55 +01:00
LocalAI [bot]
641606ae93
chore: ⬆️ Update ggml-org/llama.cpp to 706e3f93a60109a40f1224eaf4af0d59caa7c3ae ( #7836 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-02 21:26:37 +00:00
Ettore Di Giacinto
5f6c941399
fix(llama.cpp/mmproj): fix loading mmproj in nested sub-dirs different from model path ( #7832 )
...
fix(mmproj): fix loading mmproj in nested sub-dirs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-02 20:17:30 +01:00
LocalAI [bot]
1639fc6309
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7831 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-02 15:10:00 +01:00
Ettore Di Giacinto
841e8f6d47
fix(image-gen): fix scrolling issues ( #7829 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-02 09:05:49 +01:00
LocalAI [bot]
fd152c97c0
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7826 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-02 08:45:43 +01:00
LocalAI [bot]
949de04052
chore: ⬆️ Update ggml-org/llama.cpp to ced765be44ce173c374f295b3c6f4175f8fd109b ( #7822 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-02 08:44:49 +01:00
Ettore Di Giacinto
76cfe1f367
feat(image-gen/UI): move controls to the left, make the page more compact ( #7823 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-01 22:07:42 +01:00
LocalAI [bot]
5ee6c1810b
feat(swagger): update swagger ( #7820 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-01 21:16:38 +01:00
LocalAI [bot]
7db79aadfa
chore(model-gallery): ⬆️ update checksum ( #7821 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-01 21:16:11 +01:00
nold
dee48679b4
Fix(gallery): Updated checksums for qwen3-vl-30b instruct & thinking ( #7819 )
...
* Fix(gallery): SHA256 hashes for qwen3-vl-30b-instruct
Signed-off-by: nold <Nold360@users.noreply.github.com >
* Fix(gallery): SHA256 checksums for qwen3-vl-30b-thinking
Signed-off-by: nold <Nold360@users.noreply.github.com >
---------
Signed-off-by: nold <Nold360@users.noreply.github.com >
2026-01-01 20:33:55 +01:00
LocalAI [bot]
94b47a9310
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7816 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-01 19:20:26 +01:00
LocalAI [bot]
bc3e8793ed
chore: ⬆️ Update ggml-org/llama.cpp to 13814eb370d2f0b70e1830cc577b6155b17aee47 ( #7809 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-31 23:04:01 +01:00
LocalAI [bot]
91978bb3a5
chore: ⬆️ Update ggml-org/whisper.cpp to e9898ddfb908ffaa7026c66852a023889a5a7202 ( #7810 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-31 22:59:05 +01:00
Ettore Di Giacinto
797f27f09f
feat(UI): image generation improvements ( #7804 )
...
* chore: drop mode from image generation(unused)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat(UI): improve image generation front-end
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat(UI): only ref images. files is to be deprecated
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* do not override default steps
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-31 21:59:46 +01:00
LocalAI [bot]
3f1631aa87
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7807 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-31 19:29:59 +01:00
LocalAI [bot]
dad509637e
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7801 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-31 09:18:35 +01:00
LocalAI [bot]
218f3a126a
chore: ⬆️ Update ggml-org/llama.cpp to 0f89d2ecf14270f45f43c442e90ae433fd82dab1 ( #7795 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-31 08:53:41 +01:00
Ettore Di Giacinto
be77a845fa
fix(gallery agent): change model
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-30 22:34:25 +00:00
Ettore Di Giacinto
ca32286022
fix(gallery agent): change model
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-30 22:27:48 +00:00
Ettore Di Giacinto
1f592505dd
fix(gallery agent): change model
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-30 22:22:45 +00:00
Ettore Di Giacinto
b3bc623eb3
fix(gallery agent): fixups
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-30 22:18:02 +00:00
Ettore Di Giacinto
e56391cf14
Add individual sponsors acknowledgment in README
...
Added a section to acknowledge individual sponsors and their contributions.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-30 23:01:22 +01:00
Ettore Di Giacinto
ef3ffe4a4e
fix(gallery agent): fixups
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-30 21:56:54 +00:00
Ettore Di Giacinto
3cffde2cd5
fix(gallery agent): skip model selection if only one
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-30 21:53:37 +00:00
LocalAI [bot]
234bf7e2ad
feat(swagger): update swagger ( #7794 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-30 21:05:01 +00:00
lif
ba73d2e759
fix: Failed to download checksums.txt when using launch to install localai ( #7788 )
...
* fix: add retry logic and fallback for checksums.txt download
- Add HTTP client with 30s timeout to ReleaseManager
- Implement downloadFileWithRetry with 3 attempts and exponential backoff
- Allow manual checksum placement at ~/.localai/checksums/checksums-<version>.txt
- Continue installation with warning if checksum download/verification fails
- Add test for HTTPClient initialization
- Fix linter error in systray_manager.go
Fixes #7385
Signed-off-by: majiayu000 <1835304752@qq.com >
* fix: add retry logic and improve checksums.txt download handling
This commit addresses issue #7385 by implementing:
- Retry logic (3 attempts) for checksum file downloads
- Fallback to manually placed checksum files
- Option to proceed with installation if checksums unavailable (with warnings)
- Fixed resource leaks in download retry loop
- Added configurable HTTP client with 30s timeout
The installation will now be more resilient to network issues while
maintaining security through checksum verification when available.
Signed-off-by: majiayu000 <1835304752@qq.com >
* fix: check for existing checksum file before downloading
This commit addresses the review feedback from mudler on PR #7788 .
The code now checks if there's already a checksum file (either manually
placed or previously downloaded) and honors that, skipping download
entirely in such case.
Changes:
- Check for existing checksum file at ~/.localai/checksums/checksums-<version>.txt first
- Check for existing downloaded checksum file at binary path
- Only attempt to download if no existing checksum file is found
- This prevents unnecessary network requests and honors user-placed checksums
Signed-off-by: majiayu000 <1835304752@qq.com >
🤖 Generated with Claude Code
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com >
---------
Signed-off-by: majiayu000 <1835304752@qq.com >
Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com >
2025-12-30 18:33:44 +01:00
Ettore Di Giacinto
592697216b
Revert "chore(deps): bump securego/gosec from 2.22.9 to 2.22.11" ( #7789 )
...
Revert "chore(deps): bump securego/gosec from 2.22.9 to 2.22.11 (#7774 )"
This reverts commit 0c16f55b45 .
2025-12-30 09:58:13 +01:00
lif
8bd7143a44
fix: propagate validation errors ( #7787 )
...
fix: validate MCP configuration in model config
Fixes #7334
The Validate() function was not checking if MCP configuration
(mcp.stdio and mcp.remote) contains valid JSON. This caused
malformed JSON with missing commas to be silently accepted.
Changes:
- Add MCP configuration validation to ModelConfig.Validate()
- Properly report validation errors instead of discarding them
- Add test cases for valid and invalid MCP configurations
The fix ensures that malformed JSON in MCP config sections
will now be caught and reported during validation.
🤖 Generated with [Claude Code](https://claude.com/claude-code )
Signed-off-by: majiayu000 <1835304752@qq.com >
Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com >
2025-12-30 09:54:27 +01:00
lif
0d0ef0121c
fix: Usage for image generation is incorrect (and causes error in LiteLLM) ( #7786 )
...
* fix: Add usage fields to image generation response for OpenAI API compatibility
Fixes #7354
Added input_tokens, output_tokens, and input_tokens_details fields to the
image generation API response to comply with OpenAI's image generation API
specification. This resolves validation errors in LiteLLM and the OpenAI SDK.
Changes:
- Added InputTokensDetails struct with text_tokens and image_tokens fields
- Extended OpenAIUsage struct with input_tokens, output_tokens, and input_tokens_details
- Updated ImageEndpoint to populate usage object with required fields
- Updated InpaintingEndpoint to populate usage object with required fields
- All fields initialized to 0 as per current behavior
🤖 Generated with [Claude Code](https://claude.com/claude-code )
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com >
Signed-off-by: majiayu000 <1835304752@qq.com >
* fix: Correct usage field types for image generation API compatibility
Changed InputTokens and OutputTokens from pointer types (*int) to
regular int types to match OpenAI API specification. This fixes
validation errors with LiteLLM and OpenAI SDK when parsing image
generation responses.
Fixes #7354
🤖 Generated with [Claude Code](https://claude.com/claude-code )
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com >
Signed-off-by: majiayu000 <1835304752@qq.com >
---------
Signed-off-by: majiayu000 <1835304752@qq.com >
Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com >
2025-12-30 09:53:05 +01:00
lif
d7b2eee08f
fix: add nil checks before mergo.Merge to prevent panic in gallery model installation ( #7785 )
...
Fixes #7420
Added nil checks before calling mergo.Merge in InstallModelFromGallery and InstallModel
functions to prevent panic when req.Overrides or configOverrides are nil. The panic was
occurring at models.go:248 during Qwen-Image-Edit gallery model download.
Changes:
- Added nil check for req.Overrides before merging in InstallModelFromGallery (line 126)
- Added nil check for configOverrides before merging in InstallModel (line 248)
- Added test case to verify nil configOverrides are handled without panic
Signed-off-by: majiayu000 <1835304752@qq.com >
2025-12-30 09:51:45 +01:00
LocalAI [bot]
bc8ec5cb39
chore: ⬆️ Update ggml-org/llama.cpp to c9a3b40d6578f2381a1373d10249403d58c3c5bd ( #7778 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-30 08:27:16 +01:00
dependabot[bot]
3f38fecdfc
chore(deps): bump github.com/modelcontextprotocol/go-sdk from 1.1.0 to 1.2.0 ( #7776 )
...
chore(deps): bump github.com/modelcontextprotocol/go-sdk
Bumps [github.com/modelcontextprotocol/go-sdk](https://github.com/modelcontextprotocol/go-sdk ) from 1.1.0 to 1.2.0.
- [Release notes](https://github.com/modelcontextprotocol/go-sdk/releases )
- [Commits](https://github.com/modelcontextprotocol/go-sdk/compare/v1.1.0...v1.2.0 )
---
updated-dependencies:
- dependency-name: github.com/modelcontextprotocol/go-sdk
dependency-version: 1.2.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-29 22:15:29 +01:00
dependabot[bot]
20a4199229
chore(deps): bump github.com/schollz/progressbar/v3 from 3.18.0 to 3.19.0 ( #7775 )
...
chore(deps): bump github.com/schollz/progressbar/v3
Bumps [github.com/schollz/progressbar/v3](https://github.com/schollz/progressbar ) from 3.18.0 to 3.19.0.
- [Release notes](https://github.com/schollz/progressbar/releases )
- [Commits](https://github.com/schollz/progressbar/compare/v3.18.0...v3.19.0 )
---
updated-dependencies:
- dependency-name: github.com/schollz/progressbar/v3
dependency-version: 3.19.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-29 22:15:11 +01:00
Ettore Di Giacinto
ded9955881
chore(ci): do not select models if we have only 1 result
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-29 22:14:14 +01:00
dependabot[bot]
cf78f9a2a8
chore(deps): bump google.golang.org/grpc from 1.77.0 to 1.78.0 ( #7777 )
...
Bumps [google.golang.org/grpc](https://github.com/grpc/grpc-go ) from 1.77.0 to 1.78.0.
- [Release notes](https://github.com/grpc/grpc-go/releases )
- [Commits](https://github.com/grpc/grpc-go/compare/v1.77.0...v1.78.0 )
---
updated-dependencies:
- dependency-name: google.golang.org/grpc
dependency-version: 1.78.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-29 21:03:57 +01:00
dependabot[bot]
0c16f55b45
chore(deps): bump securego/gosec from 2.22.9 to 2.22.11 ( #7774 )
...
Bumps [securego/gosec](https://github.com/securego/gosec ) from 2.22.9 to 2.22.11.
- [Release notes](https://github.com/securego/gosec/releases )
- [Commits](https://github.com/securego/gosec/compare/v2.22.9...v2.22.11 )
---
updated-dependencies:
- dependency-name: securego/gosec
dependency-version: 2.22.11
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-29 19:18:29 +00:00
Richard Palethorpe
0b80167912
chore: ⬆️ Update leejet/stable-diffusion.cpp to 4ff2c8c74bd17c2cfffe3a01be77743fb3efba2f ( #7771 )
...
* ⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
* fix: Add KL_OPTIMAL scheduler, pass sampler to default scheduler for LCM and fixup other refactorings from upstream
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* Delete backend/go/stablediffusion-ggml/compile_commands.json
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
---------
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Signed-off-by: Richard Palethorpe <io@richiejp.com >
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-29 19:06:35 +01:00
Richard Palethorpe
99b5c5f156
feat(api): Allow tracing of requests and responses ( #7609 )
...
* feat(api): Allow tracing of requests and responses
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* feat(traces): Add traces UI
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-12-29 11:06:06 +01:00
Ettore Di Giacinto
9ab812a8e8
chore(ci): be more precise when detecting existing models ( #7767 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-29 10:06:42 +01:00
Ettore Di Giacinto
185a685211
fix(amd-gpu): correctly show total and used vram ( #7761 )
...
An example output of `rocm-smi --showproductname --showmeminfo vram --showuniqueid --csv`:
```
device,Unique ID,VRAM Total Memory (B),VRAM Total Used Memory (B),Card Series,Card Model,Card Vendor,Card SKU,Subsystem ID,Device Rev,Node ID,GUID,GFX Version
card0,0x9246____________,17163091968,692142080,Navi 21 [Radeon RX 6800/6800 XT / 6900 XT],0x73bf,Advanced Micro Devices Inc. [AMD/ATI],001,0x2406,0xc1,1,45534,gfx1030
card1,N/A,67108864,26079232,Raphael,0x164e,Advanced Micro Devices Inc. [AMD/ATI],RAPHAEL,0x364e,0xc6,2,52156,gfx1036
```
Total memory is actually showed before the total used memory as can be seen in https://github.com/LostRuins/koboldcpp/issues/1104#issuecomment-2321143507 .
This PR fixes https://github.com/mudler/LocalAI/issues/7724
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-29 07:57:07 +01:00
LocalAI [bot]
1a6fd0f7fc
chore: ⬆️ Update ggml-org/llama.cpp to 4ffc47cb2001e7d523f9ff525335bbe34b1a2858 ( #7760 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-28 21:10:39 +00:00
LocalAI [bot]
c95c482f36
chore: ⬆️ Update ggml-org/llama.cpp to a4bf35889eda36d3597cd0f8f333f5b8a2fcaefc ( #7751 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-27 21:09:12 +00:00
Ettore Di Giacinto
21c464c34f
fix(cli): import via CLI needs system state ( #7746 )
...
pass system state to application config to avoid nil pointer exception
during import.
Fixes: https://github.com/mudler/LocalAI/issues/7728
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-27 11:10:28 +01:00
LocalAI [bot]
ddf0281785
chore: ⬆️ Update ggml-org/llama.cpp to 7ac8902133da6eb390c4d8368a7d252279123942 ( #7740 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-26 21:44:34 +00:00
LocalAI [bot]
86c68c9623
chore: ⬆️ Update ggml-org/llama.cpp to 85c40c9b02941ebf1add1469af75f1796d513ef4 ( #7731 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-25 21:10:28 +00:00
Ettore Di Giacinto
c844b7ac58
feat: disable force eviction ( #7725 )
...
* feat: allow to set forcing backends eviction while requests are in flight
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat: try to make the request sit and retry if eviction couldn't be done
Otherwise calls that in order to pass would need to shutdown other
backends would just fail.
In this way instead we make the request sit and retry eviction until it
succeeds. The thresholds can be configured by the user.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* add tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* expose settings to CLI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Update docs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-25 14:26:18 +01:00
Ettore Di Giacinto
bb459e671f
fix(ui): correctly parse import errors ( #7726 )
...
errors are nested
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-25 10:43:12 +01:00
LocalAI [bot]
2fe6e278c8
chore: ⬆️ Update ggml-org/llama.cpp to c18428423018ed214c004e6ecaedb0cbdda06805 ( #7718 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-25 10:00:40 +01:00
LocalAI [bot]
ae69921d77
chore: ⬆️ Update ggml-org/whisper.cpp to 6114e692136bea917dc88a5eb2e532c3d133d963 ( #7717 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-25 10:00:24 +01:00
Ettore Di Giacinto
bf2f95c684
chore(docs): update docs with cuda 13 instructions and the new vibevoice backend
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-25 10:00:07 +01:00
LocalAI [bot]
94069f2751
docs: ⬆️ update docs version mudler/LocalAI ( #7716 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-24 21:06:02 +00:00
LocalAI [bot]
aadec0b8cb
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7712 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-24 13:00:54 +01:00
Ettore Di Giacinto
35d71cf25e
fix: remove duplicate logging line
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-24 09:35:18 +01:00
Ettore Di Giacinto
39a5a84e64
fix: include virtual config
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-24 09:30:29 +01:00
Ettore Di Giacinto
83ed16f325
chore(logging): be consistent and do not emit logs from echo ( #7710 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-24 09:22:27 +01:00
Ettore Di Giacinto
c8173f0f67
chore(gallery): cleanup old architectures
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-24 09:14:03 +01:00
LocalAI [bot]
6dc2dbc835
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7707 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-24 08:34:18 +01:00
Ettore Di Giacinto
0a168830ea
chore(deps): Bump llama.cpp to '5b6c9bc0f3c8f55598b9999b65aff7ce4119bc15' and refactor usage of base params ( #7706 )
...
* chore(deps): Bump llama.cpp to '5b6c9bc0f3c8f55598b9999b65aff7ce4119bc15' and refactor usage of base params
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore: update AGENTS.md
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-24 00:28:27 +01:00
LocalAI [bot]
96d3f0ebc8
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7700 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-23 08:53:18 +01:00
Ettore Di Giacinto
b8aacb39e8
Revert "chore(deps): bump securego/gosec from 2.22.9 to 2.22.11" ( #7698 )
...
Revert "chore(deps): bump securego/gosec from 2.22.9 to 2.22.11 (#7690 )"
This reverts commit b698033ef9 .
2025-12-22 23:58:42 +01:00
Ettore Di Giacinto
b36a7593fa
chore(gallery): cleanup old (superseded) archs
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-22 22:55:53 +00:00
Ettore Di Giacinto
1ab91edc08
chore(gallery): cleanup old (superseded) archs
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-22 22:53:29 +00:00
Ettore Di Giacinto
31f4e0c46d
chore(gallery agent): various fixups ( #7697 )
...
* chore(ci/agent): fix formatting issues
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore: get icon from readme/hf and prepend to the gallery file
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-22 23:46:40 +01:00
dependabot[bot]
07c80fba88
chore(deps): bump github.com/containerd/containerd from 1.7.29 to 1.7.30 ( #7692 )
...
Bumps [github.com/containerd/containerd](https://github.com/containerd/containerd ) from 1.7.29 to 1.7.30.
- [Release notes](https://github.com/containerd/containerd/releases )
- [Changelog](https://github.com/containerd/containerd/blob/main/RELEASES.md )
- [Commits](https://github.com/containerd/containerd/compare/v1.7.29...v1.7.30 )
---
updated-dependencies:
- dependency-name: github.com/containerd/containerd
dependency-version: 1.7.30
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-22 22:43:42 +01:00
dependabot[bot]
9256a21d2c
chore(deps): bump github.com/jaypipes/ghw from 0.21.1 to 0.21.2 ( #7694 )
...
Bumps [github.com/jaypipes/ghw](https://github.com/jaypipes/ghw ) from 0.21.1 to 0.21.2.
- [Release notes](https://github.com/jaypipes/ghw/releases )
- [Commits](https://github.com/jaypipes/ghw/compare/v0.21.1...v0.21.2 )
---
updated-dependencies:
- dependency-name: github.com/jaypipes/ghw
dependency-version: 0.21.2
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-22 22:43:00 +01:00
dependabot[bot]
b3a81292c1
chore(deps): bump github.com/mudler/cogito from 0.7.1 to 0.7.2 ( #7691 )
...
Bumps [github.com/mudler/cogito](https://github.com/mudler/cogito ) from 0.7.1 to 0.7.2.
- [Release notes](https://github.com/mudler/cogito/releases )
- [Commits](https://github.com/mudler/cogito/compare/v0.7.1...v0.7.2 )
---
updated-dependencies:
- dependency-name: github.com/mudler/cogito
dependency-version: 0.7.2
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-22 22:42:35 +01:00
dependabot[bot]
5fc0cafd86
chore(deps): bump github.com/mudler/xlog from 0.0.3 to 0.0.4 ( #7695 )
...
Bumps [github.com/mudler/xlog](https://github.com/mudler/xlog ) from 0.0.3 to 0.0.4.
- [Release notes](https://github.com/mudler/xlog/releases )
- [Commits](https://github.com/mudler/xlog/compare/v0.0.3...v0.0.4 )
---
updated-dependencies:
- dependency-name: github.com/mudler/xlog
dependency-version: 0.0.4
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-22 22:42:08 +01:00
Richard Palethorpe
9783aeaef5
chore: Add AGENTS.md ( #7688 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-12-22 22:41:33 +01:00
dependabot[bot]
b698033ef9
chore(deps): bump securego/gosec from 2.22.9 to 2.22.11 ( #7690 )
...
Bumps [securego/gosec](https://github.com/securego/gosec ) from 2.22.9 to 2.22.11.
- [Release notes](https://github.com/securego/gosec/releases )
- [Commits](https://github.com/securego/gosec/compare/v2.22.9...v2.22.11 )
---
updated-dependencies:
- dependency-name: securego/gosec
dependency-version: 2.22.11
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-22 19:09:06 +00:00
Ettore Di Giacinto
fc6057a952
chore(deps): bump llama.cpp to '0e1ccf15c7b6d05c720551b537857ecf6194d420' ( #7684 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-22 09:50:42 +01:00
Ettore Di Giacinto
8b3e0ebf8a
chore: allow to set local-ai log format, default to custom one ( #7679 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-21 21:21:59 +01:00
Mikhail Khludnev
53b0530275
docs: Add langchain-localai integration package to documentation ( #7677 )
...
Add `langchain-localai` integration package to documentation
Signed-off-by: Mikhail Khludnev <mkhludnev@users.noreply.github.com >
2025-12-21 21:02:14 +01:00
Ettore Di Giacinto
99d301fcf9
chore(deps): bump xlog to v0.0.3 ( #7675 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-21 19:36:54 +01:00
Ettore Di Giacinto
c37785b78c
chore(refactor): move logging to common package based on slog ( #7668 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-21 19:33:13 +01:00
LocalAI [bot]
38cde81ff4
chore: ⬆️ Update ggml-org/llama.cpp to 52ab19df633f3de5d4db171a16f2d9edd2342fec ( #7665 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-20 21:09:15 +00:00
Ettore Di Giacinto
8ba5d6e796
chore(cogito): respect application-level logging and propagate ( #7656 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-19 23:02:08 +01:00
Ettore Di Giacinto
8b6f443cd5
chore(deps): bump cogito to latest and adapt API changes ( #7655 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-19 22:50:18 +01:00
LocalAI [bot]
626057bcca
chore: ⬆️ Update ggml-org/llama.cpp to ce734a8a2f9fb6eb4f0383ab1370a1b0014ab787 ( #7654 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-19 21:15:39 +00:00
LocalAI [bot]
aa0efeb0a8
chore: ⬆️ Update ggml-org/whisper.cpp to 6c22e792cb0ee155b6587ce71a8410c3aeb06949 ( #7644 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-19 09:26:41 +01:00
LocalAI [bot]
f25ac00bca
chore: ⬆️ Update ggml-org/llama.cpp to f9ec8858edea4a0ecfea149d6815ebfb5ecc3bcd ( #7642 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-18 21:17:14 +00:00
Richard Palethorpe
c3494a0927
chore: ⬆️ Update leejet/stable-diffusion.cpp to bda7fab9f208dff4b67179a68f694b6ddec13326 ( #7639 )
...
* ⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
* fix(stablediffusion-ggml): Don't set removed lora model dir
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Signed-off-by: Richard Palethorpe <io@richiejp.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-18 20:52:22 +01:00
Richard Palethorpe
716dba94b4
feat(whisper): Add prompt to condition transcription output ( #7624 )
...
* chore(makefile): Add buildargs for sd and cuda when building backend
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* feat(whisper): Add prompt to condition transcription output
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-12-18 14:40:45 +01:00
mintyleaf
247983265d
fix(uri): consider subfolders when expanding huggingface URLs ( #7634 )
...
Update uri.go
Signed-off-by: mintyleaf <mintyleafdev@gmail.com >
2025-12-18 09:12:16 +01:00
LocalAI [bot]
5515119a7e
chore: ⬆️ Update ggml-org/llama.cpp to d37fc935059211454e9ad2e2a44e8ed78fd6d1ce ( #7629 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-18 09:07:09 +01:00
LocalAI [bot]
4535e7dfc4
chore: ⬆️ Update ggml-org/whisper.cpp to 3e79e73eee32e924fbd34587f2f2ac5a45a26b61 ( #7630 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-18 09:06:48 +01:00
Ettore Di Giacinto
d8ee02e607
chore(tests): simplify tests and run intensive ones only once
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-18 09:05:58 +01:00
Ettore Di Giacinto
2d2e8759bb
fix(ci): remove specific version for grpcio packages ( #7627 )
...
Updated grpcio-tools and grpcio installation to latest version.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-17 19:18:07 +01:00
LocalAI [bot]
14bb65b57b
chore: ⬆️ Update ggml-org/llama.cpp to ef83fb8601229ff650d952985be47e82d644bfaa ( #7611 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-17 08:32:42 +01:00
Ettore Di Giacinto
3ca90876f1
chore(memory detection): do not use go-sigar as requires CGO on darwin ( #7618 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-16 23:10:42 +01:00
Ettore Di Giacinto
f251bdee64
chore: fixup tests with defaults from constants
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-16 21:26:55 +00:00
Ettore Di Giacinto
61afe4ca60
chore: drop drawin-x86_64 support ( #7616 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-16 21:22:15 +01:00
Ettore Di Giacinto
424c95edba
fix: correctly propagate error during model load ( #7610 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-16 18:26:54 +01:00
Ettore Di Giacinto
b348a99b03
chore: move defaults to constants
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-16 17:40:51 +01:00
Ettore Di Giacinto
f3c70a96ba
chore(memory-reclaimer): use saner defaults
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-16 16:25:09 +01:00
Ettore Di Giacinto
e3e5f59965
fix(ram): do not read from cgroup ( #7606 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-16 13:28:11 +01:00
blightbow
67baf66555
feat(mlx): add thread-safe LRU prompt cache and min_p/top_k sampling ( #7556 )
...
* feat(mlx): add thread-safe LRU prompt cache
Port mlx-lm's LRUPromptCache to fix race condition where concurrent
requests corrupt shared KV cache state. The previous implementation
used a single prompt_cache instance shared across all requests.
Changes:
- Add backend/python/common/mlx_cache.py with ThreadSafeLRUPromptCache
- Modify backend.py to use per-request cache isolation via fetch/insert
- Add prefix matching for cache reuse across similar prompts
- Add LRU eviction (default 10 entries, configurable)
- Add concurrency and cache unit tests
The cache uses a trie-based structure for efficient prefix matching,
allowing prompts that share common prefixes to reuse cached KV states.
Thread safety is provided via threading.Lock.
New configuration options:
- max_cache_entries: Maximum LRU cache entries (default: 10)
- max_kv_size: Maximum KV cache size per entry (default: None)
🤖 Generated with [Claude Code](https://claude.com/claude-code )
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
Signed-off-by: Blightbow <blightbow@users.noreply.github.com >
* feat(mlx): add min_p and top_k sampler support
Add MinP field to proto (field 52) following the precedent set by
other non-OpenAI sampling parameters like TopK, TailFreeSamplingZ,
TypicalP, and Mirostat.
Changes:
- backend.proto: Add float MinP field for min-p sampling
- backend.py: Extract and pass min_p and top_k to mlx_lm sampler
(top_k was in proto but not being passed)
- test.py: Fix test_sampling_params to use valid proto fields and
switch to MLX-compatible model (mlx-community/Llama-3.2-1B-Instruct)
🤖 Generated with [Claude Code](https://claude.com/claude-code )
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
Signed-off-by: Blightbow <blightbow@users.noreply.github.com >
* refactor(mlx): move mlx_cache.py from common to mlx backend
The ThreadSafeLRUPromptCache is only used by the mlx backend. After
evaluating mlx-vlm, it was determined that the cache cannot be shared
because mlx-vlm's generate/stream_generate functions don't support
the prompt_cache parameter that mlx_lm provides.
- Move mlx_cache.py from backend/python/common/ to backend/python/mlx/
- Remove sys.path manipulation from backend.py and test.py
- Fix test assertion to expect "MLX model loaded successfully"
🤖 Generated with [Claude Code](https://claude.com/claude-code )
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
Signed-off-by: Blightbow <blightbow@users.noreply.github.com >
* test(mlx): add comprehensive cache tests and document upstream behavior
Added comprehensive unit tests (test_mlx_cache.py) covering all cache
operation modes:
- Exact match
- Shorter prefix match
- Longer prefix match with trimming
- No match scenarios
- LRU eviction and access order
- Reference counting and deep copy behavior
- Multi-model namespacing
- Thread safety with data integrity verification
Documents upstream mlx_lm/server.py behavior: single-token prefixes are
deliberately not matched (uses > 0, not >= 0) to allow longer cached
sequences to be preferred for trimming. This is acceptable because real
prompts with chat templates are always many tokens.
Removed weak unit tests from test.py that only verified "no exception
thrown" rather than correctness.
🤖 Generated with [Claude Code](https://claude.com/claude-code )
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
Signed-off-by: Blightbow <blightbow@users.noreply.github.com >
* chore(mlx): remove unused MinP proto field
The MinP field was added to PredictOptions but is not populated by the
Go frontend/API. The MLX backend uses getattr with a default value,
so it works without the proto field.
🤖 Generated with [Claude Code](https://claude.com/claude-code )
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
Signed-off-by: Blightbow <blightbow@users.noreply.github.com >
---------
Signed-off-by: Blightbow <blightbow@users.noreply.github.com >
Co-authored-by: Blightbow <blightbow@users.noreply.github.com >
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com >
2025-12-16 11:27:46 +01:00
Ettore Di Giacinto
878c9d46d5
fix: improve ram estimation ( #7603 )
...
* fix: default to 10seconds of watchdog if runtime setting is malformed
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: use gosigar for RAM estimation
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-16 10:18:36 +01:00
Ettore Di Giacinto
b841a495da
Revert "chore(deps): bump securego/gosec from 2.22.9 to 2.22.11" ( #7602 )
...
Revert "chore(deps): bump securego/gosec from 2.22.9 to 2.22.11 (#7588 )"
This reverts commit 648dfc0389 .
2025-12-16 09:48:46 +01:00
Ettore Di Giacinto
f75903d7f7
Update latest project news in README
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-16 09:16:42 +01:00
Ettore Di Giacinto
50f9c9a058
feat(watchdog): add Memory resource reclaimer ( #7583 )
...
* feat(watchdog): add GPU reclaimer
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Handle vram calculation for unified memory devices
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Support RAM eviction, set watchdog interval from runtime settings
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-16 09:15:18 +01:00
dependabot[bot]
dbd25885c3
chore(deps): bump sentence-transformers from 5.1.0 to 5.2.0 in /backend/python/transformers ( #7594 )
...
chore(deps): bump sentence-transformers in /backend/python/transformers
Bumps [sentence-transformers](https://github.com/huggingface/sentence-transformers ) from 5.1.0 to 5.2.0.
- [Release notes](https://github.com/huggingface/sentence-transformers/releases )
- [Commits](https://github.com/huggingface/sentence-transformers/compare/v5.1.0...v5.2.0 )
---
updated-dependencies:
- dependency-name: sentence-transformers
dependency-version: 5.2.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-16 09:12:57 +01:00
dependabot[bot]
3d55055126
chore(deps): bump github.com/jaypipes/ghw from 0.20.0 to 0.21.1 ( #7591 )
...
Bumps [github.com/jaypipes/ghw](https://github.com/jaypipes/ghw ) from 0.20.0 to 0.21.1.
- [Release notes](https://github.com/jaypipes/ghw/releases )
- [Commits](https://github.com/jaypipes/ghw/compare/v0.20.0...v0.21.1 )
---
updated-dependencies:
- dependency-name: github.com/jaypipes/ghw
dependency-version: 0.21.1
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-16 08:16:05 +01:00
dependabot[bot]
af7ba2e3de
chore(deps): bump github.com/labstack/echo/v4 from 4.13.4 to 4.14.0 ( #7589 )
...
Bumps [github.com/labstack/echo/v4](https://github.com/labstack/echo ) from 4.13.4 to 4.14.0.
- [Release notes](https://github.com/labstack/echo/releases )
- [Changelog](https://github.com/labstack/echo/blob/master/CHANGELOG.md )
- [Commits](https://github.com/labstack/echo/compare/v4.13.4...v4.14.0 )
---
updated-dependencies:
- dependency-name: github.com/labstack/echo/v4
dependency-version: 4.14.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-16 08:15:41 +01:00
LocalAI [bot]
7a3b0bbfaa
chore: ⬆️ Update leejet/stable-diffusion.cpp to 200cb6f2ca07e40fa83b610a4e595f4da06ec709 ( #7597 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-16 08:15:15 +01:00
dependabot[bot]
648dfc0389
chore(deps): bump securego/gosec from 2.22.9 to 2.22.11 ( #7588 )
...
Bumps [securego/gosec](https://github.com/securego/gosec ) from 2.22.9 to 2.22.11.
- [Release notes](https://github.com/securego/gosec/releases )
- [Commits](https://github.com/securego/gosec/compare/v2.22.9...v2.22.11 )
---
updated-dependencies:
- dependency-name: securego/gosec
dependency-version: 2.22.11
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-16 01:49:11 +00:00
dependabot[bot]
b396413ad5
chore(deps): bump actions/download-artifact from 6 to 7 ( #7587 )
...
Bumps [actions/download-artifact](https://github.com/actions/download-artifact ) from 6 to 7.
- [Release notes](https://github.com/actions/download-artifact/releases )
- [Commits](https://github.com/actions/download-artifact/compare/v6...v7 )
---
updated-dependencies:
- dependency-name: actions/download-artifact
dependency-version: '7'
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-16 00:14:02 +01:00
dependabot[bot]
2ad928678c
chore(deps): bump peter-evans/create-pull-request from 7 to 8 ( #7586 )
...
Bumps [peter-evans/create-pull-request](https://github.com/peter-evans/create-pull-request ) from 7 to 8.
- [Release notes](https://github.com/peter-evans/create-pull-request/releases )
- [Commits](https://github.com/peter-evans/create-pull-request/compare/v7...v8 )
---
updated-dependencies:
- dependency-name: peter-evans/create-pull-request
dependency-version: '8'
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-16 00:13:42 +01:00
dependabot[bot]
9b27b53a50
chore(deps): bump github.com/onsi/ginkgo/v2 from 2.27.2 to 2.27.3 ( #7590 )
...
Bumps [github.com/onsi/ginkgo/v2](https://github.com/onsi/ginkgo ) from 2.27.2 to 2.27.3.
- [Release notes](https://github.com/onsi/ginkgo/releases )
- [Changelog](https://github.com/onsi/ginkgo/blob/master/CHANGELOG.md )
- [Commits](https://github.com/onsi/ginkgo/compare/v2.27.2...v2.27.3 )
---
updated-dependencies:
- dependency-name: github.com/onsi/ginkgo/v2
dependency-version: 2.27.3
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-15 22:58:45 +01:00
Ettore Di Giacinto
2387b266d8
chore(llama.cpp): Add Missing llama.cpp Options to gRPC Server ( #7584 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-15 21:55:20 +01:00
dependabot[bot]
0f2df23c61
chore(deps): bump actions/upload-artifact from 5 to 6 ( #7585 )
...
Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact ) from 5 to 6.
- [Release notes](https://github.com/actions/upload-artifact/releases )
- [Commits](https://github.com/actions/upload-artifact/compare/v5...v6 )
---
updated-dependencies:
- dependency-name: actions/upload-artifact
dependency-version: '6'
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-15 19:33:48 +00:00
Ettore Di Giacinto
8ac7e8c299
fix(chat-ui): model selection toggle and new chat ( #7574 )
...
Fixes a minor glitch that happens when switching model in from the chat
pane where the header was not getting updated. Besides, it allows to
create new chat directly when clicking from the management pane to the
model.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-14 22:29:11 +01:00
LocalAI [bot]
0f5cc4c07b
chore: ⬆️ Update ggml-org/llama.cpp to 5c8a717128cc98aa9e5b1c44652f5cf458fd426e ( #7573 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-14 22:21:54 +01:00
LocalAI [bot]
3e4e6777d8
chore: ⬆️ Update ggml-org/llama.cpp to 5266379bcae74214af397f36aa81b2a08b15d545 ( #7563 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-14 11:41:10 +01:00
Simon Redman
5de539ab07
fix(7355): Update llama-cpp grpc for v3 interface ( #7566 )
...
* fix(7355): Update llama-cpp grpc for v3 interface
Signed-off-by: Simon Redman <simon@ergotech.com >
* feat(llama-gprc): Trim whitespace from servers list
Signed-off-by: Simon Redman <simon@ergotech.com >
* Trim trailing spaces in grpc-server.cpp
Signed-off-by: Simon Redman <simon@ergotech.com >
---------
Signed-off-by: Simon Redman <simon@ergotech.com >
2025-12-14 11:40:33 +01:00
LocalAI [bot]
3013d1c7b5
chore: ⬆️ Update leejet/stable-diffusion.cpp to 43a70e819b9254dee0d017305d6992f6bb27f850 ( #7562 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-13 22:52:20 +01:00
LocalAI [bot]
073b3855d9
chore: ⬆️ Update ggml-org/whisper.cpp to 2551e4ce98db69027d08bd99bcc3f1a4e2ad2cef ( #7561 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-13 21:22:14 +00:00
Ettore Di Giacinto
e1874cdb54
feat(ui): add mask to install custom backends ( #7559 )
...
* feat: allow to install backends from URL in the WebUI and API
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* trace backends installations
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-13 19:11:32 +01:00
Ettore Di Giacinto
7790a24682
Revert "chore(deps): bump torch from 2.5.1+cxx11.abi to 2.7.1+cpu in /backend/python/diffusers in the pip group across 1 directory" ( #7558 )
...
Revert "chore(deps): bump torch from 2.5.1+cxx11.abi to 2.7.1+cpu in /backend…"
This reverts commit 1b4aa6f1be .
2025-12-13 17:04:46 +01:00
dependabot[bot]
1b4aa6f1be
chore(deps): bump torch from 2.5.1+cxx11.abi to 2.7.1+cpu in /backend/python/diffusers in the pip group across 1 directory ( #7549 )
...
chore(deps): bump torch
Bumps the pip group with 1 update in the /backend/python/diffusers directory: torch.
Updates `torch` from 2.5.1+cxx11.abi to 2.7.1+cpu
---
updated-dependencies:
- dependency-name: torch
dependency-version: 2.7.1+cpu
dependency-type: direct:production
dependency-group: pip
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-13 13:12:18 +00:00
Ettore Di Giacinto
504d954aea
Add chardet to requirements-l4t13.txt
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-13 12:59:03 +01:00
Ettore Di Giacinto
1383ad6d6d
Change runner from macOS-14 to macos-latest
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-13 10:11:27 +01:00
Ettore Di Giacinto
5e270ba5bd
Change runner from macOS-14 to macos-latest
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-13 10:10:47 +01:00
Ettore Di Giacinto
6d2a535813
chore(l4t13): use pytorch index ( #7546 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-13 10:04:57 +01:00
Ettore Di Giacinto
abfb0ff8fe
feat(stablediffusion-ggml): add lora support ( #7542 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-13 08:29:06 +01:00
LocalAI [bot]
2bd6faaff5
chore: ⬆️ Update leejet/stable-diffusion.cpp to 11ab095230b2b67210f5da4d901588d56c71fe3a ( #7539 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-12 21:31:13 +00:00
Ettore Di Giacinto
1a9f5da1b7
Update Discord badge with dynamic member count
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-12 12:50:55 +01:00
Ettore Di Giacinto
7f823fce7c
Update Discord badge in README.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-12 12:34:57 +01:00
Ettore Di Giacinto
fc5b9ebfcc
feat(loader): enhance single active backend to support LRU eviction ( #7535 )
...
* feat(loader): refactor single active backend support to LRU
This changeset introduces LRU management of loaded backends. Users can
set now a maximum number of models to be loaded concurrently, and, when
setting LocalAI in single active backend mode we set LRU to 1 for
backward compatibility.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore: add tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Update docs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-12 12:28:38 +01:00
LocalAI [bot]
c141a40e00
chore(model-gallery): ⬆️ update checksum ( #7530 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-12 08:16:04 +01:00
Ettore Di Giacinto
0b130fb811
fix(llama.cpp): handle corner cases with tool array content ( #7528 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-12 08:15:45 +01:00
LocalAI [bot]
0771a2d3ec
chore: ⬆️ Update ggml-org/llama.cpp to a81a569577cc38b32558958b048228150be63eae ( #7529 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-11 21:55:44 +00:00
Richard Palethorpe
9441eb509a
chore(makefile): Add buildargs for sd and cuda when building backend ( #7525 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-12-11 20:33:19 +01:00
Ettore Di Giacinto
8442f33712
chore(deps): bump stable-diffusion.cpp to '8823dc48bcc1598eb9671da7b69e45338d0cc5a5' ( #7524 )
...
* chore(deps): bump stable-diffusion.cpp to '8823dc48bcc1598eb9671da7b69e45338d0cc5a5'
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(Dockerfile.golang): Make curl noisy to see when download fails
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Signed-off-by: Richard Palethorpe <io@richiejp.com >
Co-authored-by: Richard Palethorpe <io@richiejp.com >
2025-12-11 20:32:25 +01:00
Ettore Di Giacinto
5dde7e9ac6
fix: make sure to close on errors ( #7521 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-11 14:03:20 +01:00
LocalAI [bot]
72621a1d1c
chore: ⬆️ Update ggml-org/llama.cpp to 4dff236a522bd0ed949331d6cb1ee2a1b3615c35 ( #7508 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-11 08:15:38 +01:00
Ettore Di Giacinto
3b5c2ea633
feat(ui): allow to order search results ( #7507 )
...
* feat(ui): improve table view and let items to be sorted
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* refactorings
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore: add tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore: use constants
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-11 00:11:33 +01:00
LocalAI [bot]
e1d060d147
chore: ⬆️ Update ggml-org/whisper.cpp to 9f5ed26e43c680bece09df7bdc8c1b7835f0e537 ( #7509 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-10 23:09:13 +01:00
Ettore Di Giacinto
32dcb58e89
feat(vibevoice): add new backend ( #7494 )
...
* feat(vibevoice): add backend
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore: add workflow and backend index
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(gallery): add vibevoice
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Use self-hosted for intel builds
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Pin python version for l4t
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-10 21:14:21 +01:00
LocalAI [bot]
ef44ace73f
chore: ⬆️ Update ggml-org/llama.cpp to 086a63e3a5d2dbbb7183a74db453459e544eb55a ( #7496 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-10 12:05:13 +01:00
Ettore Di Giacinto
f51d3e380b
fix(config): make syncKnownUsecasesFromString idempotent ( #7493 )
...
fix(config): correctly parse usecases from strings
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-09 21:08:22 +01:00
Ettore Di Giacinto
6cc5cac7b0
fix(downloader): do not download model files if not necessary ( #7492 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-09 19:08:10 +01:00
Ettore Di Giacinto
74ee1463fe
chore(deps/llama-cpp): bump to '2fa51c19b028180b35d316e9ed06f5f0f7ada2c1' ( #7484 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-09 15:41:37 +01:00
LocalAI [bot]
6c7b215687
chore: ⬆️ Update ggml-org/whisper.cpp to a8f45ab11d6731e591ae3d0230be3fec6c2efc91 ( #7483 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-09 08:33:30 +01:00
dependabot[bot]
5e0bc37de3
chore(deps): bump github.com/onsi/gomega from 1.38.2 to 1.38.3 ( #7475 )
...
Bumps [github.com/onsi/gomega](https://github.com/onsi/gomega ) from 1.38.2 to 1.38.3.
- [Release notes](https://github.com/onsi/gomega/releases )
- [Changelog](https://github.com/onsi/gomega/blob/master/CHANGELOG.md )
- [Commits](https://github.com/onsi/gomega/compare/v1.38.2...v1.38.3 )
---
updated-dependencies:
- dependency-name: github.com/onsi/gomega
dependency-version: 1.38.3
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-09 01:24:08 +00:00
dependabot[bot]
e28a00c952
chore(deps): bump go.opentelemetry.io/otel/exporters/prometheus from 0.60.0 to 0.61.0 ( #7477 )
...
chore(deps): bump go.opentelemetry.io/otel/exporters/prometheus
Bumps [go.opentelemetry.io/otel/exporters/prometheus](https://github.com/open-telemetry/opentelemetry-go ) from 0.60.0 to 0.61.0.
- [Release notes](https://github.com/open-telemetry/opentelemetry-go/releases )
- [Changelog](https://github.com/open-telemetry/opentelemetry-go/blob/main/CHANGELOG.md )
- [Commits](https://github.com/open-telemetry/opentelemetry-go/compare/exporters/prometheus/v0.60.0...exporters/prometheus/v0.61.0 )
---
updated-dependencies:
- dependency-name: go.opentelemetry.io/otel/exporters/prometheus
dependency-version: 0.61.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-08 23:43:13 +00:00
dependabot[bot]
08f9a52594
chore(deps): bump github.com/mudler/cogito from 0.5.1 to 0.6.0 ( #7474 )
...
Bumps [github.com/mudler/cogito](https://github.com/mudler/cogito ) from 0.5.1 to 0.6.0.
- [Release notes](https://github.com/mudler/cogito/releases )
- [Commits](https://github.com/mudler/cogito/compare/v0.5.1...v0.6.0 )
---
updated-dependencies:
- dependency-name: github.com/mudler/cogito
dependency-version: 0.6.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-08 22:40:33 +01:00
dependabot[bot]
bbce461f57
chore(deps): bump protobuf from 6.33.1 to 6.33.2 in /backend/python/transformers ( #7481 )
...
chore(deps): bump protobuf in /backend/python/transformers
Bumps [protobuf](https://github.com/protocolbuffers/protobuf ) from 6.33.1 to 6.33.2.
- [Release notes](https://github.com/protocolbuffers/protobuf/releases )
- [Commits](https://github.com/protocolbuffers/protobuf/commits )
---
updated-dependencies:
- dependency-name: protobuf
dependency-version: 6.33.2
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-08 22:13:18 +01:00
dependabot[bot]
22e13c362a
chore(deps): bump actions/stale from 10.1.0 to 10.1.1 ( #7473 )
...
Bumps [actions/stale](https://github.com/actions/stale ) from 10.1.0 to 10.1.1.
- [Release notes](https://github.com/actions/stale/releases )
- [Changelog](https://github.com/actions/stale/blob/main/CHANGELOG.md )
- [Commits](5f858e3efb...997185467f )
---
updated-dependencies:
- dependency-name: actions/stale
dependency-version: 10.1.1
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-08 21:15:37 +01:00
dependabot[bot]
6bd0442698
chore(deps): bump go.opentelemetry.io/otel/sdk/metric from 1.38.0 to 1.39.0 ( #7476 )
...
chore(deps): bump go.opentelemetry.io/otel/sdk/metric
Bumps [go.opentelemetry.io/otel/sdk/metric](https://github.com/open-telemetry/opentelemetry-go ) from 1.38.0 to 1.39.0.
- [Release notes](https://github.com/open-telemetry/opentelemetry-go/releases )
- [Changelog](https://github.com/open-telemetry/opentelemetry-go/blob/main/CHANGELOG.md )
- [Commits](https://github.com/open-telemetry/opentelemetry-go/compare/v1.38.0...v1.39.0 )
---
updated-dependencies:
- dependency-name: go.opentelemetry.io/otel/sdk/metric
dependency-version: 1.39.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-08 19:30:21 +00:00
Ettore Di Giacinto
0380bfe006
Enhance README with video and screenshots
...
Added YouTube video link and screenshots section to README.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-08 17:08:15 +01:00
Ettore Di Giacinto
00a05208bc
chore(docs): center video
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-08 16:59:11 +01:00
Ettore Di Giacinto
4a7cd256c9
Revise 'Screenshots' section to include video
...
Updated section title and added video link for LocalAI.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-08 16:56:34 +01:00
Ettore Di Giacinto
a27d0d151f
Embed YouTube video in documentation
...
Added an embedded YouTube video to the documentation.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-08 16:53:20 +01:00
Ettore Di Giacinto
03a17a2986
fix(paths): remove trailing slash from requests ( #7451 )
...
This removes any ambiguity from how paths are handled, and at the same
time it uniforms the ui paths with the other paths that don't have a
trailing slash
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-07 21:45:09 +01:00
Ettore Di Giacinto
8ca98c90ea
chore(importers/llama.cpp): add models to 'llama-cpp' subfolder ( #7450 )
...
This makes paths predictable, and avoids multiple model files to show in
the main view
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-07 21:44:57 +01:00
Ettore Di Giacinto
18b8956bd9
chore(gallery agent): strip thinking tags ( #7464 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-07 19:25:41 +01:00
Ettore Di Giacinto
262afd28a0
chore(gallery agent): summary now is at root of the git repository ( #7463 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-07 19:23:27 +01:00
LocalAI [bot]
5610384d8a
chore: ⬆️ Update ggml-org/llama.cpp to db97837385edfbc772230debbd49e5efae843a71 ( #7447 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-07 08:32:35 +01:00
rampa3
6aee29d18f
fix(ui): Update few links in web UI from 'browse' to '/browse/' ( #7445 )
...
* Update few links in web UI from 'browse' to '/browse/'
Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com >
* Update core/http/views/404.html
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
* Update core/http/views/error.html
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
* Update core/http/views/manage.html
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
---------
Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com >
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-06 22:40:26 +01:00
LocalAI [bot]
c3493e4917
chore: ⬆️ Update ggml-org/whisper.cpp to a88b93f85f08fc6045e5d8a8c3f94b7be0ac8bce ( #7448 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-06 21:26:25 +00:00
LocalAI [bot]
edf7141b9b
chore: ⬆️ Update ggml-org/llama.cpp to 8160b38a5fa8a25490ca33ffdd200cda51405688 ( #7438 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-06 13:35:24 +01:00
Ettore Di Giacinto
446b686470
Update model version in gallery-agent workflow
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-05 22:08:16 +01:00
Ettore Di Giacinto
b287944f07
Add Proto Dependencies installation step
...
Added steps to install protobuf and Go dependencies in the GitHub Actions workflow.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-05 21:40:36 +01:00
LocalAI [bot]
f3ae358689
chore(model-gallery): ⬆️ update checksum ( #7437 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-05 15:20:21 +01:00
Richard Palethorpe
c7aaeab683
fix(stablediffusion-ggml): Correct Z-Image model name ( #7436 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-12-05 14:57:39 +01:00
Ettore Di Giacinto
024aa6a55b
chore(deps): bump llama.cpp to 'bde188d60f58012ada0725c6dd5ba7c69fe4dd87' ( #7434 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-05 00:17:35 +01:00
Ettore Di Giacinto
7ce8a56e96
chore(ci/agent): correctly invoke go run
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-04 23:12:04 +01:00
Ettore Di Giacinto
3e9ed48432
chore(ci/agent): support quantization
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-04 22:56:35 +01:00
Ettore Di Giacinto
963796ff51
Update localai-github-action to version 1.1
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-04 22:50:55 +01:00
Ettore Di Giacinto
6bd9a304bc
Add local AI model to gallery agent workflow
...
Updated the GitHub Actions workflow to include the local AI model and modified environment variables for the gallery agent.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-04 22:43:31 +01:00
Ettore Di Giacinto
7990c7a401
chore(agent): update gallery agent to use importers
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-04 22:23:43 +01:00
LocalAI [bot]
4bb93b1c4c
chore(model-gallery): ⬆️ update checksum ( #7433 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-04 21:23:26 +01:00
Copilot
1abbedd732
feat(diffusers): implement dynamic pipeline loader to remove per-pipeline conditionals ( #7365 )
...
* Initial plan
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add dynamic loader for diffusers pipelines and refactor backend.py
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fix pipeline discovery error handling and test mock issue
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Address code review feedback: direct imports, better error handling, improved tests
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Address remaining code review feedback: specific exceptions, registry access, test imports
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add defensive fallback for DiffusionPipeline registry access
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Actually use dynamic pipeline loading for all pipelines in backend
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Use dynamic loader consistently for all pipelines including AutoPipelineForText2Image
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Move dynamic loader tests into test.py for CI compatibility
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Extend dynamic loader to discover any diffusers class type, not just DiffusionPipeline
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add AutoPipeline classes to pipeline registry for default model loading
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(python): set pyvenv python home
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* do pyenv update during start
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Minor changes
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@localai.io >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-04 19:02:06 +01:00
Ettore Di Giacinto
92ee8c2256
fix(ui): prevent box overflow in chat view ( #7430 )
...
Otherwise tool call and result might overflow the box
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-04 17:21:17 +01:00
Ettore Di Giacinto
78105e6b20
chore(ui): uniform buttons ( #7429 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-04 17:18:51 +01:00
Richard Palethorpe
c2e4a1f29b
feat(stablediffusion): Passthrough more parameters to support z-image and flux2 ( #7419 )
...
* feat(stablediffusion): Passthrough more parameters to support z-image and flux2
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* chore(z-image): Add Z-Image-Turbo GGML to library
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(stablediffusion-ggml): flush stderr and check errors when writing PNG
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(stablediffusion-ggml): Re-allocate Go strings in C++
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(stablediffusion-ggml): Try to avoid segfaults
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(stablediffusion-ggml): Init sample and easycache params
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-04 17:08:21 +01:00
Ettore Di Giacinto
100ebdfa2c
chore(ci): do not overload the apple tests
...
Skip tests that are already run on other jobs and not really adding anything here. We have already functional tests that cover apple.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-04 14:15:15 +01:00
LocalAI [bot]
ca2e878aaf
chore: ⬆️ Update ggml-org/llama.cpp to e9f9483464e6f01d843d7f0293bd9c7bc6b2221c ( #7421 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-04 11:54:01 +01:00
Igor B. Poretsky
96e123d53a
Messages output fix ( #7424 )
...
The internal echo command in sh does not support "-e" and "-E" options
and interprets backslash escape sequences by default. So we prefer the
external echo command when it is available.
2025-12-04 11:30:02 +01:00
LocalAI [bot]
7c5a0cde64
chore: ⬆️ Update leejet/stable-diffusion.cpp to 5865b5e7034801af1a288a9584631730b25272c6 ( #7422 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-04 11:29:16 +01:00
Ettore Di Giacinto
edcbf82b31
chore(ci): add wget
2025-12-04 10:01:34 +01:00
Ettore Di Giacinto
6558caca85
chore(ci): adapt also golang-based backends docker images
2025-12-04 09:14:08 +01:00
Ettore Di Giacinto
b4172762d7
chore(ci): do override pip in 24.04
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-03 22:54:13 +01:00
Ettore Di Giacinto
dc6182bbb1
chore(ci): add wget to llama-cpp docker image builder
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-03 22:48:41 +01:00
Ettore Di Giacinto
1d1d52da59
chore(ci): small fixups to build arm64 images
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-03 21:42:33 +01:00
Ettore Di Giacinto
46b1a1848f
chore(ci): minor fixup
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-03 16:47:31 +01:00
LocalAI [bot]
957eea3da3
chore: ⬆️ Update ggml-org/llama.cpp to 61bde8e21f4a1f9a98c9205831ca3e55457b4c78 ( #7415 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-03 16:27:12 +01:00
Ettore Di Giacinto
ab4f2742a6
chore(ci): minor fixup
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-03 16:26:33 +01:00
Ettore Di Giacinto
03f3bf2d94
chore(ci): only install runtime libs needed on arm64
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-03 15:13:21 +01:00
Ettore Di Giacinto
774ddc60db
chore(ci): specify ubuntu version in pipelines
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-03 11:10:18 +01:00
Ettore Di Giacinto
0ca1322b43
chore(ci): correctly pass ubuntu-version
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-03 09:58:10 +01:00
Ettore Di Giacinto
8dfeea2f55
fix: use ubuntu 24.04 for cuda13 l4t images ( #7418 )
...
* fix: use ubuntu 24.04 for cuda13 l4t images
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Drop openblas from containers
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-03 09:47:03 +01:00
Ettore Di Giacinto
fea9018dc5
Revert "feat(stablediffusion): Passthrough more parameters to support z-image and flux2" ( #7417 )
...
Revert "feat(stablediffusion): Passthrough more parameters to support z-image…"
This reverts commit 4018e59b2a .
2025-12-02 22:14:28 +01:00
Ettore Di Giacinto
d8c7e90a69
Add Dockerfile for arm64 with nvpl installation ( #7416 )
...
Added installation of nvpl and updated apt-get commands for arm64 architecture.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-02 21:55:42 +01:00
Ettore Di Giacinto
c045b7a6bb
Update Dockerfile to install cudss package
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-02 21:23:21 +01:00
Ettore Di Giacinto
7a5c61b057
fix: configure sbsa packages for arm64 ( #7413 )
...
* fix: configure sbsa packages for arm64
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-02 18:59:36 +01:00
Richard Palethorpe
4018e59b2a
feat(stablediffusion): Passthrough more parameters to support z-image and flux2 ( #7414 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-12-02 18:28:26 +01:00
Richard Palethorpe
aaece6685f
chore(deps/stable-diffusion-ggml): update stablediffusion-ggml ( #7411 )
...
* ⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
* fix(stablediffusion-ggml): fixup schedulers and samplers arrays, use default getters
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Signed-off-by: Richard Palethorpe <io@richiejp.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-12-02 16:35:39 +01:00
Ettore Di Giacinto
f5df806f35
Fixup tags
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-02 15:15:41 +01:00
Ettore Di Giacinto
cfd95745ed
feat: add cuda13 images ( #7404 )
...
* chore(ci): add cuda13 jobs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add to pipelines and to capabilities. Start to work on the gallery
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* gallery
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* capabilities: try to detect by looking at /usr/local
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* neutts
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* backends.yaml
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* add cuda13 l4t requirements.txt
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* add cuda13 requirements.txt
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Pin vllm
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Not all backends are compatible
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* add vllm to requirements
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* vllm is not pre-compiled for cuda 13
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-02 14:24:35 +01:00
dependabot[bot]
9872bdf455
chore(deps): bump appleboy/ssh-action from 1.2.3 to 1.2.4 ( #7410 )
...
Bumps [appleboy/ssh-action](https://github.com/appleboy/ssh-action ) from 1.2.3 to 1.2.4.
- [Release notes](https://github.com/appleboy/ssh-action/releases )
- [Commits](https://github.com/appleboy/ssh-action/compare/v1.2.3...v1.2.4 )
---
updated-dependencies:
- dependency-name: appleboy/ssh-action
dependency-version: 1.2.4
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-02 08:00:16 +01:00
LocalAI [bot]
665441ca94
chore: ⬆️ Update ggml-org/llama.cpp to ec18edfcba94dacb166e6523612fc0129cead67a ( #7406 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-12-02 07:59:52 +01:00
dependabot[bot]
60f50a356f
chore(deps): bump github.com/google/go-containerregistry from 0.19.2 to 0.20.7 ( #7409 )
...
chore(deps): bump github.com/google/go-containerregistry
Bumps [github.com/google/go-containerregistry](https://github.com/google/go-containerregistry ) from 0.19.2 to 0.20.7.
- [Release notes](https://github.com/google/go-containerregistry/releases )
- [Commits](https://github.com/google/go-containerregistry/compare/v0.19.2...v0.20.7 )
---
updated-dependencies:
- dependency-name: github.com/google/go-containerregistry
dependency-version: 0.20.7
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-12-01 22:55:30 +00:00
Ettore Di Giacinto
045baf7fd2
fix(ui): navbar ordering and login icon ( #7407 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-01 21:20:11 +01:00
Ettore Di Giacinto
8a54ffa668
fix: do not require auth for readyz/healthz endpoints ( #7403 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-01 10:35:28 +01:00
Ettore Di Giacinto
e3bcba5c45
chore: ⬆️ Update ggml-org/llama.cpp to 7f8ef50cce40e3e7e4526a3696cb45658190e69a ( #7402 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-12-01 07:50:40 +01:00
LocalAI [bot]
17d84c8556
feat(swagger): update swagger ( #7400 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-30 21:39:29 +00:00
Ettore Di Giacinto
a3423f33e1
feat(agent-jobs): add multimedia support ( #7398 )
...
* feat(agent-jobs): add multimedia support
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Refactoring
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-30 14:09:25 +01:00
Ettore Di Giacinto
45ee10ec50
feat(hf-api): return files in nested directories ( #7396 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-30 09:06:54 +01:00
LocalAI [bot]
0824fd8efd
chore: ⬆️ Update ggml-org/llama.cpp to 8c32d9d96d9ae345a0150cae8572859e9aafea0b ( #7395 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-30 09:06:18 +01:00
LocalAI [bot]
a9b8869964
feat(swagger): update swagger ( #7394 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-30 09:05:46 +01:00
Ettore Di Giacinto
54b5dfa8e1
chore: refactor css, restyle to be slightly minimalistic ( #7397 )
...
restyle
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-29 22:11:44 +01:00
Ettore Di Giacinto
468ac608f3
chore(deps): bump llama.cpp to 'd82b7a7c1d73c0674698d9601b1bbb0200933f29' ( #7392 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-29 08:58:07 +01:00
Ettore Di Giacinto
53e5b2d6be
feat: agent jobs panel ( #7390 )
...
* feat(agent): agent jobs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Multiple webhooks, simplify
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Do not use cron with seconds
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Create separate pages for details
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Detect if no models have MCP configuration, show wizard
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Make services test to run
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-28 23:05:39 +01:00
Ettore Di Giacinto
4b5977f535
chore: drop pinning of python 3.12 ( #7389 )
...
Update install.sh
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-28 11:02:56 +01:00
Ettore Di Giacinto
0d877b1e71
Revert "chore(l4t): Update extra index URL for requirements-l4t.txt" ( #7388 )
...
Revert "chore(l4t): Update extra index URL for requirements-l4t.txt (#7383 )"
This reverts commit 0d781e6b7e .
2025-11-28 11:02:11 +01:00
Ettore Di Giacinto
e27f1370eb
chore(diffusers): Add PY_STANDALONE_TAG for l4t Python version ( #7387 )
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-28 09:34:05 +01:00
LocalAI [bot]
1a53fd2b9b
chore: ⬆️ Update ggml-org/llama.cpp to 4abef75f2cf2eee75eb5083b30a94cf981587394 ( #7382 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-28 00:08:27 +01:00
Ettore Di Giacinto
e01d821314
chore: Add Python 3.12 support for l4t build profile ( #7384 )
...
Set Python version to 3.12 for l4t build profile.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-27 23:00:09 +01:00
Ettore Di Giacinto
0d781e6b7e
chore(l4t): Update extra index URL for requirements-l4t.txt ( #7383 )
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-27 22:02:06 +01:00
LocalAI [bot]
4c41f96157
docs: ⬆️ update docs version mudler/LocalAI ( #7381 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-27 21:49:31 +01:00
Igor B. Poretsky
a8eb1c421b
Clean data directory ( #7378 )
...
It seems to be no point to copy /etc/skel content to newly created data
directory.
2025-11-27 17:48:32 +01:00
Igor B. Poretsky
d27a281783
Correct user deletion with all its data ( #7368 )
...
Actually it is not necessary to remove particularly the local-ai data
directory before user deletion. It will be accomplished automatically by
the userdel command. But it is crucial to remove additional users from
the local-ai group to allow userdel command to delete the group itself.
2025-11-27 17:47:55 +01:00
Igor B. Poretsky
c411fe09fb
Conventional way of adding extra apt repository ( #7362 )
2025-11-27 17:46:26 +01:00
Ettore Di Giacinto
7ccc383a8b
chore(l4t/diffusers): bump nvidia l4t index for pytorch 2.9 ( #7379 )
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-27 17:42:01 +01:00
Ettore Di Giacinto
2f8a2b1297
chore(deps): update diffusers dependency to use GitHub repo for l4t ( #7369 )
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-27 16:02:48 +01:00
Igor B. Poretsky
acbcb44dbc
Initialize sudo reference before its first actual use ( #7367 )
...
Unfortunately, in my previous pr I missed the fact that uninstall
procedure uses sudo as well. La colpa mia.
2025-11-27 15:20:46 +01:00
Igor B. Poretsky
ab022172a9
chore: switch from /usr/share to /var/lib for data storage ( #7361 )
...
* More appropriate place for data storing
The /usr/share subtree in Linux is used for data that generally are not
supposed to change. Conventional places for changeable data are usually
located under /var, so /var/lib seems to be a reasonable default here.
* Data paths consistency fix
* Directory name consistency fix
2025-11-27 09:18:28 +01:00
LocalAI [bot]
b5f4f4ac6d
chore: ⬆️ Update ggml-org/llama.cpp to eec1e33a9ed71b79422e39cc489719cf4f8e0777 ( #7363 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-27 09:17:25 +01:00
Igor B. Poretsky
c0d1d0211f
fix: Initialize sudo reference before its first actual use ( #7360 )
2025-11-26 16:03:42 +01:00
Igor B. Poretsky
f617bec686
fix: double sudo invocation fix in the install script ( #7359 )
...
Double sudo invocation fix in the install script
2025-11-26 16:03:10 +01:00
Ettore Di Giacinto
7a94d237c4
chore(deps): bump llama.cpp to '583cb83416467e8abf9b37349dcf1f6a0083745a ( #7358 )
...
chore(deps): bump llama.cpp to '583cb83416467e8abf9b37349dcf1f6a0083745a'
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-26 08:23:21 +01:00
LocalAI [bot]
304ac94d01
feat(swagger): update swagger ( #7356 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-25 22:19:53 +01:00
Ettore Di Giacinto
f9f9b9d444
Update project news section in README.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-25 19:31:05 +01:00
dependabot[bot]
70d78b9fd4
chore(deps): bump google.golang.org/grpc from 1.76.0 to 1.77.0 ( #7343 )
...
Bumps [google.golang.org/grpc](https://github.com/grpc/grpc-go ) from 1.76.0 to 1.77.0.
- [Release notes](https://github.com/grpc/grpc-go/releases )
- [Commits](https://github.com/grpc/grpc-go/compare/v1.76.0...v1.77.0 )
---
updated-dependencies:
- dependency-name: google.golang.org/grpc
dependency-version: 1.77.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-24 21:18:41 +01:00
dependabot[bot]
91248da09e
chore(deps): bump actions/checkout from 5 to 6 ( #7339 )
...
Bumps [actions/checkout](https://github.com/actions/checkout ) from 5 to 6.
- [Release notes](https://github.com/actions/checkout/releases )
- [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md )
- [Commits](https://github.com/actions/checkout/compare/v5...v6 )
---
updated-dependencies:
- dependency-name: actions/checkout
dependency-version: '6'
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-24 21:18:15 +01:00
Gregory Mariani
745c31e013
feat(inpainting): add inpainting endpoint, wire ImageGenerationFunc and return generated image URL ( #7328 )
...
feat(inpainting): add inpainting endpoint with automatic model selection
Signed-off-by: Greg <marianigregory@pm.me >
2025-11-24 21:13:54 +01:00
dependabot[bot]
7e01aa8faa
chore(deps): bump protobuf from 6.32.0 to 6.33.1 in /backend/python/transformers ( #7340 )
...
chore(deps): bump protobuf in /backend/python/transformers
Bumps [protobuf](https://github.com/protocolbuffers/protobuf ) from 6.32.0 to 6.33.1.
- [Release notes](https://github.com/protocolbuffers/protobuf/releases )
- [Changelog](https://github.com/protocolbuffers/protobuf/blob/main/protobuf_release.bzl )
- [Commits](https://github.com/protocolbuffers/protobuf/commits )
---
updated-dependencies:
- dependency-name: protobuf
dependency-version: 6.33.1
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-24 20:12:17 +00:00
Ettore Di Giacinto
aceebf81d6
chore(ui): fix slider overflow
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-24 14:43:38 +01:00
Ettore Di Giacinto
71ed03102f
feat(ui): add chat history ( #7325 )
...
* feat(chat): add history and management
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Display in progress chats
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fetch available context size as we switch chat
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add search
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Display MCP toggle correctly
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Re-ordering
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Re-style
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Stable ordering
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Display token/sec correctly
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Visual changes
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Display chat time
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-24 11:48:24 +01:00
LocalAI [bot]
f6d2a52cd5
chore: ⬆️ Update ggml-org/llama.cpp to 0c7220db56525d40177fcce3baa0d083448ec813 ( #7337 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-24 09:11:38 +01:00
LocalAI [bot]
05a00b2399
chore: ⬆️ Update ggml-org/llama.cpp to 3f3a4fb9c3b907c68598363b204e6f58f4757c8c ( #7336 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-22 21:53:40 +00:00
Ettore Di Giacinto
3a232446e0
Revert "chore(chatterbox): bump l4t index to support more recent pytorch" ( #7333 )
...
Revert "chore(chatterbox): bump l4t index to support more recent pytorch (#7332 )"
This reverts commit 55607a5aac .
2025-11-22 10:10:27 +01:00
LocalAI [bot]
bdfe8431fa
chore: ⬆️ Update ggml-org/llama.cpp to 23bc779a6e58762ea892eca1801b2ea1b9050c00 ( #7331 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-22 08:44:01 +01:00
Ettore Di Giacinto
55607a5aac
chore(chatterbox): bump l4t index to support more recent pytorch ( #7332 )
...
This should add support for devices like the DGX Spark
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-21 22:24:46 +01:00
Ettore Di Giacinto
ec492a4c56
fix(typo): environment variable name for max jobs
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-21 18:37:22 +01:00
Ettore Di Giacinto
2defe98df8
fix(vllm): Update flash-attn to specific wheel URL
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-21 18:06:46 +01:00
Filipe Oliveira
b406b088a7
fix: Update Installer Options URL ( #7330 )
2025-11-21 17:29:36 +01:00
Ettore Di Giacinto
6261c87b1b
Add NVCC_THREADS and MAX_JOB environment variables
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-21 16:14:13 +01:00
Ettore Di Giacinto
fa00aa0085
chore(ci): add OS check to skip test if not on Linux
...
Skip test on non-Linux operating systems.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-21 15:01:04 +01:00
Ettore Di Giacinto
0e53ce60b4
chore(ci): remove context size configuration from application
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-21 14:57:32 +01:00
Ettore Di Giacinto
8aba078439
chore(tests): add context size option to application initialization
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-21 09:50:05 +01:00
Ettore Di Giacinto
e88db7d142
fix(llama.cpp): handle corner cases with tool content ( #7324 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-21 09:21:49 +01:00
LocalAI [bot]
b7b8a0a748
chore: ⬆️ Update ggml-org/llama.cpp to dd0f3219419b24740864b5343958a97e1b3e4b26 ( #7322 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-21 08:11:47 +01:00
Ettore Di Giacinto
dd2828241c
chore(docs): add documentation about import ( #7315 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-20 23:07:36 +01:00
LocalAI [bot]
b8011f49f2
chore: ⬆️ Update ggml-org/whisper.cpp to 19ceec8eac980403b714d603e5ca31653cd42a3f ( #7321 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-20 23:07:22 +01:00
Copilot
16e5689162
feat(importers): Add diffuser backend importer with ginkgo tests and UI support ( #7316 )
...
* Initial plan
* Add diffuser backend importer with ginkgo tests
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Finalize diffuser backend importer implementation
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Add diffuser preferences to model-editor import section
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Use gopkg.in/yaml.v3 for consistency in diffuser importer
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-20 22:38:30 +01:00
Ettore Di Giacinto
2dd42292dc
feat(ui): runtime settings ( #7320 )
...
* feat(ui): add watchdog settings
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Do not re-read env
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Some refactor, move other settings to runtime (p2p)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add API Keys handling
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Allow to disable runtime settings
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Documentation
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Small fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* show MCP toggle in index
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Drop context default
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-20 22:37:20 +01:00
Ettore Di Giacinto
53d51671d7
Update Docker installation recommendation wording
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-20 17:27:48 +01:00
Ettore Di Giacinto
daf39e1efd
chore(vllm/ci): set maximum number of jobs
...
Also added comments to clarify CPU usage during build.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-20 15:53:32 +01:00
Ettore Di Giacinto
382474e4a1
fix: do not delete files if used by other configured models ( #7235 )
...
* WIP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: prevent deletion of model files shared by multiple configurations (#7317 )
* Initial plan
* fix: do not delete files if used by other configured models
- Fixed bug in DeleteModelFromSystem where OR was used instead of AND for file suffix check
- Fixed bug where model config filename comparison was incorrect
- Added comprehensive Ginkgo test to verify shared model files are not deleted
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* fix: prevent deletion of model files shared by multiple configurations
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Co-authored-by: Copilot <198982749+Copilot@users.noreply.github.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-20 14:55:51 +01:00
Ettore Di Giacinto
5fed9c6596
chore(ci): move intel image builds to self-hosted
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-20 09:36:54 +01:00
LocalAI [bot]
bfa07df7cd
chore: ⬆️ Update ggml-org/llama.cpp to 7d77f07325985c03a91fa371d0a68ef88a91ec7f ( #7314 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-20 07:58:42 +01:00
dependabot[bot]
fbaa21b0e5
chore(deps): bump golang.org/x/crypto from 0.43.0 to 0.45.0 in the go_modules group across 1 directory ( #7319 )
...
chore(deps): bump golang.org/x/crypto
Bumps the go_modules group with 1 update in the / directory: [golang.org/x/crypto](https://github.com/golang/crypto ).
Updates `golang.org/x/crypto` from 0.43.0 to 0.45.0
- [Commits](https://github.com/golang/crypto/compare/v0.43.0...v0.45.0 )
---
updated-dependencies:
- dependency-name: golang.org/x/crypto
dependency-version: 0.45.0
dependency-type: indirect
dependency-group: go_modules
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-20 04:19:22 +00:00
Ettore Di Giacinto
95b6c9bb5a
Update docs
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-19 22:25:33 +01:00
Ettore Di Giacinto
2cc4809b0d
feat: docs revamp ( #7313 )
...
* docs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Small enhancements
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Enhancements
* Default to zen-dark
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-19 22:21:20 +01:00
Ettore Di Giacinto
77bbeed57e
feat(importer): unify importing code with CLI ( #7299 )
...
* feat(importer): support ollama and OCI, unify code
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat: support importing from local file
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* support also yaml config files
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Correctly handle local files
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Extract importing errors
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add importer tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add integration tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(UX): improve and specify supported URI formats
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fail if backend does not have a runfile
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Adapt tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat(gallery): add cache for galleries
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(ui): remove handler duplicate
File input handlers are now handled by Alpine.js @change handlers in chat.html.
Removed duplicate listeners to prevent files from being processed twice
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(ui): be consistent in attachments in the chat
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fail if no importer matches
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: propagate ops correctly
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-19 20:52:11 +01:00
Ettore Di Giacinto
3152611184
chore(deps): bump llama.cpp to '10e9780154365b191fb43ca4830659ef12def80f ( #7311 )
...
chore(deps): bump llama.cpp to '10e9780154365b191fb43ca4830659ef12def80f'
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-19 14:42:11 +01:00
Ettore Di Giacinto
30f992f241
feat(ui): add backend reinstall button ( #7305 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-18 14:52:54 +01:00
ErixM
2709220b84
fix the tts model dropdown to show the currently selected model ( #7306 )
...
* fix the tts model dropdown to show the currently selected model
* Update core/config/model_config.go
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
---------
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
Co-authored-by: Erixhens Muka <erixhens.muka@bluetensor.ai >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-18 14:49:03 +01:00
LocalAI [bot]
4278506876
chore: ⬆️ Update ggml-org/llama.cpp to cb623de3fc61011e5062522b4d05721a22f2e916 ( #7301 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-18 07:43:57 +01:00
LocalAI [bot]
1dd1d12da1
chore: ⬆️ Update ggml-org/whisper.cpp to b12abefa9be2abae39a73fa903322af135024a36 ( #7300 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-18 07:43:33 +01:00
dependabot[bot]
3a5b3bb0a6
chore(deps): bump google.golang.org/protobuf from 1.36.8 to 1.36.10 ( #7295 )
...
Bumps google.golang.org/protobuf from 1.36.8 to 1.36.10.
---
updated-dependencies:
- dependency-name: google.golang.org/protobuf
dependency-version: 1.36.10
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-17 22:25:55 +01:00
dependabot[bot]
94d9fc923f
chore(deps): bump github.com/alecthomas/kong from 1.12.1 to 1.13.0 ( #7296 )
...
Bumps [github.com/alecthomas/kong](https://github.com/alecthomas/kong ) from 1.12.1 to 1.13.0.
- [Commits](https://github.com/alecthomas/kong/compare/v1.12.1...v1.13.0 )
---
updated-dependencies:
- dependency-name: github.com/alecthomas/kong
dependency-version: 1.13.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-17 20:39:14 +01:00
dependabot[bot]
6fcf2c50b6
chore(deps): bump go.yaml.in/yaml/v2 from 2.4.2 to 2.4.3 ( #7294 )
...
Bumps [go.yaml.in/yaml/v2](https://github.com/yaml/go-yaml ) from 2.4.2 to 2.4.3.
- [Commits](https://github.com/yaml/go-yaml/compare/v2.4.2...v2.4.3 )
---
updated-dependencies:
- dependency-name: go.yaml.in/yaml/v2
dependency-version: 2.4.3
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-17 20:37:19 +01:00
dependabot[bot]
7cbd4a2f18
chore(deps): bump fyne.io/fyne/v2 from 2.7.0 to 2.7.1 ( #7293 )
...
Bumps [fyne.io/fyne/v2](https://github.com/fyne-io/fyne ) from 2.7.0 to 2.7.1.
- [Release notes](https://github.com/fyne-io/fyne/releases )
- [Changelog](https://github.com/fyne-io/fyne/blob/master/CHANGELOG.md )
- [Commits](https://github.com/fyne-io/fyne/compare/v2.7.0...v2.7.1 )
---
updated-dependencies:
- dependency-name: fyne.io/fyne/v2
dependency-version: 2.7.1
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-17 20:37:07 +01:00
Ettore Di Giacinto
18d11396cd
chore(docs): improve documentation and split into sections bigger topics ( #7292 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-17 18:39:21 +01:00
Ettore Di Giacinto
93cd688f40
chore: small ux enhancements ( #7290 )
...
* chore: improve chat attachments
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore: display installed backends/models
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-17 17:09:42 +01:00
Ettore Di Giacinto
721c3f962b
chore: scroll in thinking mode, better buttons placement ( #7289 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-17 16:21:14 +01:00
LocalAI [bot]
fb834805db
chore: ⬆️ Update ggml-org/llama.cpp to 80deff3648b93727422461c41c7279ef1dac7452 ( #7287 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-17 07:51:08 +01:00
LocalAI [bot]
839aa7b42b
feat(swagger): update swagger ( #7286 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-17 07:49:06 +01:00
Ettore Di Giacinto
e963a45d66
feat(index): minor enhancements ( #7288 )
...
* feat(ui): add placeholder effect and select first model by default
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(ui): correctly bind focus to parent
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-16 21:23:18 +01:00
Mikhail Khludnev
c313b2c671
fix(reranker): tests and top_n check fix #7212 ( #7284 )
...
reranker tests and top_n check fix #7212
Signed-off-by: Mikhail Khludnev <mkhl@apache.org >
2025-11-16 17:53:23 +01:00
Ettore Di Giacinto
137f16336e
feat(ui): small refinements ( #7285 )
...
* feat(ui): show loaded models in the index
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(ui): re-organize navbar
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-16 17:50:13 +01:00
Ettore Di Giacinto
d7f9f3ac93
feat: add support to logitbias and logprobs ( #7283 )
...
* feat: add support to logprobs in results
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat: add support to logitbias
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-16 13:27:36 +01:00
Ettore Di Giacinto
cd7d384500
feat: restyle index ( #7282 )
...
* Move management to separate section
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Make index to redirect to chat
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Use logo in index
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* work out the wizard in the front-page
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-16 11:01:05 +01:00
LocalAI [bot]
d1a0dd10e6
chore: ⬆️ Update ggml-org/llama.cpp to 662192e1dcd224bc25759aadd0190577524c6a66 ( #7277 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-16 08:41:12 +01:00
Ettore Di Giacinto
be8cf838c2
feat(importers): add transformers and vLLM ( #7278 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-15 22:47:09 +01:00
LocalAI [bot]
3276d1cdaf
feat(swagger): update swagger ( #7276 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-15 21:50:30 +01:00
Ettore Di Giacinto
5e5f01badd
chore(ui): import vendored libs ( #7281 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-15 21:49:52 +01:00
Ettore Di Giacinto
6d0f646c37
chore: guide the user to import models ( #7280 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-15 21:37:50 +01:00
Ettore Di Giacinto
99d31667f8
chore: do not use placeholder image ( #7279 )
...
Use font-awesome icons instead
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-15 21:31:58 +01:00
Ettore Di Giacinto
47b546afdc
feat(mcp): add LocalAI endpoint to stream live results of the agent ( #7274 )
...
* feat(mcp): add LocalAI endpoint to stream live results of the agent
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* wip
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Refactoring
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* MCP UX integration
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Enhance UX
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Support also non-SSE
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-15 17:38:00 +01:00
LocalAI [bot]
a09d49da43
chore: ⬆️ Update ggml-org/llama.cpp to 9b17d74ab7d31cb7d15ee7eec1616c3d825a84c0 ( #7273 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-15 00:05:39 +01:00
Ettore Di Giacinto
1cdcaf0152
feat: migrate to echo and enable cancellation of non-streaming requests ( #7270 )
...
* WIP: migrate to echo
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-14 22:57:53 +01:00
Ettore Di Giacinto
03e9f4b140
fix: handle tool errors ( #7271 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-14 17:23:56 +01:00
Ettore Di Giacinto
7129409bf6
chore(deps): bump llama.cpp to c4abcb2457217198efdd67d02675f5fddb7071c2 ( #7266 )
...
* chore(deps): bump llama.cpp to '92bb442ad999a0d52df0af2730cd861012e8ac5c'
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* DEBUG
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Bump
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* test/debug
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Revert "DEBUG"
This reverts commit 2501ca3ff242076d623c13c86b3d6afcec426281.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-14 12:16:52 +01:00
LocalAI [bot]
d9e9ec6825
chore: ⬆️ Update ggml-org/whisper.cpp to d9b7613b34a343848af572cc14467fc5e82fc788 ( #7268 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-13 23:05:06 +01:00
LocalAI [bot]
b82645d28d
feat(swagger): update swagger ( #7267 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-13 21:28:10 +00:00
Ettore Di Giacinto
735ca757fa
feat(ui): allow to cancel ops ( #7264 )
...
* feat(ui): allow to cancel ops
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Improve progress text
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Cancel queued ops, don't show up message cancellation always
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: fixup displaying of total progress over multiple files
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-13 18:41:47 +01:00
Ettore Di Giacinto
b1d1f2a37d
chore(importers): small logic enhancements ( #7262 )
...
* chore(import): import mmproj files to specific folder
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Slightly enhance logics
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-12 22:08:08 +01:00
Ettore Di Giacinto
3728552e94
feat: import models via URI ( #7245 )
...
* feat: initial hook to install elements directly
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* WIP: ui changes
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Move HF api client to pkg
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add simple importer for gguf files
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add opcache
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* wire importers to CLI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add omitempty to config fields
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fix tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add MLX importer
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Small refactors to star to use HF for discovery
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Common preferences
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add support to bare HF repos
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat(importer/llama.cpp): add support for mmproj files
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* add mmproj quants to common preferences
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fix vlm usage in tokenizer mode with llama.cpp
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-12 20:48:56 +01:00
Shohidul Bari
87d0020c10
chore: display file names in model preview ( #7251 )
...
* chore: add file names in the model response
Signed-off-by: shohidulbari <shohidulbari18@gmail.com >
* chore: show file names in model preview
Signed-off-by: shohidulbari <shohidulbari18@gmail.com >
* chore: fix prettier format
Signed-off-by: shohidulbari <shohidulbari18@gmail.com >
---------
Signed-off-by: shohidulbari <shohidulbari18@gmail.com >
2025-11-12 09:20:54 +01:00
LocalAI [bot]
a8eb537071
chore(model-gallery): ⬆️ update checksum ( #7248 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-12 09:14:09 +01:00
Mikhail Khludnev
04fe0b0da8
fix(reranker): llama-cpp sort score desc, crop top_n ( #7211 )
...
Signed-off-by: Mikhail Khludnev <mkhl@apache.org >
2025-11-12 09:13:01 +01:00
LocalAI [bot]
fae93e5ba2
chore: ⬆️ Update ggml-org/llama.cpp to 7d019cff744b73084b15ca81ba9916f3efab1223 ( #7247 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-11 21:31:01 +00:00
LocalAI [bot]
b606034243
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7237 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-11 09:19:19 +01:00
LocalAI [bot]
5f4663252d
chore: ⬆️ Update ggml-org/llama.cpp to 13730c183b9e1a32c09bf132b5367697d6c55048 ( #7232 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-11 00:03:01 +01:00
LocalAI [bot]
80bb7c5f67
chore: ⬆️ Update ggml-org/whisper.cpp to a1867e0dad0b21b35afa43fc815dae60c9a139d6 ( #7231 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-10 22:51:49 +01:00
Ettore Di Giacinto
f6881ea023
fix(edit): propagate correctly opts when reloading ( #7233 )
...
fix(edit): propagate correctly ops when reloading
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-10 22:06:32 +01:00
dependabot[bot]
5651a19aa1
chore(deps): bump github.com/ebitengine/purego from 0.9.0 to 0.9.1 ( #7229 )
...
Bumps [github.com/ebitengine/purego](https://github.com/ebitengine/purego ) from 0.9.0 to 0.9.1.
- [Release notes](https://github.com/ebitengine/purego/releases )
- [Commits](https://github.com/ebitengine/purego/compare/v0.9.0...v0.9.1 )
---
updated-dependencies:
- dependency-name: github.com/ebitengine/purego
dependency-version: 0.9.1
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-10 20:16:32 +00:00
dependabot[bot]
c834cdb826
chore(deps): bump github.com/testcontainers/testcontainers-go from 0.38.0 to 0.40.0 ( #7230 )
...
chore(deps): bump github.com/testcontainers/testcontainers-go
Bumps [github.com/testcontainers/testcontainers-go](https://github.com/testcontainers/testcontainers-go ) from 0.38.0 to 0.40.0.
- [Release notes](https://github.com/testcontainers/testcontainers-go/releases )
- [Commits](https://github.com/testcontainers/testcontainers-go/compare/v0.38.0...v0.40.0 )
---
updated-dependencies:
- dependency-name: github.com/testcontainers/testcontainers-go
dependency-version: 0.40.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-10 19:40:57 +01:00
dependabot[bot]
fa2caef63d
chore(deps): bump github.com/docker/docker from 28.5.1+incompatible to 28.5.2+incompatible ( #7228 )
...
chore(deps): bump github.com/docker/docker
Bumps [github.com/docker/docker](https://github.com/docker/docker ) from 28.5.1+incompatible to 28.5.2+incompatible.
- [Release notes](https://github.com/docker/docker/releases )
- [Commits](https://github.com/docker/docker/compare/v28.5.1...v28.5.2 )
---
updated-dependencies:
- dependency-name: github.com/docker/docker
dependency-version: 28.5.2+incompatible
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-10 19:40:26 +01:00
dependabot[bot]
31abc799f9
chore(deps): bump github.com/jaypipes/ghw from 0.19.1 to 0.20.0 ( #7227 )
...
Bumps [github.com/jaypipes/ghw](https://github.com/jaypipes/ghw ) from 0.19.1 to 0.20.0.
- [Release notes](https://github.com/jaypipes/ghw/releases )
- [Commits](https://github.com/jaypipes/ghw/compare/v0.19.1...v0.20.0 )
---
updated-dependencies:
- dependency-name: github.com/jaypipes/ghw
dependency-version: 0.20.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-10 19:40:06 +01:00
dependabot[bot]
2368395a0c
chore(deps): bump github.com/mudler/cogito from 0.5.0 to 0.5.1 ( #7226 )
...
Bumps [github.com/mudler/cogito](https://github.com/mudler/cogito ) from 0.5.0 to 0.5.1.
- [Release notes](https://github.com/mudler/cogito/releases )
- [Commits](https://github.com/mudler/cogito/compare/v0.5.0...v0.5.1 )
---
updated-dependencies:
- dependency-name: github.com/mudler/cogito
dependency-version: 0.5.1
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-10 19:39:45 +01:00
dependabot[bot]
bf77c11b65
chore(deps): bump appleboy/ssh-action from 1.2.2 to 1.2.3 ( #7224 )
...
Bumps [appleboy/ssh-action](https://github.com/appleboy/ssh-action ) from 1.2.2 to 1.2.3.
- [Release notes](https://github.com/appleboy/ssh-action/releases )
- [Changelog](https://github.com/appleboy/ssh-action/blob/master/.goreleaser.yaml )
- [Commits](https://github.com/appleboy/ssh-action/compare/v1.2.2...v1.2.3 )
---
updated-dependencies:
- dependency-name: appleboy/ssh-action
dependency-version: 1.2.3
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-10 19:34:13 +01:00
Ettore Di Giacinto
8876073f5c
feat(ui): chat stats, small visual enhancements ( #7223 )
...
* feat(ui): show stats in chat, improve style
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Markdown, small improvements
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Display token/sec into stats
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Minor enhancement
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Small fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Revert "Fixups"
This reverts commit ab1b3d6da9 .
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-10 18:12:07 +01:00
Ettore Di Giacinto
8432915cb8
feat(ui): add wizard when p2p is disabled ( #7218 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-10 10:43:54 +01:00
LocalAI [bot]
9ddb94b507
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7216 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-10 09:38:52 +01:00
LocalAI [bot]
e42f0f7e79
chore: ⬆️ Update ggml-org/llama.cpp to b8595b16e69e3029e06be3b8f6635f9812b2bc3f ( #7210 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-09 23:56:27 +01:00
Copilot
34bc1bda1e
fix(api): SSE streaming format to comply with specification ( #7182 )
...
* Initial plan
* Fix SSE streaming format to comply with specification
- Replace json.Encoder with json.Marshal for explicit formatting
- Use explicit \n\n for all SSE messages (instead of relying on implicit newlines)
- Change %v to %s format specifier for proper string formatting
- Fix error message streaming to include proper SSE format
- Ensure consistency between chat.go and completion.go endpoints
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Add proper error handling for JSON marshal failures in streaming
- Handle json.Marshal errors explicitly in error response paths
- Add fallback simple error message if marshal fails
- Prevents sending 'data: <nil>' on marshal failures
- Addresses code review feedback
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Fix SSE streaming format to comply with specification
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Fix finish_reason field to use pointer for proper null handling
- Change FinishReason from string to *string in Choice schema
- Streaming chunks now omit finish_reason (null) instead of empty string
- Final chunks properly set finish_reason to "stop", "tool_calls", etc.
- Remove empty content from initial streaming chunks (only send role)
- Final streaming chunk sends empty delta with finish_reason
- Addresses OpenAI API compliance issues causing client failures
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Improve code consistency for string pointer creation
- Use consistent pattern: declare variable then take address
- Remove inline anonymous function for better readability
- Addresses code review feedback
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Move common finish reasons to constants
- Create constants.go with FinishReasonStop, FinishReasonToolCalls, FinishReasonFunctionCall
- Replace all string literals with constants in chat.go, completion.go, realtime.go
- Improves code maintainability and prevents typos
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* Make it build
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fix finish_reason to always be present with null or string value
- Remove omitempty from FinishReason field in Choice struct
- Explicitly set FinishReason to nil for all streaming chunks
- Ensures finish_reason appears as null in JSON for streaming chunks
- Final chunks still properly set finish_reason to "stop", "tool_calls", etc.
- Complies with OpenAI API specification example
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-09 22:00:27 +01:00
Mikhail Khludnev
01cd58a739
fix(reranker): support omitting top_n ( #7199 )
...
* fix(reranker): support omitting top_n
Signed-off-by: Mikhail Khludnev <mkhl@apache.org >
* fix(reranker): support omitting top_n
Signed-off-by: Mikhail Khludnev <mkhl@apache.org >
* pass 0 explicitly
Signed-off-by: Mikhail Khludnev <mkhludnev@users.noreply.github.com >
---------
Signed-off-by: Mikhail Khludnev <mkhl@apache.org >
Signed-off-by: Mikhail Khludnev <mkhludnev@users.noreply.github.com >
2025-11-09 18:40:32 +01:00
Ettore Di Giacinto
679d43c2f5
feat: respect context and add request cancellation ( #7187 )
...
* feat: respect context
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* workaround fasthttp
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat(ui): allow to abort call
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Refactor
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore: improving error
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Respect context also with MCP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Tie to both contexts
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Make detection more robust
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-09 18:19:19 +01:00
LocalAI [bot]
4730b52461
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7205 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-09 08:40:40 +01:00
LocalAI [bot]
f678c6b0a9
chore: ⬆️ Update ggml-org/llama.cpp to 333f2595a3e0e4c0abf233f2f29ef1710acd134d ( #7201 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-08 21:06:17 +00:00
Ettore Di Giacinto
2f2f9beee7
fix(chatterbox): pin numpy ( #7198 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-08 16:52:22 +01:00
LocalAI [bot]
8ac7e28c12
chore: ⬆️ Update ggml-org/llama.cpp to 65156105069fa86a4a81b6cb0e8cb583f6420677 ( #7184 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-08 09:07:44 +01:00
Ettore Di Giacinto
c5c3538115
chore(ci): change cron schedule to run every 3 hours
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-07 22:36:28 +01:00
Shohidul Bari
5ef16b5693
chore: use air to live reload in dev environment ( #7186 )
...
* chore: use air to live reload in dev environment
Signed-off-by: shohidulbari <shohidulbari18@gmail.com >
* chore: update contribuing with live reload option
Signed-off-by: shohidulbari <shohidulbari18@gmail.com >
---------
Signed-off-by: shohidulbari <shohidulbari18@gmail.com >
2025-11-07 21:53:44 +01:00
Ettore Di Giacinto
02cc8cbcaa
feat(llama.cpp): consolidate options and respect tokenizer template when enabled ( #7120 )
...
* feat(llama.cpp): expose env vars as options for consistency
This allows to configure everything in the YAML file of the model rather
than have global configurations
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat(llama.cpp): respect usetokenizertemplate and use llama.cpp templating system to process messages
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* WIP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Detect template exists if use tokenizer template is enabled
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Better recognization of chat
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixes to support tool calls while using templates from tokenizer
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Drop template guessing, fix passing tools to tokenizer
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Extract grammar and other options from chat template, add schema struct
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* WIP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* WIP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Automatically set use_jinja
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Cleanups, identify by default gguf models for chat
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Update docs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-07 21:23:50 +01:00
Ettore Di Giacinto
e5e86d0acb
Enhance README with roadmap and getting started info
...
Updated README to include roadmap link and clarify getting started section.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-07 16:50:47 +01:00
Ettore Di Giacinto
edd35d2b33
Fix formatting issues in README.md links
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-07 15:57:08 +01:00
LocalAI [bot]
e8cc29e364
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7162 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-07 08:40:29 +01:00
LocalAI [bot]
8f7c499f17
chore: ⬆️ Update ggml-org/llama.cpp to 7f09a680af6e0ef612de81018e1d19c19b8651e8 ( #7156 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-07 08:38:56 +01:00
dependabot[bot]
ea446fde08
chore(deps): bump github.com/containerd/containerd from 1.7.28 to 1.7.29 in the go_modules group across 1 directory ( #7149 )
...
chore(deps): bump github.com/containerd/containerd
Bumps the go_modules group with 1 update in the / directory: [github.com/containerd/containerd](https://github.com/containerd/containerd ).
Updates `github.com/containerd/containerd` from 1.7.28 to 1.7.29
- [Release notes](https://github.com/containerd/containerd/releases )
- [Changelog](https://github.com/containerd/containerd/blob/main/RELEASES.md )
- [Commits](https://github.com/containerd/containerd/compare/v1.7.28...v1.7.29 )
---
updated-dependencies:
- dependency-name: github.com/containerd/containerd
dependency-version: 1.7.29
dependency-type: direct:production
dependency-group: go_modules
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-06 16:40:49 +00:00
Mikhail Khludnev
122e4c7094
fix(reranker): reproduce ignoring top_n ( #7025 )
...
* fix(reranker): reproduce ignoring top_n
Signed-off-by: Mikhail Khludnev <mkhl@apache.org >
* fix(reranker): ignoring top_n
Signed-off-by: Mikhail Khludnev <mkhl@apache.org >
---------
Signed-off-by: Mikhail Khludnev <mkhl@apache.org >
2025-11-06 10:03:05 +00:00
LocalAI [bot]
2573102317
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7133 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-06 09:18:59 +01:00
Shohidul Bari
41b60fcfd3
chore: show success toast when system prompt is updated ( #7131 )
...
Signed-off-by: shohidulbari <shohidulbari18@gmail.com >
2025-11-06 09:05:57 +01:00
LocalAI [bot]
cb81869140
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7127 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-05 22:40:37 +01:00
LocalAI [bot]
db9957b94e
chore: ⬆️ Update ggml-org/llama.cpp to a44d77126c911d105f7f800c17da21b2a5b112d1 ( #7125 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-05 21:22:04 +00:00
LocalAI [bot]
98158881c2
chore: ⬆️ Update ggml-org/llama.cpp to ad51c0a720062a04349c779aae301ad65ca4c856 ( #7098 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-04 21:19:58 +00:00
Ettore Di Giacinto
79247a5d17
Clarify note about DMGs not being signed by Apple
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-04 12:09:28 +01:00
Ettore Di Giacinto
46b7a4c5f2
Add macOS DMG download information
...
Added download link and note for macOS DMG installation.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-04 12:09:07 +01:00
Ettore Di Giacinto
436e2d91d0
Enhance overview with Docker and installer details
...
Added Docker instructions and clarified one-liner installer for Linux.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-04 12:08:03 +01:00
Ettore Di Giacinto
a86fdc4087
Update binaries.md with macOS download instructions
...
Added download instructions for macOS DMG file and updated command for Linux and macOS.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-04 12:06:56 +01:00
Ettore Di Giacinto
c7ac6ca687
Update README with DMG signing note
...
Added a note about DMG signing and workarounds.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-04 12:06:01 +01:00
LocalAI [bot]
7088327e8d
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7077 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-04 09:27:33 +01:00
LocalAI [bot]
e2cb44ef37
chore: ⬆️ Update ggml-org/llama.cpp to c5023daf607c578d6344c628eb7da18ac3d92d32 ( #7069 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-04 09:26:10 +01:00
dependabot[bot]
3a40b4129c
chore(deps): bump github.com/mudler/edgevpn from 0.31.0 to 0.31.1 ( #7055 )
...
Bumps [github.com/mudler/edgevpn](https://github.com/mudler/edgevpn ) from 0.31.0 to 0.31.1.
- [Release notes](https://github.com/mudler/edgevpn/releases )
- [Changelog](https://github.com/mudler/edgevpn/blob/master/.goreleaser.yml )
- [Commits](https://github.com/mudler/edgevpn/compare/v0.31.0...v0.31.1 )
---
updated-dependencies:
- dependency-name: github.com/mudler/edgevpn
dependency-version: 0.31.1
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-04 02:29:50 +00:00
dependabot[bot]
4ca8055f21
chore(deps): bump github.com/valyala/fasthttp from 1.55.0 to 1.68.0 ( #7057 )
...
Bumps [github.com/valyala/fasthttp](https://github.com/valyala/fasthttp ) from 1.55.0 to 1.68.0.
- [Release notes](https://github.com/valyala/fasthttp/releases )
- [Commits](https://github.com/valyala/fasthttp/compare/v1.55.0...v1.68.0 )
---
updated-dependencies:
- dependency-name: github.com/valyala/fasthttp
dependency-version: 1.68.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-04 01:16:28 +00:00
dependabot[bot]
704786cc6d
chore(deps): bump github.com/modelcontextprotocol/go-sdk from 1.0.0 to 1.1.0 ( #7053 )
...
chore(deps): bump github.com/modelcontextprotocol/go-sdk
Bumps [github.com/modelcontextprotocol/go-sdk](https://github.com/modelcontextprotocol/go-sdk ) from 1.0.0 to 1.1.0.
- [Release notes](https://github.com/modelcontextprotocol/go-sdk/releases )
- [Commits](https://github.com/modelcontextprotocol/go-sdk/compare/v1.0.0...v1.1.0 )
---
updated-dependencies:
- dependency-name: github.com/modelcontextprotocol/go-sdk
dependency-version: 1.1.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-03 23:54:13 +00:00
dependabot[bot]
e5ce1fd9cc
chore(deps): bump github.com/onsi/ginkgo/v2 from 2.26.0 to 2.27.2 ( #7056 )
...
Bumps [github.com/onsi/ginkgo/v2](https://github.com/onsi/ginkgo ) from 2.26.0 to 2.27.2.
- [Release notes](https://github.com/onsi/ginkgo/releases )
- [Changelog](https://github.com/onsi/ginkgo/blob/master/CHANGELOG.md )
- [Commits](https://github.com/onsi/ginkgo/compare/v2.26.0...v2.27.2 )
---
updated-dependencies:
- dependency-name: github.com/onsi/ginkgo/v2
dependency-version: 2.27.2
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-03 21:49:06 +01:00
LocalAI [bot]
ea2037f141
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7068 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-03 21:48:04 +01:00
dependabot[bot]
567fa62330
chore(deps): bump github.com/mudler/cogito from 0.4.0 to 0.5.0 ( #7054 )
...
Bumps [github.com/mudler/cogito](https://github.com/mudler/cogito ) from 0.4.0 to 0.5.0.
- [Release notes](https://github.com/mudler/cogito/releases )
- [Commits](https://github.com/mudler/cogito/compare/v0.4.0...v0.5.0 )
---
updated-dependencies:
- dependency-name: github.com/mudler/cogito
dependency-version: 0.5.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-03 19:50:22 +00:00
Ettore Di Giacinto
d424a27fa2
chore: display warning only when directory is present ( #7050 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-03 18:56:47 +01:00
LocalAI [bot]
3ce9cb566d
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7040 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-03 09:09:50 +01:00
Ettore Di Giacinto
ee7638a9b0
Update index.yaml with new file entry
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-03 08:47:47 +01:00
LocalAI [bot]
e57e50e441
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7039 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-03 08:47:16 +01:00
LocalAI [bot]
81880e7975
chore(model-gallery): ⬆️ update checksum ( #7024 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-03 08:31:14 +01:00
LocalAI [bot]
2cad2c8591
chore: ⬆️ Update ggml-org/llama.cpp to cd5e3b57541ecc52421130742f4d89acbcf77cd4 ( #7023 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-02 21:24:19 +00:00
LocalAI [bot]
b87b41ee45
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7017 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-02 17:34:11 +01:00
Ettore Di Giacinto
424acd66ad
feat(llama.cpp): allow to set cache-ram and ctx_shift ( #7009 )
...
* feat(llama.cpp): allow to set cache-ram and ctx_shift
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Apply suggestion from @mudler
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-02 17:33:29 +01:00
LocalAI [bot]
3cd8234550
chore: ⬆️ Update ggml-org/whisper.cpp to 999a7e0cbf8484dc2cea1e9f855d6b39f34f7ae9 ( #6997 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-02 10:39:11 +01:00
LocalAI [bot]
c70a0f05b8
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6989 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-02 09:48:12 +01:00
LocalAI [bot]
f85e2dd1b8
chore: ⬆️ Update ggml-org/llama.cpp to 2f68ce7cfd20e9e7098514bf730e5389b7bba908 ( #6998 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-02 09:44:37 +01:00
LocalAI [bot]
e485bdf9ab
docs: ⬆️ update docs version mudler/LocalAI ( #6996 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-01 21:08:08 +00:00
Ettore Di Giacinto
495c4ee694
Update index.yaml with new model entry
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-01 09:21:22 +01:00
LocalAI [bot]
161d1a0344
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6982 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-01 09:21:00 +01:00
LocalAI [bot]
b6d1def96f
chore(model-gallery): ⬆️ update checksum ( #6972 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-01 09:13:42 +01:00
LocalAI [bot]
9ecfdc5938
chore: ⬆️ Update ggml-org/llama.cpp to 31c511a968348281e11d590446bb815048a1e912 ( #6970 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-31 21:04:53 +00:00
Ettore Di Giacinto
c332ef5cce
chore: fix linting issues
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-31 19:08:34 +01:00
Ettore Di Giacinto
6e7a8c6041
chore(model gallery): add qwen3-vl-2b-instruct ( #6967 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-31 19:04:10 +01:00
Ettore Di Giacinto
43e707ec4f
chore(model gallery): add qwen3-vl-2b-thinking ( #6966 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-31 19:03:23 +01:00
Ettore Di Giacinto
fed3663a74
chore(model gallery): add qwen3-vl-4b-thinking ( #6965 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-31 19:02:22 +01:00
Ettore Di Giacinto
5b72798db3
chore(model gallery): add qwen3-vl-32b-instruct ( #6964 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-31 19:01:11 +01:00
Ettore Di Giacinto
d24d6d4e93
chore(model gallery): add qwen3-vl-4b-instruct ( #6963 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-31 18:57:50 +01:00
Ettore Di Giacinto
50ee1fbe06
chore(model gallery): add qwen3-vl-30b-a3b-thinking ( #6962 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-31 18:53:13 +01:00
Ettore Di Giacinto
19f3425ce0
chore(model gallery): add huihui-qwen3-vl-30b-a3b-instruct-abliterated ( #6961 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-31 18:46:25 +01:00
Ettore Di Giacinto
a6ef245534
chore(model gallery): add qwen3-vl-30b-a3b-instruct ( #6960 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-31 18:37:12 +01:00
LocalAI [bot]
88cb379c2d
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6940 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-31 16:57:18 +01:00
LocalAI [bot]
0ddb2e8dcf
chore: ⬆️ Update ggml-org/llama.cpp to 4146d6a1a6228711a487a1e3e9ddd120f8d027d7 ( #6945 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-31 14:51:03 +00:00
Ettore Di Giacinto
91b9301bec
Rename workflow from 'Bump dependencies' to 'Bump Documentation'
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-31 14:40:50 +01:00
Ettore Di Giacinto
fad5868f7b
Rename job to 'bump-backends' in workflow
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-31 14:40:34 +01:00
LocalAI [bot]
1e5b9135df
chore: ⬆️ Update ggml-org/llama.cpp to 16724b5b6836a2d4b8936a5824d2ff27c52b4517 ( #6925 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-30 21:07:33 +00:00
LocalAI [bot]
36d19e23e0
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6921 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-30 18:58:08 +01:00
LocalAI [bot]
cba9d1aac0
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6919 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-30 17:26:18 +01:00
LocalAI [bot]
dd21a0d2f9
chore: ⬆️ Update ggml-org/llama.cpp to 3464bdac37027c5e9661621fc75ffcef3c19c6ef ( #6896 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-30 14:17:58 +01:00
LocalAI [bot]
302a43b3ae
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6911 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-30 09:54:24 +01:00
LocalAI [bot]
2955061b42
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6910 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-30 09:39:31 +01:00
LocalAI [bot]
84644ab693
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6908 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-30 09:20:23 +01:00
Ettore Di Giacinto
b8f40dde1e
feat: do also text match ( #6891 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-29 17:18:56 +01:00
LocalAI [bot]
a6c9789a54
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6884 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-29 10:56:57 +01:00
LocalAI [bot]
a48d9ce27c
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6879 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-29 08:19:51 +01:00
LocalAI [bot]
fb825a2708
chore: ⬆️ Update ggml-org/llama.cpp to 851553ea6b24cb39fd5fd188b437d777cb411de8 ( #6869 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-29 08:16:55 +01:00
LocalAI [bot]
5558dce449
chore: ⬆️ Update ggml-org/whisper.cpp to c62adfbd1ecdaea9e295c72d672992514a2d887c ( #6868 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-28 21:12:05 +00:00
LocalAI [bot]
cf74a11e65
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6864 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-28 17:20:57 +01:00
LocalAI [bot]
86b5deec81
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6863 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-28 16:23:57 +01:00
LocalAI [bot]
ba1b8e7757
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6862 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-28 15:46:06 +01:00
Ettore Di Giacinto
79b68fdc25
feat: add CPU variants for whisper.cpp ( #6855 )
...
* feat: add CPU variants for whisper.cpp
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Do not build variants on Darwin
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-28 11:05:07 +01:00
LocalAI [bot]
a946cb08b5
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6854 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-28 09:23:07 +01:00
Lukas Schaefer
d95d4992fe
feat: return complete audio for kokoro ( #6842 )
...
Signed-off-by: Lukas Schaefer <lukas@lschaefer.xyz >
2025-10-28 08:49:18 +01:00
LocalAI [bot]
e13cb8346d
chore: ⬆️ Update ggml-org/llama.cpp to 5a4ff43e7dd049e35942bc3d12361dab2f155544 ( #6841 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-28 08:48:21 +01:00
LocalAI [bot]
615c56503e
chore: ⬆️ Update ggml-org/whisper.cpp to f16c12f3f55f5bd3d6ac8cf2f31ab90a42c884d5 ( #6835 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-28 08:47:47 +01:00
dependabot[bot]
79a8edd8b9
chore(deps): bump fyne.io/fyne/v2 from 2.6.3 to 2.7.0 ( #6840 )
...
Bumps [fyne.io/fyne/v2](https://github.com/fyne-io/fyne ) from 2.6.3 to 2.7.0.
- [Release notes](https://github.com/fyne-io/fyne/releases )
- [Changelog](https://github.com/fyne-io/fyne/blob/master/CHANGELOG.md )
- [Commits](https://github.com/fyne-io/fyne/compare/v2.6.3...v2.7.0 )
---
updated-dependencies:
- dependency-name: fyne.io/fyne/v2
dependency-version: 2.7.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-28 04:57:03 +00:00
dependabot[bot]
8d138dd68f
chore(deps): bump github.com/gofiber/template/html/v2 from 2.1.2 to 2.1.3 ( #6832 )
...
chore(deps): bump github.com/gofiber/template/html/v2
Bumps [github.com/gofiber/template/html/v2](https://github.com/gofiber/template ) from 2.1.2 to 2.1.3.
- [Release notes](https://github.com/gofiber/template/releases )
- [Commits](https://github.com/gofiber/template/compare/ace/v2.1.2...ace/v2.1.3 )
---
updated-dependencies:
- dependency-name: github.com/gofiber/template/html/v2
dependency-version: 2.1.3
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-28 00:22:16 +00:00
dependabot[bot]
2b33844562
chore(deps): bump actions/download-artifact from 5 to 6 ( #6837 )
...
Bumps [actions/download-artifact](https://github.com/actions/download-artifact ) from 5 to 6.
- [Release notes](https://github.com/actions/download-artifact/releases )
- [Commits](https://github.com/actions/download-artifact/compare/v5...v6 )
---
updated-dependencies:
- dependency-name: actions/download-artifact
dependency-version: '6'
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-27 21:45:56 +01:00
dependabot[bot]
63e6721c2f
chore(deps): bump grpcio from 1.75.1 to 1.76.0 in /backend/python/diffusers ( #6839 )
...
chore(deps): bump grpcio in /backend/python/diffusers
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.75.1 to 1.76.0.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.75.1...v1.76.0 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.76.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-27 21:45:42 +01:00
Ettore Di Giacinto
4859d809aa
Update MCP link and add LocalAGI reference
...
Updated the link for Model Context Protocol (MCP) and added reference to LocalAGI's Agentic capabilities.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-27 21:44:41 +01:00
dependabot[bot]
be027b1ccd
chore(deps): bump grpcio from 1.75.1 to 1.76.0 in /backend/python/transformers ( #6828 )
...
chore(deps): bump grpcio in /backend/python/transformers
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.75.1 to 1.76.0.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.75.1...v1.76.0 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.76.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-27 21:32:31 +01:00
dependabot[bot]
3ecadeeb93
chore(deps): bump grpcio from 1.75.1 to 1.76.0 in /backend/python/exllama2 ( #6836 )
...
chore(deps): bump grpcio in /backend/python/exllama2
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.75.1 to 1.76.0.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.75.1...v1.76.0 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.76.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-27 21:32:13 +01:00
dependabot[bot]
4af3348f91
chore(deps): bump grpcio from 1.75.1 to 1.76.0 in /backend/python/vllm ( #6827 )
...
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.75.1 to 1.76.0.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.75.1...v1.76.0 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.76.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-27 21:31:47 +01:00
dependabot[bot]
dde08845bf
chore(deps): bump grpcio from 1.75.1 to 1.76.0 in /backend/python/bark ( #6826 )
...
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.75.1 to 1.76.0.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.75.1...v1.76.0 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.76.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-27 21:31:24 +01:00
dependabot[bot]
76d1ba168d
chore(deps): bump grpcio from 1.75.1 to 1.76.0 in /backend/python/common/template ( #6830 )
...
chore(deps): bump grpcio in /backend/python/common/template
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.75.1 to 1.76.0.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.75.1...v1.76.0 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.76.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-27 21:31:03 +01:00
dependabot[bot]
80605e4f66
chore(deps): bump grpcio from 1.75.1 to 1.76.0 in /backend/python/rerankers ( #6819 )
...
chore(deps): bump grpcio in /backend/python/rerankers
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.75.1 to 1.76.0.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.75.1...v1.76.0 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.76.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-27 21:30:46 +01:00
dependabot[bot]
5b99584a31
chore(deps): bump github.com/klauspost/cpuid/v2 from 2.2.10 to 2.3.0 ( #6821 )
...
Bumps [github.com/klauspost/cpuid/v2](https://github.com/klauspost/cpuid ) from 2.2.10 to 2.3.0.
- [Release notes](https://github.com/klauspost/cpuid/releases )
- [Changelog](https://github.com/klauspost/cpuid/blob/master/.goreleaser.yml )
- [Commits](https://github.com/klauspost/cpuid/compare/v2.2.10...v2.3.0 )
---
updated-dependencies:
- dependency-name: github.com/klauspost/cpuid/v2
dependency-version: 2.3.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-27 21:30:05 +01:00
dependabot[bot]
fc134b18fe
chore(deps): bump actions/upload-artifact from 4 to 5 ( #6824 )
...
Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact ) from 4 to 5.
- [Release notes](https://github.com/actions/upload-artifact/releases )
- [Commits](https://github.com/actions/upload-artifact/compare/v4...v5 )
---
updated-dependencies:
- dependency-name: actions/upload-artifact
dependency-version: '5'
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-27 21:29:35 +01:00
dependabot[bot]
c2006273c5
chore(deps): bump github.com/alecthomas/kong from 0.9.0 to 1.12.1 ( #6829 )
...
Bumps [github.com/alecthomas/kong](https://github.com/alecthomas/kong ) from 0.9.0 to 1.12.1.
- [Commits](https://github.com/alecthomas/kong/compare/v0.9.0...v1.12.1 )
---
updated-dependencies:
- dependency-name: github.com/alecthomas/kong
dependency-version: 1.12.1
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-27 21:29:00 +01:00
dependabot[bot]
5343889098
chore(deps): bump github.com/gofiber/swagger from 1.0.0 to 1.1.1 ( #6825 )
...
Bumps [github.com/gofiber/swagger](https://github.com/gofiber/swagger ) from 1.0.0 to 1.1.1.
- [Release notes](https://github.com/gofiber/swagger/releases )
- [Commits](https://github.com/gofiber/swagger/compare/v1.0.0...v1.1.1 )
---
updated-dependencies:
- dependency-name: github.com/gofiber/swagger
dependency-version: 1.1.1
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-27 21:27:59 +01:00
dependabot[bot]
c42afc56d9
chore(deps): bump mxschmitt/action-tmate from 3.22 to 3.23 ( #6831 )
...
Bumps [mxschmitt/action-tmate](https://github.com/mxschmitt/action-tmate ) from 3.22 to 3.23.
- [Release notes](https://github.com/mxschmitt/action-tmate/releases )
- [Changelog](https://github.com/mxschmitt/action-tmate/blob/master/RELEASE.md )
- [Commits](https://github.com/mxschmitt/action-tmate/compare/v3.22...v3.23 )
---
updated-dependencies:
- dependency-name: mxschmitt/action-tmate
dependency-version: '3.23'
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-27 21:27:36 +01:00
dependabot[bot]
53f44dac89
chore(deps): bump grpcio from 1.75.1 to 1.76.0 in /backend/python/coqui ( #6822 )
...
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.75.1 to 1.76.0.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.75.1...v1.76.0 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.76.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-27 21:27:05 +01:00
LocalAI [bot]
0468456fad
feat(swagger): update swagger ( #6834 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-27 21:24:28 +01:00
dependabot[bot]
df899ee26a
chore(deps): bump github.com/gofiber/contrib/fiberzerolog from 1.0.2 to 1.0.3 ( #6816 )
...
chore(deps): bump github.com/gofiber/contrib/fiberzerolog
Bumps [github.com/gofiber/contrib/fiberzerolog](https://github.com/gofiber/contrib ) from 1.0.2 to 1.0.3.
- [Release notes](https://github.com/gofiber/contrib/releases )
- [Commits](https://github.com/gofiber/contrib/compare/jwt/v1.0.2...jwt/v1.0.3 )
---
updated-dependencies:
- dependency-name: github.com/gofiber/contrib/fiberzerolog
dependency-version: 1.0.3
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-27 19:29:57 +00:00
LocalAI [bot]
93fe25468f
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6799 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-27 10:31:52 +01:00
Ettore Di Giacinto
238aad666e
chore(deps): bump cogito ( #6785 )
...
chore(deps): Bump cogito
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-27 10:07:31 +01:00
Gregory Mariani
4408ed4f88
feat(api): OpenAI video create enpoint integration ( #6777 )
...
* feat: add OpenAI-compatible /v1/videos endpoint
- Add VideoEndpoint handler with OpenAI request mapping
- Add MapOpenAIToVideo function to convert OpenAI format to LocalAI VideoRequest
- Add Swagger documentation for API endpoint
- Add Ginkgo unit tests for mapping logic
- Add Ginkgo integration test with embedded fake backend
Signed-off-by: Greg <marianigregory@pm.me >
* Apply suggestion from @mudler
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
* Apply suggestion from @mudler
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
* Apply suggestion from @mudler
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
* Apply suggestion from @mudler
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
* Apply suggestion from @mudler
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
* Apply suggestion from @mudler
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
---------
Signed-off-by: Greg <marianigregory@pm.me >
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-27 10:06:57 +01:00
Ettore Di Giacinto
5df1f59a3c
chore(model gallery): add allenai_olmocr-2-7b-1025 ( #6797 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-27 09:20:06 +01:00
LocalAI [bot]
8225697139
chore: ⬆️ Update ggml-org/llama.cpp to bbac6a26b2bd7f7c1f0831cb1e7b52734c66673b ( #6783 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-27 08:45:14 +01:00
LocalAI [bot]
0c0186d866
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6784 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-26 21:22:58 +01:00
LocalAI [bot]
ce2f8828f9
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6776 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-26 18:01:09 +01:00
LocalAI [bot]
7a8565a45e
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6767 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-26 09:27:17 +01:00
LocalAI [bot]
192589a17f
chore: ⬆️ Update ggml-org/llama.cpp to 5d195f17bc60eacc15cfb929f9403cf29ccdf419 ( #6757 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-25 21:14:43 +00:00
LocalAI [bot]
28ab73d4a1
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6721 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-25 17:23:43 +02:00
LocalAI [bot]
ed4ac0b61e
chore: ⬆️ Update ggml-org/llama.cpp to 55945d2ef51b93821d4b6f4a9b994393344a90db ( #6729 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-24 21:11:56 +00:00
LocalAI [bot]
e41d8b65ce
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6706 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-24 09:04:00 +02:00
Chakib Benziane
c28e5b39d6
fix: llama dockerfile make package ( #6694 )
...
the make package rule does not currently always run resulting in an
empty scratch image.
- added `make -B` flag to force the `make package` rule
Signed-off-by: blob42 <contact@blob42.xyz >
2025-10-24 09:03:11 +02:00
LocalAI [bot]
b66bd2706f
chore: ⬆️ Update ggml-org/llama.cpp to 0bf47a1dbba4d36f2aff4e8c34b06210ba34e688 ( #6703 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-23 21:10:51 +00:00
LocalAI [bot]
fa7a9d96f8
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6697 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-23 16:43:24 +02:00
LocalAI [bot]
61d972a2ef
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6691 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-23 11:27:04 +02:00
LocalAI [bot]
fffdbc31c6
chore: ⬆️ Update ggml-org/whisper.cpp to 322c2adb753a9506f0becee134a7f75e2a6b5687 ( #6677 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-23 09:32:20 +02:00
Chakib Benziane
32c0ab3a7f
fix: properly terminate llama.cpp kv_overrides array with empty key + updated doc ( #6672 )
...
* fix: properly terminate kv_overrides array with empty key
The llama model loading function expects KV overrides to be terminated
with an empty key (key[0] == 0). Previously, the kv_overrides vector was
not being properly terminated, causing an assertion failure.
This commit ensures that after parsing all KV override strings, we add a
final terminating entry with an empty key to satisfy the C-style array
termination requirement. This fixes the assertion error and allows the
model to load correctly with custom KV overrides.
Fixes #6643
- Also included a reference to the usage of the `overrides` option in
the advanced-usage section.
Signed-off-by: blob42 <contact@blob42.xyz >
* doc: document the `overrides` option
---------
Signed-off-by: blob42 <contact@blob42.xyz >
2025-10-23 09:31:55 +02:00
LocalAI [bot]
24ce79a67c
chore: ⬆️ Update ggml-org/llama.cpp to a2e0088d9242bd9e57f8b852b05a6e47843b5a45 ( #6676 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-22 21:05:27 +00:00
Ettore Di Giacinto
bfa8530088
Change model note to reflect text-only version
...
Updated note to indicate this is a text-only version.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-22 10:55:25 +02:00
LocalAI [bot]
4278144dd5
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6664 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-22 10:34:06 +02:00
LocalAI [bot]
79fa4d691e
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6658 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-22 09:40:19 +02:00
LocalAI [bot]
7a3d9ee5c1
chore: ⬆️ Update ggml-org/llama.cpp to 03792ad93609fc67e41041c6347d9aa14e5e0d74 ( #6651 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-21 21:15:27 +00:00
LocalAI [bot]
22923d3b23
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6646 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-21 19:30:14 +02:00
LocalAI [bot]
d32a459209
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6640 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-21 15:38:23 +02:00
Ettore Di Giacinto
47b2a502dd
Revert "chore(deps): bump securego/gosec from 2.22.9 to 2.22.10" ( #6638 )
...
Revert "chore(deps): bump securego/gosec from 2.22.9 to 2.22.10 (#6599 )"
This reverts commit 18810038f5 .
2025-10-21 12:31:16 +02:00
LocalAI [bot]
b85f339eb4
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6636 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-21 12:14:38 +02:00
LocalAI [bot]
8821865eac
chore: ⬆️ Update ggml-org/whisper.cpp to 23c19308d8a5786c65effa4570204a881660ff31 ( #6622 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-21 09:22:50 +02:00
LocalAI [bot]
4b30846d57
chore: ⬆️ Update ggml-org/llama.cpp to 84bf3c677857279037adf67cdcfd89eaa4ca9281 ( #6621 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-21 09:22:03 +02:00
dependabot[bot]
7a35986407
chore(deps): bump github.com/gofrs/flock from 0.12.1 to 0.13.0 ( #6598 )
...
Bumps [github.com/gofrs/flock](https://github.com/gofrs/flock ) from 0.12.1 to 0.13.0.
- [Release notes](https://github.com/gofrs/flock/releases )
- [Commits](https://github.com/gofrs/flock/compare/v0.12.1...v0.13.0 )
---
updated-dependencies:
- dependency-name: github.com/gofrs/flock
dependency-version: 0.13.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-20 22:54:53 +02:00
dependabot[bot]
ee34aa7bd5
chore(deps): bump github.com/onsi/ginkgo/v2 from 2.25.3 to 2.26.0 ( #6601 )
...
Bumps [github.com/onsi/ginkgo/v2](https://github.com/onsi/ginkgo ) from 2.25.3 to 2.26.0.
- [Release notes](https://github.com/onsi/ginkgo/releases )
- [Changelog](https://github.com/onsi/ginkgo/blob/master/CHANGELOG.md )
- [Commits](https://github.com/onsi/ginkgo/compare/v2.25.3...v2.26.0 )
---
updated-dependencies:
- dependency-name: github.com/onsi/ginkgo/v2
dependency-version: 2.26.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-20 22:54:20 +02:00
dependabot[bot]
40cf798dfe
chore(deps): bump github.com/gpustack/gguf-parser-go from 0.17.0 to 0.22.1 ( #6602 )
...
chore(deps): bump github.com/gpustack/gguf-parser-go
Bumps [github.com/gpustack/gguf-parser-go](https://github.com/gpustack/gguf-parser-go ) from 0.17.0 to 0.22.1.
- [Release notes](https://github.com/gpustack/gguf-parser-go/releases )
- [Commits](https://github.com/gpustack/gguf-parser-go/compare/v0.17.0...v0.22.1 )
---
updated-dependencies:
- dependency-name: github.com/gpustack/gguf-parser-go
dependency-version: 0.22.1
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-20 22:53:47 +02:00
dependabot[bot]
18810038f5
chore(deps): bump securego/gosec from 2.22.9 to 2.22.10 ( #6599 )
...
Bumps [securego/gosec](https://github.com/securego/gosec ) from 2.22.9 to 2.22.10.
- [Release notes](https://github.com/securego/gosec/releases )
- [Changelog](https://github.com/securego/gosec/blob/master/.goreleaser.yml )
- [Commits](https://github.com/securego/gosec/compare/v2.22.9...v2.22.10 )
---
updated-dependencies:
- dependency-name: securego/gosec
dependency-version: 2.22.10
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-20 22:53:18 +02:00
dependabot[bot]
8fb79bc6f6
chore(deps): bump github.com/tmc/langchaingo from 0.1.13 to 0.1.14 ( #6604 )
...
Bumps [github.com/tmc/langchaingo](https://github.com/tmc/langchaingo ) from 0.1.13 to 0.1.14.
- [Release notes](https://github.com/tmc/langchaingo/releases )
- [Commits](https://github.com/tmc/langchaingo/compare/v0.1.13...v0.1.14 )
---
updated-dependencies:
- dependency-name: github.com/tmc/langchaingo
dependency-version: 0.1.14
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-20 22:52:50 +02:00
dependabot[bot]
4b5ad1405f
chore(deps): bump github.com/prometheus/client_golang from 1.23.0 to 1.23.2 ( #6600 )
...
chore(deps): bump github.com/prometheus/client_golang
Bumps [github.com/prometheus/client_golang](https://github.com/prometheus/client_golang ) from 1.23.0 to 1.23.2.
- [Release notes](https://github.com/prometheus/client_golang/releases )
- [Changelog](https://github.com/prometheus/client_golang/blob/main/CHANGELOG.md )
- [Commits](https://github.com/prometheus/client_golang/compare/v1.23.0...v1.23.2 )
---
updated-dependencies:
- dependency-name: github.com/prometheus/client_golang
dependency-version: 1.23.2
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-20 19:39:11 +00:00
LocalAI [bot]
4493078cdd
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6597 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-20 19:54:38 +02:00
LocalAI [bot]
7f68c89cbe
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6581 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-20 08:58:29 +02:00
LocalAI [bot]
69adc46936
chore: ⬆️ Update ggml-org/llama.cpp to cec5edbcaec69bbf6d5851cabce4ac148be41701 ( #6576 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-19 21:31:47 +00:00
Ettore Di Giacinto
d22439918f
Add icon URL for Llama-3.2-3B-small model
...
Added an icon URL for the Llama-3.2-3B-small Shiro roleplay model.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-19 21:44:46 +02:00
LocalAI [bot]
103d4e87e5
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6566 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-19 21:43:58 +02:00
Ettore Di Giacinto
8c5ba9e0d7
Update verbamaxima-12b-i1 details in index.yaml
...
Added an icon and updated the name and description for verbamaxima-12b-i1.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-19 10:23:37 +02:00
LocalAI [bot]
f1b713df08
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6557 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-19 09:44:01 +02:00
LocalAI [bot]
f94b89c1b5
chore: ⬆️ Update ggml-org/llama.cpp to ee09828cb057460b369576410601a3a09279e23c ( #6550 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-18 21:09:46 +00:00
Ettore Di Giacinto
a1b056737a
feat(ui): add system backend metadata and deletion in index ( #6546 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-18 19:32:11 +02:00
Ettore Di Giacinto
a22f6a499d
feat(mcp): add planning and reevaluation ( #6541 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-18 18:26:32 +02:00
Ettore Di Giacinto
e5bf2a9a11
chore(gallery agent): small fixes
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-18 18:24:25 +02:00
Ettore Di Giacinto
05aba5a311
chore(gallery agent): add execution summary in the PR message
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-18 18:17:42 +02:00
LocalAI [bot]
354bf5debb
chore(model gallery): 🤖 add new models via gallery agent ( #6536 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-18 13:48:19 +02:00
LocalAI [bot]
7f88abb3b1
chore(model gallery): 🤖 add new models via gallery agent ( #6534 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-18 13:01:01 +02:00
LocalAI [bot]
36b3a538f8
chore(model gallery): 🤖 add new models via gallery agent ( #6524 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-18 09:35:51 +02:00
LocalAI [bot]
e293b65ad9
chore(model gallery): 🤖 add new models via gallery agent ( #6522 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-17 23:18:28 +02:00
LocalAI [bot]
cce185b345
chore: ⬆️ Update ggml-org/llama.cpp to 66b0dbcb2d462e7b70ba5a69ee8c3899ac2efb1c ( #6520 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-17 21:14:57 +00:00
LocalAI [bot]
03ed4382c7
chore(model gallery): 🤖 add new models via gallery agent ( #6519 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-17 21:42:31 +02:00
LocalAI [bot]
1c73e10676
chore(model gallery): 🤖 add new models via gallery agent ( #6516 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-17 20:02:10 +02:00
LocalAI [bot]
4ade65f959
chore(model gallery): 🤖 add new models via gallery agent ( #6515 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-17 18:15:44 +02:00
LocalAI [bot]
c54f5cdf12
chore(model gallery): 🤖 add new models via gallery agent ( #6512 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-17 16:04:54 +02:00
Ettore Di Giacinto
33c48164d7
Change sleep duration in tests from 10 to 20 seconds
...
Increased sleep duration in tests to 20 seconds.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-17 14:56:52 +02:00
Ettore Di Giacinto
7aed3b3bac
Add icon URL for Magidonia 24B v4.2.0
...
Added an icon URL for the Magidonia model.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-17 11:14:02 +02:00
LocalAI [bot]
9e349c715e
chore(model gallery): 🤖 add new models via gallery agent ( #6507 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-17 11:12:34 +02:00
Robbie Cronin
639ecb59b3
fix: handle multi-type arrays in JSON schema to prevent panic ( #6495 )
...
Signed-off-by: robert-cronin <robert.owen.cronin@gmail.com >
2025-10-17 11:07:37 +02:00
LocalAI [bot]
bfb0794f87
chore(model gallery): 🤖 add new models via gallery agent ( #6504 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-17 09:25:40 +02:00
Ettore Di Giacinto
05f1e9e757
chore(model gallery): add mira-v1.7-27b-i1 ( #6503 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-17 09:19:42 +02:00
Ettore Di Giacinto
1ca6f6dada
Revert "Reapply "chore(ci): change notifications settings""
...
This reverts commit f0245fa36c .
2025-10-17 09:15:12 +02:00
LocalAI [bot]
bc5397bcfc
chore(model gallery): 🤖 add new models via gallery agent ( #6501 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-17 09:13:11 +02:00
Ettore Di Giacinto
f452a027a2
chore(gallery search): fuzzy with case insentivie ( #6490 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-17 09:02:28 +02:00
LocalAI [bot]
7bac49fb87
chore: ⬆️ Update ggml-org/llama.cpp to 1bb4f43380944e94c9a86e305789ba103f5e62bd ( #6488 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-17 09:01:11 +02:00
Ettore Di Giacinto
02300cfbd1
Change cron schedule to run every hour
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-16 23:10:33 +02:00
Ettore Di Giacinto
17c5c732c7
Update condition for Dependabot job in workflow
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-16 23:09:32 +02:00
Ettore Di Giacinto
10a66938f9
fix: guard from potential deadlock with requests in flight ( #6484 )
...
* fix(watchdog): guard from potential deadlock with requests in flight
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Improve locking when loading models
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-16 21:28:19 +02:00
Ettore Di Giacinto
f0245fa36c
Reapply "chore(ci): change notifications settings"
...
This reverts commit 03096154d4 .
2025-10-16 18:50:55 +02:00
Ettore Di Giacinto
83534f8e00
feat(gallery): add fuzzy search ( #6481 )
...
chore(model gallery): add fuzzy search
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-16 18:51:33 +02:00
Ettore Di Giacinto
75eaf8c853
Increase default LIMIT from 5 to 15
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-16 18:16:43 +02:00
Ettore Di Giacinto
03096154d4
Revert "chore(ci): change notifications settings"
...
This reverts commit 9109e5c149 .
2025-10-16 17:03:52 +02:00
LocalAI [bot]
22c9e8c09e
gallery: 🤖 add new models via gallery agent ( #6480 )
...
🤖 Add new models to gallery via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-16 16:56:34 +02:00
Ettore Di Giacinto
da16727ad6
chore(ci): keep conventional commits
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-16 16:35:03 +02:00
Ettore Di Giacinto
ad44df6d83
chore(ci): run notifications also from PRs merged from forks
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-16 16:27:04 +02:00
LocalAI [bot]
276c552583
gallery: 🤖 add new models via gallery agent ( #6478 )
...
🤖 Add new models to gallery via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-16 16:21:42 +02:00
Ettore Di Giacinto
9109e5c149
chore(ci): change notifications settings
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-16 16:12:40 +02:00
Ettore Di Giacinto
71a84b91e3
chore(ci): fix gallery agent linting issues
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-16 16:05:17 +02:00
Ettore Di Giacinto
209d40be71
Revert "chore(gallery agent): try fixing linting errors"
...
This reverts commit b0eb1ab2a1 .
2025-10-16 10:32:21 +02:00
Ettore Di Giacinto
bfd76805e8
Revert "chore(gallery agent): try fixing linting errors"
...
This reverts commit 561aa5e443 .
2025-10-16 10:30:27 +02:00
Ettore Di Giacinto
561aa5e443
chore(gallery agent): try fixing linting errors
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-16 10:05:37 +02:00
Ettore Di Giacinto
b0eb1ab2a1
chore(gallery agent): try fixing linting errors
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-16 09:44:04 +02:00
Ettore Di Giacinto
1208fb6fa1
chore(gallery agent): support non-gguf
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-16 09:35:42 +02:00
Ettore Di Giacinto
f98fe85c42
chore(gallery agent): try to fix linting
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-16 09:14:28 +02:00
Ettore Di Giacinto
167c183c84
chore(gallery agent): avoid to commit the binary
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-15 23:30:14 +02:00
Ettore Di Giacinto
244e47e1e0
chore(gallery agent): filter out existing
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-15 23:26:47 +02:00
LocalAI [bot]
9680a0b0fe
chore: ⬆️ Update ggml-org/llama.cpp to 466c1911ab736f0b7366127edee99f8ee5687417 ( #6463 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-15 23:21:35 +02:00
LocalAI [bot]
acbd10a661
chore: ⬆️ Update ggml-org/whisper.cpp to 4979e04f5dcaccb36057e059bbaed8a2f5288315 ( #6462 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-15 23:21:05 +02:00
Ettore Di Giacinto
c6b989be13
chore(gallery agent): clean content
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-15 23:20:44 +02:00
Ettore Di Giacinto
670103705c
chore(gallery agent): separate PRs
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-15 23:16:39 +02:00
Ettore Di Giacinto
cb90bd226e
chore(gallery agent): bump repos to fetch
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-15 23:12:21 +02:00
Ettore Di Giacinto
df9b2abf84
chore(gallery agent): Use real readme for selection
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-15 23:11:49 +02:00
Ettore Di Giacinto
582114bda9
feat(ci): add gallery updater agent ( #6467 )
...
* WIP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add CI workflow
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-15 23:00:14 +02:00
Ettore Di Giacinto
91ffe5ac38
chore(model gallery): add boomerang-qwen3-4.9b ( #6461 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-15 19:31:26 +02:00
Ettore Di Giacinto
8a58d76254
chore(model gallery): add boomerang-qwen3-2.3b ( #6460 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-15 19:30:24 +02:00
Ettore Di Giacinto
c3442fe574
chore(model gallery): add demyagent-4b-i1 ( #6459 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-15 19:27:59 +02:00
Ettore Di Giacinto
1087bd217e
chore(model gallery): add qwen3-4b-ra-sft ( #6458 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-15 19:22:29 +02:00
LocalAI [bot]
7ed3666d2e
chore: ⬆️ Update ggml-org/llama.cpp to fa882fd2b1bcb663de23af06fdc391489d05b007 ( #6454 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-14 21:08:17 +00:00
LocalAI [bot]
2e2e89e499
chore: ⬆️ Update ggml-org/llama.cpp to e60f241eacec42d3bd7c9edd37d236ebf35132a8 ( #6452 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-14 09:06:39 +02:00
dependabot[bot]
13c9c20f42
chore(deps): bump google.golang.org/grpc from 1.67.1 to 1.76.0 ( #6449 )
...
Bumps [google.golang.org/grpc](https://github.com/grpc/grpc-go ) from 1.67.1 to 1.76.0.
- [Release notes](https://github.com/grpc/grpc-go/releases )
- [Commits](https://github.com/grpc/grpc-go/compare/v1.67.1...v1.76.0 )
---
updated-dependencies:
- dependency-name: google.golang.org/grpc
dependency-version: 1.76.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-13 22:13:35 +00:00
dependabot[bot]
b3d3988d85
chore(deps): bump github.com/ebitengine/purego from 0.8.4 to 0.9.0 ( #6450 )
...
Bumps [github.com/ebitengine/purego](https://github.com/ebitengine/purego ) from 0.8.4 to 0.9.0.
- [Release notes](https://github.com/ebitengine/purego/releases )
- [Commits](https://github.com/ebitengine/purego/compare/v0.8.4...v0.9.0 )
---
updated-dependencies:
- dependency-name: github.com/ebitengine/purego
dependency-version: 0.9.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-13 21:00:17 +00:00
Richard Palethorpe
0529c7d0a0
fix(realtime): Add transcription session created event, match OpenAI behavior ( #6445 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-10-13 21:48:13 +02:00
dependabot[bot]
af31a77061
chore(deps): bump dario.cat/mergo from 1.0.1 to 1.0.2 ( #6447 )
...
Bumps [dario.cat/mergo](https://github.com/imdario/mergo ) from 1.0.1 to 1.0.2.
- [Release notes](https://github.com/imdario/mergo/releases )
- [Commits](https://github.com/imdario/mergo/compare/v1.0.1...v1.0.2 )
---
updated-dependencies:
- dependency-name: dario.cat/mergo
dependency-version: 1.0.2
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-13 21:47:24 +02:00
dependabot[bot]
2d8956167f
chore(deps): bump github.com/schollz/progressbar/v3 from 3.14.4 to 3.18.0 ( #6446 )
...
chore(deps): bump github.com/schollz/progressbar/v3
Bumps [github.com/schollz/progressbar/v3](https://github.com/schollz/progressbar ) from 3.14.4 to 3.18.0.
- [Release notes](https://github.com/schollz/progressbar/releases )
- [Commits](https://github.com/schollz/progressbar/compare/v3.14.4...v3.18.0 )
---
updated-dependencies:
- dependency-name: github.com/schollz/progressbar/v3
dependency-version: 3.18.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-13 21:47:02 +02:00
dependabot[bot]
509f85f82c
chore(deps): bump github.com/containerd/containerd from 1.7.27 to 1.7.28 ( #6448 )
...
Bumps [github.com/containerd/containerd](https://github.com/containerd/containerd ) from 1.7.27 to 1.7.28.
- [Release notes](https://github.com/containerd/containerd/releases )
- [Changelog](https://github.com/containerd/containerd/blob/main/RELEASES.md )
- [Commits](https://github.com/containerd/containerd/compare/v1.7.27...v1.7.28 )
---
updated-dependencies:
- dependency-name: github.com/containerd/containerd
dependency-version: 1.7.28
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-13 21:46:36 +02:00
dependabot[bot]
bb2b377b18
chore(deps): bump github/codeql-action from 3 to 4 ( #6451 )
...
Bumps [github/codeql-action](https://github.com/github/codeql-action ) from 3 to 4.
- [Release notes](https://github.com/github/codeql-action/releases )
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md )
- [Commits](https://github.com/github/codeql-action/compare/v3...v4 )
---
updated-dependencies:
- dependency-name: github/codeql-action
dependency-version: '4'
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-13 21:45:41 +02:00
Ettore Di Giacinto
48917889ce
Fix formatting in gallery/index.yaml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-13 21:45:32 +02:00
Ettore Di Giacinto
ef754259b0
chore(model gallery): add gustavecortal_beck ( #6444 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-13 11:59:25 +02:00
Ettore Di Giacinto
7e26f28113
chore(model gallery): add agentflow_agentflow-planner-7b ( #6443 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-13 11:08:42 +02:00
Ettore Di Giacinto
d7c8129549
chore(model gallery): add yanolja_yanoljanext-rosetta-12b-2510 ( #6442 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-13 11:00:48 +02:00
LocalAI [bot]
3a8fbb698e
chore: ⬆️ Update ggml-org/llama.cpp to a31cf36ad946a13b3a646bf0dadf2a481e89f944 ( #6440 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-13 07:54:03 +02:00
LocalAI [bot]
b1ef34ef9f
chore: ⬆️ Update ggml-org/whisper.cpp to a91dd3be72f70dd1b3cb6e252f35fa17b93f596c ( #6439 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-13 07:51:47 +02:00
Ettore Di Giacinto
b7822250fe
fix(ui): models without config don't have a .Name field ( #6438 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-12 15:24:22 +02:00
dependabot[bot]
05055f7e95
chore(deps): bump github.com/quic-go/quic-go from 0.54.0 to 0.54.1 in the go_modules group across 1 directory ( #6431 )
...
chore(deps): bump github.com/quic-go/quic-go
Bumps the go_modules group with 1 update in the / directory: [github.com/quic-go/quic-go](https://github.com/quic-go/quic-go ).
Updates `github.com/quic-go/quic-go` from 0.54.0 to 0.54.1
- [Release notes](https://github.com/quic-go/quic-go/releases )
- [Commits](https://github.com/quic-go/quic-go/compare/v0.54.0...v0.54.1 )
---
updated-dependencies:
- dependency-name: github.com/quic-go/quic-go
dependency-version: 0.54.1
dependency-type: indirect
dependency-group: go_modules
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-12 13:29:39 +02:00
LocalAI [bot]
c856d7dc73
chore: ⬆️ Update ggml-org/llama.cpp to 11f0af5504252e453d57406a935480c909e3ff37 ( #6437 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-12 09:02:31 +02:00
Ettore Di Giacinto
69d565e55d
fix(mcp): make responses compliant to OpenAI APIs ( #6436 )
...
The result is enclosed in the Message field.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-11 16:58:11 +02:00
LocalAI [bot]
fa6bbd9fa2
chore: ⬆️ Update ggml-org/llama.cpp to e60f01d941bc5b7fae62dd57fee4cec76ec0ea6e ( #6434 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-11 09:30:48 +02:00
LocalAI [bot]
3f767121d2
chore: ⬆️ Update ggml-org/whisper.cpp to 85871a946971955c635f56bca24ea2a37fed6324 ( #6435 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-10 21:16:25 +00:00
Ettore Di Giacinto
e963e16bc5
Remove model size guidance from FAQ
...
Removed redundant information about model sizes in the WebUI.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-10 21:50:56 +02:00
Ettore Di Giacinto
1e9b115251
chore(docs): enhancements and clarifications ( #6433 )
...
chore(docs): Small enhancements
Fixes: https://github.com/mudler/LocalAI/issues/6250
Relates to: https://github.com/mudler/LocalAI/issues/6251
Fixes: https://github.com/mudler/LocalAI/issues/6249
Fixes: https://github.com/mudler/LocalAI/issues/6250
Fixes: https://github.com/mudler/LocalAI/issues/6253
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-10 21:31:11 +02:00
Ettore Di Giacinto
cd1e1124ea
fix(llama.cpp): correctly set grammar triggers ( #6432 )
...
* fix(llama.cpp): correctly set grammar triggers
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Do not enable lazy by default
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-10 19:50:17 +02:00
Ettore Di Giacinto
81b31b4283
chore(ui): display models and backends in tables ( #6430 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-10 18:21:34 +02:00
Ettore Di Giacinto
d763bce46d
fix(mcp): register sessions ( #6429 )
...
Because of this, the first calls to the MCP endpoint would fail and
later would succeeds thanks to the cache.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-10 18:03:07 +02:00
Ettore Di Giacinto
4aac0ef42e
chore(api): return json errors ( #6428 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-10 17:17:12 +02:00
Ettore Di Giacinto
7a36e8d967
chore(ui): skip duplicated entries in search list ( #6425 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-10 16:35:05 +02:00
Ettore Di Giacinto
dc2be93412
chore(ui): simplify editing and importing models via YAML ( #6424 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-10 15:10:13 +02:00
Ettore Di Giacinto
69a2b91495
chore: change color palette such as is closer to the logo ( #6423 )
...
chore(ui): restyle color palette closer to logo
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-10 12:31:58 +02:00
Ettore Di Giacinto
791bc769c1
chore(deps): bump llama.cpp to '1deee0f8d494981c32597dca8b5f8696d399b0f2' ( #6421 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-10 09:51:22 +02:00
LocalAI [bot]
a15a1f07e3
chore: ⬆️ Update ggml-org/whisper.cpp to 98930fded1c06e601a38903607af262f04893880 ( #6420 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-09 23:45:15 +02:00
Ettore Di Giacinto
c6f0b44228
feat(ui): use Alpine.js and drop HTMX ( #6418 )
...
* feat(ui): use Alpine.js and drop HTMX
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Display pending ops
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Show in progress ops
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* more stable sorting
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* minor fixup
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fix clipboard copy
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Cleanup
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-09 22:37:06 +02:00
Ettore Di Giacinto
cb0ed55d89
feat(neutts): add backend ( #6404 )
...
* feat(neutts): add backend
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(ci): add images to CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(gallery): add Neutts
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Make it work with quantized versions
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Docs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Apply suggestion from @mudler
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
* Apply suggestion from @mudler
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
* Apply suggestion from @mudler
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-09 21:51:28 +02:00
Ettore Di Giacinto
2fe97110fd
Change backend from 'whisper-vad' to 'whisper'
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-09 17:58:24 +02:00
Ettore Di Giacinto
fa8037b21d
chore(model gallery): add ai21labs_ai21-jamba-reasoning-3b ( #6417 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-09 15:00:56 +02:00
dependabot[bot]
99a72a4b11
chore(deps): bump github.com/multiformats/go-multiaddr from 0.16.0 to 0.16.1 ( #6277 )
...
chore(deps): bump github.com/multiformats/go-multiaddr
Bumps [github.com/multiformats/go-multiaddr](https://github.com/multiformats/go-multiaddr ) from 0.16.0 to 0.16.1.
- [Release notes](https://github.com/multiformats/go-multiaddr/releases )
- [Commits](https://github.com/multiformats/go-multiaddr/compare/v0.16.0...v0.16.1 )
---
updated-dependencies:
- dependency-name: github.com/multiformats/go-multiaddr
dependency-version: 0.16.1
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-09 12:26:24 +00:00
dependabot[bot]
1a52ce1bd4
chore(deps): bump github.com/docker/docker from 28.3.3+incompatible to 28.5.0+incompatible ( #6399 )
...
chore(deps): bump github.com/docker/docker
Bumps [github.com/docker/docker](https://github.com/docker/docker ) from 28.3.3+incompatible to 28.5.0+incompatible.
- [Release notes](https://github.com/docker/docker/releases )
- [Commits](https://github.com/docker/docker/compare/v28.3.3...v28.5.0 )
---
updated-dependencies:
- dependency-name: github.com/docker/docker
dependency-version: 28.5.0+incompatible
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-09 13:01:44 +02:00
Ettore Di Giacinto
925d752f8d
chore(model gallery): add qwen3-deckard-large-almost-human-6b-i1 ( #6416 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-09 12:45:38 +02:00
Ettore Di Giacinto
c0b9d00f35
chore(model gallery): add gliese-4b-oss-0410-i1 ( #6415 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-09 12:43:35 +02:00
Ettore Di Giacinto
fcf8d41a00
chore(model gallery): add liquidai_lfm2-8b-a1b ( #6414 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-09 12:41:53 +02:00
Ettore Di Giacinto
27c4161401
chore: update cogito and simplify MCP logics ( #6413 )
...
* chore: update cogito and simplify MCP logics
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Refine signal handling
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-09 12:36:45 +02:00
LocalAI [bot]
459b6ab86d
chore: ⬆️ Update ggml-org/whisper.cpp to 8877dfc11a9322ce1990958494cf2e41c54657eb ( #6411 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-09 08:17:49 +02:00
LocalAI [bot]
336257cc3c
chore: ⬆️ Update ggml-org/llama.cpp to 9d0882840e6c3fb62965d03af0e22880ea90e012 ( #6410 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-09 08:17:10 +02:00
Ettore Di Giacinto
df46a438b8
fix(grammars): handle empty parameters on object types ( #6409 )
...
fix: handle empty parameters on object types
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-08 19:13:40 +02:00
LocalAI [bot]
5e1d809904
chore: ⬆️ Update ggml-org/llama.cpp to aeaf8a36f06b5810f5ae4bbefe26edb33925cf5e ( #6408 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-08 08:01:08 +02:00
Ettore Di Giacinto
a9c7ce7275
fix(chatterbox): add poetry
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-07 17:46:52 +02:00
dependabot[bot]
8c47c8c8ed
chore(deps): bump github.com/docker/go-connections from 0.5.0 to 0.6.0 ( #6393 )
...
Bumps [github.com/docker/go-connections](https://github.com/docker/go-connections ) from 0.5.0 to 0.6.0.
- [Commits](https://github.com/docker/go-connections/compare/v0.5.0...v0.6.0 )
---
updated-dependencies:
- dependency-name: github.com/docker/go-connections
dependency-version: 0.6.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-07 16:29:33 +02:00
dependabot[bot]
8e8d427549
chore(deps): bump github.com/nikolalohinski/gonja/v2 from 2.3.2 to 2.4.1 ( #6394 )
...
Bumps [github.com/nikolalohinski/gonja/v2](https://github.com/nikolalohinski/gonja ) from 2.3.2 to 2.4.1.
- [Commits](https://github.com/nikolalohinski/gonja/compare/v2.3.2...v2.4.1 )
---
updated-dependencies:
- dependency-name: github.com/nikolalohinski/gonja/v2
dependency-version: 2.4.1
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-07 16:29:09 +02:00
dependabot[bot]
ee251115f4
chore(deps): bump github.com/rs/zerolog from 1.33.0 to 1.34.0 ( #6274 )
...
Bumps [github.com/rs/zerolog](https://github.com/rs/zerolog ) from 1.33.0 to 1.34.0.
- [Commits](https://github.com/rs/zerolog/compare/v1.33.0...v1.34.0 )
---
updated-dependencies:
- dependency-name: github.com/rs/zerolog
dependency-version: 1.34.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-07 16:28:39 +02:00
dependabot[bot]
661e66090c
chore(deps): bump actions/stale from 10.0.0 to 10.1.0 ( #6392 )
...
Bumps [actions/stale](https://github.com/actions/stale ) from 10.0.0 to 10.1.0.
- [Release notes](https://github.com/actions/stale/releases )
- [Changelog](https://github.com/actions/stale/blob/main/CHANGELOG.md )
- [Commits](3a9db7e6a4...5f858e3efb )
---
updated-dependencies:
- dependency-name: actions/stale
dependency-version: 10.1.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-07 16:28:20 +02:00
Ettore Di Giacinto
c38564e22c
Update docs
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-07 16:25:03 +02:00
Ettore Di Giacinto
20f1e842b3
fix(chatterbox): chunk long text ( #6407 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-07 12:41:34 +02:00
Ettore Di Giacinto
aa8965b634
Update mcp.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-07 11:45:07 +02:00
Ettore Di Giacinto
35c676188b
feat(ui): display in index when a model supports MCP ( #6406 )
...
Display in index when a model supports MCP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-07 11:43:24 +02:00
Ettore Di Giacinto
183559bb98
chore(docs): add MCP example ( #6405 )
...
docs update
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-07 11:42:28 +02:00
LocalAI [bot]
1123a5c49c
chore: ⬆️ Update ggml-org/whisper.cpp to c8223a8548ad64435266e551385fc51aca9ee8ab ( #6402 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-07 08:50:22 +02:00
LocalAI [bot]
6f17c260a7
chore: ⬆️ Update ggml-org/llama.cpp to 3df2244df40c67dfd6ad548b40ccc507a066af2b ( #6401 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-07 08:44:02 +02:00
Ettore Di Giacinto
da6278aae9
feat(api): support both /v1 and not on openai routes ( #6403 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-07 00:06:06 +02:00
Ettore Di Giacinto
2e51871ad5
feat(ui): add button to enable Agentic MCP ( #6400 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-07 00:05:34 +02:00
Ettore Di Giacinto
8067d25710
chore(model gallery): add liquidai_lfm2-350m-math ( #6391 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-06 09:12:10 +02:00
Ettore Di Giacinto
cb2df6c5bf
chore(model gallery): add liquidai_lfm2-1.2b-tool ( #6390 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-06 09:10:20 +02:00
Ettore Di Giacinto
07e1519b3f
chore(model gallery): add liquidai_lfm2-1.2b-rag ( #6389 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-06 09:08:07 +02:00
Ettore Di Giacinto
8fc41673fa
chore(model gallery): add liquidai_lfm2-1.2b-extract ( #6388 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-06 09:06:08 +02:00
Ettore Di Giacinto
fff0e5911b
chore(model gallery): add liquidai_lfm2-350m-extract ( #6387 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-06 09:03:37 +02:00
Ettore Di Giacinto
09346bdc06
chore(model gallery): add lemon07r_vellummini-0.1-qwen3-14b ( #6386 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-06 08:46:09 +02:00
LocalAI [bot]
d4d42740c8
chore: ⬆️ Update ggml-org/llama.cpp to ca71fb9b368e3db96e028f80c4c9df6b6b370edd ( #6385 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-06 08:24:38 +02:00
LocalAI [bot]
5de7a43319
feat(swagger): update swagger ( #6384 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-05 22:14:10 +02:00
Ettore Di Giacinto
85e27ec74c
feat: add agent options to model config ( #6383 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-05 21:54:04 +02:00
Ettore Di Giacinto
698205a2f3
Add links to Awesome MCPs and MCPs by mudler
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-05 21:27:44 +02:00
Ettore Di Giacinto
3ed582b091
Update README with new project links
...
Added links to Agentic Libraries and MCPs in README.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-05 19:38:47 +02:00
Ettore Di Giacinto
752e33f676
Update MCP link in README for accuracy
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-05 18:50:23 +02:00
Ettore Di Giacinto
930553ef60
Update mcp.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-05 18:21:02 +02:00
Ettore Di Giacinto
fc8d5c9198
fix: add strict mode check for no action function ( #6294 )
...
When strict mode is enabled, force the LLM to pick always a tool
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-05 18:13:34 +02:00
Ettore Di Giacinto
60b6472fa0
feat: Add Agentic MCP support with a new chat/completion endpoint ( #6381 )
...
* WIP - add endpoint
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Rename
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Wire the Completion API
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Try to make it functional
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Almost functional
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Bump golang versions used in tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add description of the tool
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Make it working
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Small optimizations
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Cleanup/refactor
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Update docs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-05 17:51:41 +02:00
LocalAI [bot]
6b2c8277c2
chore: ⬆️ Update ggml-org/llama.cpp to 86df2c9ae4f2f1ee63d2558a9dc797b98524639b ( #6382 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-05 08:52:24 +02:00
LocalAI [bot]
6d5d3ebcf6
chore: ⬆️ Update ggml-org/llama.cpp to 128d522c04286e019666bd6ee4d18e3fbf8772e2 ( #6379 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-04 19:00:50 +02:00
LocalAI [bot]
530c174fd3
docs: ⬆️ update docs version mudler/LocalAI ( #6378 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-03 23:26:09 +02:00
Ettore Di Giacinto
8fb95686af
chore(model gallery): add ibm-granite_granite-4.0-micro ( #6376 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-03 10:03:34 +02:00
Ettore Di Giacinto
4132085c01
chore(model gallery): add ibm-granite_granite-4.0-h-micro ( #6375 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-03 09:32:20 +02:00
Ettore Di Giacinto
c14f1ffcfd
chore(model gallery): add ibm-granite_granite-4.0-h-tiny ( #6374 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-03 09:31:00 +02:00
Ettore Di Giacinto
07cca4b69a
chore(model gallery): add ibm-granite_granite-4.0-h-small ( #6373 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-03 09:28:57 +02:00
LocalAI [bot]
dd927c36f6
chore: ⬆️ Update ggml-org/llama.cpp to d64c8104f090b27b1f99e8da5995ffcfa6b726e2 ( #6371 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-02 21:09:00 +00:00
LocalAI [bot]
052f42e926
chore: ⬆️ Update ggml-org/llama.cpp to 1fe4e38cc20af058ed320bd46cac934991190056 ( #6368 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-10-02 16:29:57 +02:00
LocalAI [bot]
30d43588ab
chore: ⬆️ Update ggml-org/whisper.cpp to 7849aff7a2e1f4234aa31b01a1870906d5431959 ( #6367 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-01 21:15:28 +00:00
LocalAI [bot]
d21ec22f74
chore: ⬆️ Update ggml-org/whisper.cpp to 8c0855fd6bb115e113c0dca6255ea05f774d35f7 ( #6365 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-10-01 12:12:27 +02:00
LocalAI [bot]
04fecd634a
chore: ⬆️ Update ggml-org/llama.cpp to b2ba81dbe07b6dbea9c96b13346c66973dede32c ( #6366 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-30 21:13:23 +00:00
LocalAI [bot]
33c14198db
chore: ⬆️ Update ggml-org/llama.cpp to 5f7e166cbf7b9ca928c7fad990098ef32358ac75 ( #6355 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-30 14:41:16 +02:00
LocalAI [bot]
967c2727e3
chore: ⬆️ Update ggml-org/whisper.cpp to 32be14f8ebfc0498c2c619182f0d7f4c822d52c4 ( #6354 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-30 14:40:59 +02:00
dependabot[bot]
f41f30ad92
chore(deps): bump grpcio from 1.74.0 to 1.75.1 in /backend/python/exllama2 ( #6356 )
...
chore(deps): bump grpcio in /backend/python/exllama2
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.74.0 to 1.75.1.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.74.0...v1.75.1 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.75.1
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-30 14:40:41 +02:00
dependabot[bot]
e77340e8a5
chore(deps): bump grpcio from 1.75.0 to 1.75.1 in /backend/python/transformers ( #6362 )
...
chore(deps): bump grpcio in /backend/python/transformers
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.75.0 to 1.75.1.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.75.0...v1.75.1 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.75.1
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-30 14:40:29 +02:00
dependabot[bot]
d51a3090f7
chore(deps): bump grpcio from 1.74.0 to 1.75.1 in /backend/python/bark ( #6359 )
...
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.74.0 to 1.75.1.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.74.0...v1.75.1 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.75.1
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-30 14:40:16 +02:00
dependabot[bot]
1bf3bc932c
chore(deps): bump grpcio from 1.74.0 to 1.75.1 in /backend/python/vllm ( #6357 )
...
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.74.0 to 1.75.1.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.74.0...v1.75.1 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.75.1
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-30 14:40:02 +02:00
dependabot[bot]
564a47da4e
chore(deps): bump grpcio from 1.74.0 to 1.75.1 in /backend/python/common/template ( #6358 )
...
chore(deps): bump grpcio in /backend/python/common/template
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.74.0 to 1.75.1.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.74.0...v1.75.1 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.75.1
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-30 08:52:36 +02:00
dependabot[bot]
c37ee93ff2
chore(deps): bump grpcio from 1.74.0 to 1.75.1 in /backend/python/rerankers ( #6360 )
...
chore(deps): bump grpcio in /backend/python/rerankers
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.74.0 to 1.75.1.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.74.0...v1.75.1 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.75.1
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-30 08:52:25 +02:00
dependabot[bot]
f4b65db4e7
chore(deps): bump grpcio from 1.74.0 to 1.75.1 in /backend/python/diffusers ( #6361 )
...
chore(deps): bump grpcio in /backend/python/diffusers
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.74.0 to 1.75.1.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.74.0...v1.75.1 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.75.1
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-30 08:52:11 +02:00
Ettore Di Giacinto
f5fa8e6649
Revert "chore(deps): bump transformers from 4.48.3 to 4.56.2 in /backend/python/coqui" ( #6363 )
...
Revert "chore(deps): bump transformers from 4.48.3 to 4.56.2 in /backend/pyth…"
This reverts commit 570e39bdcf .
2025-09-30 08:51:49 +02:00
dependabot[bot]
570e39bdcf
chore(deps): bump transformers from 4.48.3 to 4.56.2 in /backend/python/coqui ( #6330 )
...
chore(deps): bump transformers in /backend/python/coqui
Bumps [transformers](https://github.com/huggingface/transformers ) from 4.48.3 to 4.56.2.
- [Release notes](https://github.com/huggingface/transformers/releases )
- [Commits](https://github.com/huggingface/transformers/compare/v4.48.3...v4.56.2 )
---
updated-dependencies:
- dependency-name: transformers
dependency-version: 4.56.2
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-29 21:30:29 +00:00
dependabot[bot]
2ebe37b671
chore(deps): bump grpcio from 1.74.0 to 1.75.1 in /backend/python/coqui ( #6353 )
...
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.74.0 to 1.75.1.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.74.0...v1.75.1 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.75.1
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-29 20:11:55 +00:00
LocalAI [bot]
dca685f784
chore: ⬆️ Update ggml-org/llama.cpp to bd0af02fc96c2057726f33c0f0daf7bb8f3e462a ( #6352 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-28 21:08:50 +00:00
LocalAI [bot]
84ebf2a2c9
chore: ⬆️ Update ggml-org/llama.cpp to 4807e8f96a61b2adccebd5e57444c94d18de7264 ( #6350 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-28 00:33:46 +02:00
Ettore Di Giacinto
ce5662ba90
chore(deps): bump llama.cpp to '72b24d96c6888c609d562779a23787304ae4609c' ( #6349 )
...
* chore(deps): bump llama.cpp to '72b24d96c6888c609d562779a23787304ae4609c'
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Disable OPENSSL (just introduced upstream)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-27 13:55:51 +02:00
Ettore Di Giacinto
9878f27813
chore(deps): bump llama.cpp to '835b2b915c52bcabcd688d025eacff9a07b65f52' ( #6347 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-26 23:26:14 +02:00
jongames
f2b9452ec4
fix: reranking models limited to 512 tokens in llama.cpp backend ( #6344 )
...
Fix reranking models being limited to 512 tokens input in llama.cpp backend
Signed-off-by: JonGames <18472148+jongames@users.noreply.github.com >
2025-09-25 23:32:07 +00:00
Ettore Di Giacinto
585da99c52
chore(models): add whisper-turbo via whisper.cpp ( #6340 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-25 09:15:06 +02:00
Ettore Di Giacinto
fd4f432079
CI: disable build-testing on PRs against arm64 ( #6341 )
...
CI: disable testing on PRs against arm64
Removed configuration for cublas and arm64 platform.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-09-25 09:14:50 +02:00
LocalAI [bot]
238c68c57b
chore: ⬆️ Update ggml-org/llama.cpp to 4ae88d07d026e66b41e85afece74e88af54f4e66 ( #6339 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-25 08:47:02 +02:00
Ettore Di Giacinto
04fbf5cb82
Change build type and update tag suffix in backend.yml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-09-24 22:08:29 +02:00
Ettore Di Giacinto
c85d559919
feat(chatterbox): support multilingual ( #6240 )
...
* feat(chatterbox): support multilingual
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add l4t support
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: switch to fork
Until https://github.com/resemble-ai/chatterbox/pull/295 is merged
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-24 18:37:37 +02:00
Ettore Di Giacinto
b5efc4f89e
chore(cudss): add cudds to l4t images ( #6338 )
...
* chore(cudds): add cudds to l4t images
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* add arm64 to CI tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-24 16:46:24 +02:00
Ettore Di Giacinto
3f9c09a4c5
chore(model gallery): add qwen-image-edit-2509 ( #6336 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-24 10:05:03 +02:00
dependabot[bot]
4a84660475
chore(deps): bump securego/gosec from 2.22.8 to 2.22.9 ( #6324 )
...
Bumps [securego/gosec](https://github.com/securego/gosec ) from 2.22.8 to 2.22.9.
- [Release notes](https://github.com/securego/gosec/releases )
- [Changelog](https://github.com/securego/gosec/blob/master/.goreleaser.yml )
- [Commits](https://github.com/securego/gosec/compare/v2.22.8...v2.22.9 )
---
updated-dependencies:
- dependency-name: securego/gosec
dependency-version: 2.22.9
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-23 08:26:50 +02:00
LocalAI [bot]
737248256e
chore: ⬆️ Update ggml-org/llama.cpp to 1d0125bcf1cbd7195ad0faf826a20bc7cec7d3f4 ( #6335 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-22 21:13:34 +00:00
dependabot[bot]
0ae334fc62
chore(deps): bump grpcio from 1.74.0 to 1.75.0 in /backend/python/transformers ( #6332 )
...
chore(deps): bump grpcio in /backend/python/transformers
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.74.0 to 1.75.0.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.74.0...v1.75.0 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.75.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-22 19:53:35 +00:00
Ettore Di Giacinto
36c373b7c9
feat(kokoro): add support for l4t devices ( #6322 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-22 10:33:26 +02:00
LocalAI [bot]
6afcb932b7
chore: ⬆️ Update ggml-org/llama.cpp to da30ab5f8696cabb2d4620cdc0aa41a298c54fd6 ( #6321 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-21 21:28:27 +00:00
LocalAI [bot]
357bf571a3
docs: ⬆️ update docs version mudler/LocalAI ( #6318 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-21 08:40:00 +02:00
LocalAI [bot]
e74ade9ebb
chore: ⬆️ Update ggml-org/llama.cpp to 7f766929ca8e8e01dcceb1c526ee584f7e5e1408 ( #6319 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-20 21:05:25 +00:00
LocalAI [bot]
f7f26b8efa
docs: ⬆️ update docs version mudler/LocalAI ( #6315 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-20 09:41:58 +02:00
LocalAI [bot]
75eb98f8bd
chore: ⬆️ Update ggml-org/llama.cpp to f432d8d83e7407073634c5e4fd81a3d23a10827f ( #6316 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-20 09:41:45 +02:00
LocalAI [bot]
c337e7baf7
chore: ⬆️ Update ggml-org/whisper.cpp to 44fa2f647cf2a6953493b21ab83b50d5f5dbc483 ( #6317 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-19 21:14:10 +00:00
Ettore Di Giacinto
660bd45be8
fix(python): make option check uniform across backends ( #6314 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-19 19:56:08 +02:00