LocalAI [bot]
05a332cd5f
chore: ⬆️ Update ggml-org/llama.cpp to bb02f74c612064947e51d23269a1cf810b67c9a7 ( #8196 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-24 21:19:43 +00:00
Ettore Di Giacinto
05904c77f5
chore(exllama): drop backend now almost deprecated ( #8186 )
...
exllama2 development has stalled and only old architectures are
supported. exllamav3 is still in development, meanwhile cleaning up
exllama2 from the gallery.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-24 08:57:37 +01:00
LocalAI [bot]
17783fa7d9
chore: ⬆️ Update leejet/stable-diffusion.cpp to fa61ea744d1a87fa26a63f8a86e45587bc9534d6 ( #8184 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-24 08:57:24 +01:00
LocalAI [bot]
4019094111
chore: ⬆️ Update ggml-org/llama.cpp to 557515be1e93ed8939dd8a7c7d08765fdbe8be31 ( #8183 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-24 08:57:08 +01:00
Ettore Di Giacinto
ca65fc751a
chore(model gallery): add qwen3-tts to model gallery ( #8187 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-23 23:06:50 +01:00
LocalAI [bot]
a1e3acc590
docs: ⬆️ update docs version mudler/LocalAI ( #8182 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-23 22:03:47 +01:00
Ettore Di Giacinto
a36960e069
fix(qwen-tts): change icon URL in index.yaml
...
Updated the icon URL for the project.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-23 22:00:14 +01:00
Ettore Di Giacinto
58bb6a29ed
Revert "chore(deps): bump torch from 2.4.1 to 2.7.1+xpu in /backend/python/bark in the pip group across 1 directory" ( #8180 )
...
Revert "chore(deps): bump torch from 2.4.1 to 2.7.1+xpu in /backend/python/ba…"
This reverts commit 5881c82413 .
2026-01-23 17:25:04 +01:00
dependabot[bot]
5881c82413
chore(deps): bump torch from 2.4.1 to 2.7.1+xpu in /backend/python/bark in the pip group across 1 directory ( #8175 )
...
chore(deps): bump torch
Bumps the pip group with 1 update in the /backend/python/bark directory: torch.
Updates `torch` from 2.4.1 to 2.7.1+xpu
---
updated-dependencies:
- dependency-name: torch
dependency-version: 2.7.1+xpu
dependency-type: direct:production
dependency-group: pip
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-23 15:32:15 +00:00
Ettore Di Giacinto
923ebbb344
feat(qwen-tts): add Qwen-tts backend ( #8163 )
...
* feat(qwen-tts): add Qwen-tts backend
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Update intel deps
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Drop flash-attn for cuda13
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
v3.10.1
2026-01-23 15:18:41 +01:00
LocalAI [bot]
ea51567b89
chore(model gallery): 🤖 add 1 new models via gallery agent ( #8170 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-23 08:19:39 +01:00
LocalAI [bot]
552c62a19c
chore: ⬆️ Update leejet/stable-diffusion.cpp to 5e4579c11d0678f9765463582d024e58270faa9c ( #8166 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-23 08:18:05 +01:00
Ettore Di Giacinto
c0b21a921b
feat: detect thinking support from backend automatically if not explicitly set ( #8167 )
...
detect thinking support from backend automatically if not explicitly set
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-23 00:38:28 +01:00
LocalAI [bot]
b10045adc2
chore: ⬆️ Update ggml-org/llama.cpp to a5eaa1d6a3732bc0f460b02b61c95680bba5a012 ( #8165 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-22 23:32:05 +00:00
Ettore Di Giacinto
61b5e3b629
chore: drop test file
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-22 22:19:38 +00:00
Ettore Di Giacinto
e35d7cb3b3
chore: drop test file
...
the function now was removed
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-22 21:47:52 +00:00
Ettore Di Giacinto
0fa0ac4797
fix(videogen): drop incomplete endpoint, add GGUF support for LTX-2 ( #8160 )
...
* Debug
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Drop openai video endpoint (is not complete)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add download button
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-22 14:09:20 +01:00
LocalAI [bot]
be7ed85838
chore(model gallery): 🤖 add 1 new models via gallery agent ( #8157 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-22 08:25:40 +01:00
LocalAI [bot]
c12b310028
chore: ⬆️ Update ggml-org/llama.cpp to c301172f660a1fe0b42023da990bf7385d69adb4 ( #8151 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-22 00:51:22 +01:00
LocalAI [bot]
0447d5564d
chore: ⬆️ Update leejet/stable-diffusion.cpp to 329571131d62d64a4f49e1acbef49ae02544fdcd ( #8152 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-22 00:50:41 +01:00
Ettore Di Giacinto
22c0eb5421
chore(diffusers): add 'av' to requirements.txt ( #8155 )
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-21 22:35:00 +01:00
LocalAI [bot]
a0a00fb937
chore(model-gallery): ⬆️ update checksum ( #8153 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-21 21:45:11 +01:00
LocalAI [bot]
6dd44742ea
feat(swagger): update swagger ( #8150 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-21 21:44:44 +01:00
Richard Palethorpe
00c72e7d3e
fix(tracing): Create trace buffer on first request to enable tracing at runtime ( #8148 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2026-01-21 18:39:39 +01:00
LocalAI [bot]
d01c335cf6
chore: ⬆️ Update ggml-org/whisper.cpp to 7aa8818647303b567c3a21fe4220b2681988e220 ( #8146 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-21 17:44:01 +01:00
LocalAI [bot]
5687df4535
chore: ⬆️ Update ggml-org/llama.cpp to ad8d85bd94cc86e89d23407bdebf98f2e6510c61 ( #8145 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-21 15:41:36 +00:00
Ettore Di Giacinto
f5fade97e6
chore: drop noisy logs ( #8142 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-21 09:52:20 +01:00
Ettore Di Giacinto
b88ae31e4e
chore(model gallery): add flux 2 and flux 2 klein ( #8141 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-21 09:46:33 +01:00
Ettore Di Giacinto
f6daaa7c35
chore(deps): Bump llama.cpp to '1c7cf94b22a9dc6b1d32422f72a627787a4783a3' ( #8136 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-21 00:12:13 +01:00
Ettore Di Giacinto
c491c6ca90
feat(openresponses): Support reasoning blocks ( #8133 )
...
* feat(openresponses): support reasoning blocks
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* allow to disable reasoning, refactor common logic
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add option to only strip reasoning
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add configurations for custom reasoning tokens
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-21 00:11:45 +01:00
Ettore Di Giacinto
34e054f607
fix(reasoning): support models with reasoning without starting thinking tag ( #8132 )
...
* chore: extract reasoning to its own package
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* make sure we detect thinking tokens from template
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Allow to override via config, add tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-20 21:07:59 +01:00
LocalAI [bot]
e886bb291a
chore(model gallery): 🤖 add 1 new models via gallery agent ( #8128 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-20 12:58:29 +01:00
Ettore Di Giacinto
4bf2f8bbd8
chore(docs): update docs with Anthropic API and openresponses
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-20 09:25:24 +01:00
LocalAI [bot]
d3525b7509
chore: ⬆️ Update ggml-org/llama.cpp to 959ecf7f234dc0bc0cd6829b25cb0ee1481aa78a ( #8122 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-19 22:50:47 +01:00
LocalAI [bot]
c8aa821e0e
chore: ⬆️ Update leejet/stable-diffusion.cpp to a48b4a3ade9972faf0adcad47e51c6fc03f0e46d ( #8121 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-19 22:27:46 +01:00
dependabot[bot]
b3191927ae
chore(deps): bump github.com/mudler/cogito from 0.7.2 to 0.8.1 ( #8124 )
...
Bumps [github.com/mudler/cogito](https://github.com/mudler/cogito ) from 0.7.2 to 0.8.1.
- [Release notes](https://github.com/mudler/cogito/releases )
- [Commits](https://github.com/mudler/cogito/compare/v0.7.2...v0.8.1 )
---
updated-dependencies:
- dependency-name: github.com/mudler/cogito
dependency-version: 0.8.1
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-19 22:26:26 +01:00
LocalAI [bot]
54c5a2d9ea
docs: ⬆️ update docs version mudler/LocalAI ( #8120 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-19 21:18:24 +00:00
Ettore Di Giacinto
0279591fec
Enable reranking for Qwen3-VL-Reranker-8B
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-19 15:28:58 +01:00
LocalAI [bot]
8845186955
chore: ⬆️ Update leejet/stable-diffusion.cpp to 2efd19978dd4164e387bf226025c9666b6ef35e2 ( #8099 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-18 22:40:35 +01:00
LocalAI [bot]
ab8ed24358
chore: ⬆️ Update ggml-org/llama.cpp to 287a33017b32600bfc0e81feeb0ad6e81e0dd484 ( #8100 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-18 22:40:14 +01:00
LocalAI [bot]
a021df5a88
feat(swagger): update swagger ( #8098 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-18 22:10:06 +01:00
Ettore Di Giacinto
5f403b1631
chore: drop neutts for l4t ( #8101 )
...
Builds exhausts CI currently, and there are better backends at this
point in time. We will probably deprecate it in the future.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
v3.10.0
2026-01-18 21:55:56 +01:00
rampa3
897ad1729e
chore(model gallery): add qwen3-coder-30b-a3b-instruct based on model request ( #8082 )
...
* chore(model gallery): add qwen3-coder-30b-a3b-instruct based on model request
Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com >
* added missing model config import URL
Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com >
---------
Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com >
2026-01-18 09:23:07 +01:00
LocalAI [bot]
16a18a2e55
chore: ⬆️ Update leejet/stable-diffusion.cpp to 9565c7f6bd5fcff124c589147b2621244f2c4aa1 ( #8086 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-17 22:12:21 +01:00
Ettore Di Giacinto
3387bfaee0
feat(api): add support for open responses specification ( #8063 )
...
* feat: openresponses
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add ttl settings, fix tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: register cors middleware by default
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* satisfy schema
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Logitbias and logprobs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add grammar
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* SSE compliance
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* tool JSON conversion
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* support background mode
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* swagger
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* drop code. This is handled in the handler
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Small refactorings
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* background mode for MCP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-17 22:11:47 +01:00
LocalAI [bot]
1cd33047b4
chore: ⬆️ Update ggml-org/llama.cpp to 2fbde785bc106ae1c4102b0e82b9b41d9c466579 ( #8087 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-17 21:10:18 +00:00
Ettore Di Giacinto
1de045311a
chore(ui): add video generation link ( #8079 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-17 09:49:47 +01:00
LocalAI [bot]
5fe9bf9f84
chore: ⬆️ Update ggml-org/whisper.cpp to f53dc74843e97f19f94a79241357f74ad5b691a6 ( #8074 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-17 08:32:53 +01:00
LocalAI [bot]
d4fd0c0609
chore: ⬆️ Update ggml-org/llama.cpp to 388ce822415f24c60fcf164a321455f1e008cafb ( #8073 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-16 21:22:33 +00:00
Ettore Di Giacinto
d16722ee13
Revert "chore(deps): bump torch from 2.3.1+cxx11.abi to 2.8.0 in /backend/python/rerankers in the pip group across 1 directory" ( #8072 )
...
Revert "chore(deps): bump torch from 2.3.1+cxx11.abi to 2.8.0 in /backend/pyt…"
This reverts commit 1f10ab39a9 .
2026-01-16 20:50:33 +01:00