Ettore Di Giacinto
0fa0ac4797
fix(videogen): drop incomplete endpoint, add GGUF support for LTX-2 ( #8160 )
...
* Debug
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Drop openai video endpoint (is not complete)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add download button
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-22 14:09:20 +01:00
LocalAI [bot]
be7ed85838
chore(model gallery): 🤖 add 1 new models via gallery agent ( #8157 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-22 08:25:40 +01:00
LocalAI [bot]
c12b310028
chore: ⬆️ Update ggml-org/llama.cpp to c301172f660a1fe0b42023da990bf7385d69adb4 ( #8151 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-22 00:51:22 +01:00
LocalAI [bot]
0447d5564d
chore: ⬆️ Update leejet/stable-diffusion.cpp to 329571131d62d64a4f49e1acbef49ae02544fdcd ( #8152 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-22 00:50:41 +01:00
Ettore Di Giacinto
22c0eb5421
chore(diffusers): add 'av' to requirements.txt ( #8155 )
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-21 22:35:00 +01:00
LocalAI [bot]
a0a00fb937
chore(model-gallery): ⬆️ update checksum ( #8153 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-21 21:45:11 +01:00
LocalAI [bot]
6dd44742ea
feat(swagger): update swagger ( #8150 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-21 21:44:44 +01:00
Richard Palethorpe
00c72e7d3e
fix(tracing): Create trace buffer on first request to enable tracing at runtime ( #8148 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2026-01-21 18:39:39 +01:00
LocalAI [bot]
d01c335cf6
chore: ⬆️ Update ggml-org/whisper.cpp to 7aa8818647303b567c3a21fe4220b2681988e220 ( #8146 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-21 17:44:01 +01:00
LocalAI [bot]
5687df4535
chore: ⬆️ Update ggml-org/llama.cpp to ad8d85bd94cc86e89d23407bdebf98f2e6510c61 ( #8145 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-21 15:41:36 +00:00
Ettore Di Giacinto
f5fade97e6
chore: drop noisy logs ( #8142 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-21 09:52:20 +01:00
Ettore Di Giacinto
b88ae31e4e
chore(model gallery): add flux 2 and flux 2 klein ( #8141 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-21 09:46:33 +01:00
Ettore Di Giacinto
f6daaa7c35
chore(deps): Bump llama.cpp to '1c7cf94b22a9dc6b1d32422f72a627787a4783a3' ( #8136 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-21 00:12:13 +01:00
Ettore Di Giacinto
c491c6ca90
feat(openresponses): Support reasoning blocks ( #8133 )
...
* feat(openresponses): support reasoning blocks
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* allow to disable reasoning, refactor common logic
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add option to only strip reasoning
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add configurations for custom reasoning tokens
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-21 00:11:45 +01:00
Ettore Di Giacinto
34e054f607
fix(reasoning): support models with reasoning without starting thinking tag ( #8132 )
...
* chore: extract reasoning to its own package
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* make sure we detect thinking tokens from template
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Allow to override via config, add tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-20 21:07:59 +01:00
LocalAI [bot]
e886bb291a
chore(model gallery): 🤖 add 1 new models via gallery agent ( #8128 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-20 12:58:29 +01:00
Ettore Di Giacinto
4bf2f8bbd8
chore(docs): update docs with Anthropic API and openresponses
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-20 09:25:24 +01:00
LocalAI [bot]
d3525b7509
chore: ⬆️ Update ggml-org/llama.cpp to 959ecf7f234dc0bc0cd6829b25cb0ee1481aa78a ( #8122 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-19 22:50:47 +01:00
LocalAI [bot]
c8aa821e0e
chore: ⬆️ Update leejet/stable-diffusion.cpp to a48b4a3ade9972faf0adcad47e51c6fc03f0e46d ( #8121 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-19 22:27:46 +01:00
dependabot[bot]
b3191927ae
chore(deps): bump github.com/mudler/cogito from 0.7.2 to 0.8.1 ( #8124 )
...
Bumps [github.com/mudler/cogito](https://github.com/mudler/cogito ) from 0.7.2 to 0.8.1.
- [Release notes](https://github.com/mudler/cogito/releases )
- [Commits](https://github.com/mudler/cogito/compare/v0.7.2...v0.8.1 )
---
updated-dependencies:
- dependency-name: github.com/mudler/cogito
dependency-version: 0.8.1
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-19 22:26:26 +01:00
LocalAI [bot]
54c5a2d9ea
docs: ⬆️ update docs version mudler/LocalAI ( #8120 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-19 21:18:24 +00:00
Ettore Di Giacinto
0279591fec
Enable reranking for Qwen3-VL-Reranker-8B
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-19 15:28:58 +01:00
LocalAI [bot]
8845186955
chore: ⬆️ Update leejet/stable-diffusion.cpp to 2efd19978dd4164e387bf226025c9666b6ef35e2 ( #8099 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-18 22:40:35 +01:00
LocalAI [bot]
ab8ed24358
chore: ⬆️ Update ggml-org/llama.cpp to 287a33017b32600bfc0e81feeb0ad6e81e0dd484 ( #8100 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-18 22:40:14 +01:00
LocalAI [bot]
a021df5a88
feat(swagger): update swagger ( #8098 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-18 22:10:06 +01:00
Ettore Di Giacinto
5f403b1631
chore: drop neutts for l4t ( #8101 )
...
Builds exhausts CI currently, and there are better backends at this
point in time. We will probably deprecate it in the future.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
v3.10.0
2026-01-18 21:55:56 +01:00
rampa3
897ad1729e
chore(model gallery): add qwen3-coder-30b-a3b-instruct based on model request ( #8082 )
...
* chore(model gallery): add qwen3-coder-30b-a3b-instruct based on model request
Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com >
* added missing model config import URL
Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com >
---------
Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com >
2026-01-18 09:23:07 +01:00
LocalAI [bot]
16a18a2e55
chore: ⬆️ Update leejet/stable-diffusion.cpp to 9565c7f6bd5fcff124c589147b2621244f2c4aa1 ( #8086 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-17 22:12:21 +01:00
Ettore Di Giacinto
3387bfaee0
feat(api): add support for open responses specification ( #8063 )
...
* feat: openresponses
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add ttl settings, fix tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: register cors middleware by default
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* satisfy schema
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Logitbias and logprobs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add grammar
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* SSE compliance
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* tool JSON conversion
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* support background mode
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* swagger
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* drop code. This is handled in the handler
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Small refactorings
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* background mode for MCP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-17 22:11:47 +01:00
LocalAI [bot]
1cd33047b4
chore: ⬆️ Update ggml-org/llama.cpp to 2fbde785bc106ae1c4102b0e82b9b41d9c466579 ( #8087 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-17 21:10:18 +00:00
Ettore Di Giacinto
1de045311a
chore(ui): add video generation link ( #8079 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-17 09:49:47 +01:00
LocalAI [bot]
5fe9bf9f84
chore: ⬆️ Update ggml-org/whisper.cpp to f53dc74843e97f19f94a79241357f74ad5b691a6 ( #8074 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-17 08:32:53 +01:00
LocalAI [bot]
d4fd0c0609
chore: ⬆️ Update ggml-org/llama.cpp to 388ce822415f24c60fcf164a321455f1e008cafb ( #8073 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-16 21:22:33 +00:00
Ettore Di Giacinto
d16722ee13
Revert "chore(deps): bump torch from 2.3.1+cxx11.abi to 2.8.0 in /backend/python/rerankers in the pip group across 1 directory" ( #8072 )
...
Revert "chore(deps): bump torch from 2.3.1+cxx11.abi to 2.8.0 in /backend/pyt…"
This reverts commit 1f10ab39a9 .
2026-01-16 20:50:33 +01:00
dependabot[bot]
1f10ab39a9
chore(deps): bump torch from 2.3.1+cxx11.abi to 2.8.0 in /backend/python/rerankers in the pip group across 1 directory ( #8066 )
...
chore(deps): bump torch
Bumps the pip group with 1 update in the /backend/python/rerankers directory: [torch](https://github.com/pytorch/pytorch ).
Updates `torch` from 2.3.1+cxx11.abi to 2.8.0
- [Release notes](https://github.com/pytorch/pytorch/releases )
- [Changelog](https://github.com/pytorch/pytorch/blob/main/RELEASE.md )
- [Commits](https://github.com/pytorch/pytorch/commits/v2.8.0 )
---
updated-dependencies:
- dependency-name: torch
dependency-version: 2.8.0
dependency-type: direct:production
dependency-group: pip
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-16 19:38:12 +00:00
Ettore Di Giacinto
4d36e393d1
fix(ci): use more beefy runner for expensive jobs ( #8065 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-16 19:26:40 +01:00
LocalAI [bot]
cb8616c7d1
chore: ⬆️ Update ggml-org/llama.cpp to 785a71008573e2d84728fb0ba9e851d72d3f8fab ( #8053 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-15 22:53:17 +01:00
LocalAI [bot]
ff31d50488
chore: ⬆️ Update ggml-org/whisper.cpp to 2eeeba56e9edd762b4b38467bab96c2517163158 ( #8052 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-15 22:52:56 +01:00
Divyanshupandey007
1a50717e33
fix: reduce log verbosity for /api/operations polling ( #8050 )
...
* fix: reduce log verbosity for /api/operations polling
Reduces log clutter by changing the log level from INFO to DEBUG for successful (200 OK) /api/operations requests. This endpoint is polled frequently by the Web UI, causing log spam. Fixes #7989 .
* fix: reduce log verbosity for /api/operations polling
Reduces log clutter by changing the log level from INFO to DEBUG for successful (200 OK) /api/operations requests. This endpoint is polled frequently by the Web UI, causing log spam. Fixes #7989 .
2026-01-15 21:13:13 +01:00
LocalAI [bot]
49d6305509
chore: ⬆️ Update ggml-org/llama.cpp to d98b548120eecf98f0f6eaa1ba7e29b3afda9f2e ( #8040 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-15 08:39:46 +01:00
Ettore Di Giacinto
d20a113aef
fix(functions): do not duplicate function when valid JSON is inside XML tags ( #8043 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-14 23:42:00 +01:00
LocalAI [bot]
cbaa793520
chore: ⬆️ Update ggml-org/whisper.cpp to 47af2fb70f7e4ee1ba40c8bed513760fdfe7a704 ( #8039 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-14 22:12:32 +01:00
Ettore Di Giacinto
6fe3fc880f
Update section headers in README.md for clarity
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-14 22:11:58 +01:00
Ettore Di Giacinto
752e641c48
Clarify Docker usage in README
...
Updated Docker section in README to clarify usage.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-14 22:10:59 +01:00
Ettore Di Giacinto
44d78b4d15
chore(doc): put alert on install.sh until is fixed ( #8042 )
...
See: https://github.com/mudler/LocalAI/issues/8032
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-14 22:08:48 +01:00
Ettore Di Giacinto
64d0a96ba3
feat(ui): add video gen UI ( #8020 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-14 11:43:32 +01:00
Ettore Di Giacinto
b19afc9e64
feat(diffusers): add support to LTX-2 ( #8019 )
...
* feat(diffusers): add support to LTX-2
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add to the gallery
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-14 09:07:30 +01:00
LocalAI [bot]
d6e698876b
chore: ⬆️ Update ggml-org/llama.cpp to e4832e3ae4d58ac0ecbdbf4ae055424d6e628c9f ( #8015 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-14 08:09:37 +01:00
LocalAI [bot]
8962205546
chore: ⬆️ Update ggml-org/whisper.cpp to a96310871a3b294f026c3bcad4e715d17b5905fe ( #8014 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-14 08:09:00 +01:00
LocalAI [bot]
eddc460118
chore: ⬆️ Update leejet/stable-diffusion.cpp to 7010bb4dff7bd55b03d35ef9772142c21699eba9 ( #8013 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-14 08:08:31 +01:00