Andres
b6459ddd57
feat(api): Add transcribe response format request parameter & adjust STT backends ( #8318 )
...
* WIP response format implementation for audio transcriptions
(cherry picked from commit e271dd764bbc13846accf3beb8b6522153aa276f)
Signed-off-by: Andres Smith <andressmithdev@pm.me >
* Rework transcript response_format and add more formats
(cherry picked from commit 6a93a8f63e2ee5726bca2980b0c9cf4ef8b7aeb8)
Signed-off-by: Andres Smith <andressmithdev@pm.me >
* Add test and replace go-openai package with official openai go client
(cherry picked from commit f25d1a04e46526429c89db4c739e1e65942ca893)
Signed-off-by: Andres Smith <andressmithdev@pm.me >
* Fix faster-whisper backend and refactor transcription formatting to also work on CLI
Signed-off-by: Andres Smith <andressmithdev@pm.me >
(cherry picked from commit 69a93977d5e113eb7172bd85a0f918592d3d2168)
Signed-off-by: Andres Smith <andressmithdev@pm.me >
---------
Signed-off-by: Andres Smith <andressmithdev@pm.me >
Co-authored-by: nanoandrew4 <nanoandrew4@gmail.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-02-01 17:33:17 +01:00
LocalAI [bot]
3445415b3d
chore: ⬆️ Update ggml-org/llama.cpp to 41ea26144e55d23f37bb765f88c07588d786567f ( #8317 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-31 21:18:31 +00:00
LocalAI [bot]
b05e110aa6
chore: ⬆️ Update ggml-org/llama.cpp to 1488339138d609139c4400d1b80f8a5b1a9a203c ( #8306 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-31 08:59:09 +01:00
LocalAI [bot]
e69cba2444
chore: ⬆️ Update ggml-org/whisper.cpp to aa1bc0d1a6dfd70dbb9f60c11df12441e03a9075 ( #8305 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-31 08:58:54 +01:00
Ettore Di Giacinto
68dd9765a0
feat(tts): add support for streaming mode ( #8291 )
...
* feat(tts): add support for streaming mode
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Send first audio, make sure it's 16
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-30 11:58:01 +01:00
LocalAI [bot]
2c44b06a67
chore: ⬆️ Update ggml-org/llama.cpp to 4fdbc1e4dba428ce0cf9d2ac22232dc170bbca82 ( #8283 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-29 23:43:29 +01:00
Ettore Di Giacinto
1e08e02598
feat(qwen-asr): add support to qwen-asr ( #8281 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-29 21:50:35 +01:00
Ettore Di Giacinto
48e08772f3
chore(llama.cpp): bump to 'f6b533d898ce84bae8d9fa8dfc6697ac087800bf' ( #8275 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-29 00:22:25 +01:00
LocalAI [bot]
c28c0227c6
chore: ⬆️ Update leejet/stable-diffusion.cpp to e411520407663e1ddf8ff2e5ed4ff3a116fbbc97 ( #8274 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-28 21:23:05 +00:00
Ettore Di Giacinto
9b973b79f6
feat: add VoxCPM tts backend ( #8109 )
...
* feat: add VoxCPM tts backend
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Disable voxcpm on arm64 cpu
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-28 14:44:04 +01:00
Ettore Di Giacinto
cba8ef4e38
chore: fix backend icons
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-28 09:09:00 +01:00
Ettore Di Giacinto
f729e300d6
chore: fix backend icons
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-28 09:08:03 +01:00
LocalAI [bot]
9916811a79
chore: ⬆️ Update ggml-org/llama.cpp to 2b4cbd2834e427024bc7f935a1f232aecac6679b ( #8258 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-28 08:50:16 +01:00
Ettore Di Giacinto
ec1598868b
feat(vibevoice): add ASR support ( #8222 )
...
* feat(vibevoice): add ASR support
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(tests): download voice files
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Small fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Small fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Try to run on bigger runner
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* debug
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* CI can't hold vibevoice
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-27 20:19:22 +01:00
LocalAI [bot]
3c1f823c47
chore: ⬆️ Update ggml-org/llama.cpp to 8f80d1b254aef70a0959e314be368d05debe7294 ( #8229 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-26 21:19:43 +00:00
LocalAI [bot]
f76958d761
chore: ⬆️ Update ggml-org/llama.cpp to 0440bfd1605333726ea0fb7a836942660bf2f9a6 ( #8216 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-26 00:50:35 +01:00
LocalAI [bot]
2bd5ca45de
chore: ⬆️ Update leejet/stable-diffusion.cpp to 43e829f21966abb96b08c712bccee872dc820914 ( #8215 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-26 00:50:16 +01:00
Ettore Di Giacinto
26a374b717
chore: drop bark which is unmaintained ( #8207 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-25 09:26:40 +01:00
Ettore Di Giacinto
b2a8a63899
feat(vllm-omni): add new backend ( #8188 )
...
* feat(vllm-omni: add new backend
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* default to py3.12
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-24 22:23:30 +01:00
LocalAI [bot]
05a332cd5f
chore: ⬆️ Update ggml-org/llama.cpp to bb02f74c612064947e51d23269a1cf810b67c9a7 ( #8196 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-24 21:19:43 +00:00
Ettore Di Giacinto
05904c77f5
chore(exllama): drop backend now almost deprecated ( #8186 )
...
exllama2 development has stalled and only old architectures are
supported. exllamav3 is still in development, meanwhile cleaning up
exllama2 from the gallery.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-24 08:57:37 +01:00
LocalAI [bot]
17783fa7d9
chore: ⬆️ Update leejet/stable-diffusion.cpp to fa61ea744d1a87fa26a63f8a86e45587bc9534d6 ( #8184 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-24 08:57:24 +01:00
LocalAI [bot]
4019094111
chore: ⬆️ Update ggml-org/llama.cpp to 557515be1e93ed8939dd8a7c7d08765fdbe8be31 ( #8183 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-24 08:57:08 +01:00
Ettore Di Giacinto
a36960e069
fix(qwen-tts): change icon URL in index.yaml
...
Updated the icon URL for the project.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-23 22:00:14 +01:00
Ettore Di Giacinto
58bb6a29ed
Revert "chore(deps): bump torch from 2.4.1 to 2.7.1+xpu in /backend/python/bark in the pip group across 1 directory" ( #8180 )
...
Revert "chore(deps): bump torch from 2.4.1 to 2.7.1+xpu in /backend/python/ba…"
This reverts commit 5881c82413 .
2026-01-23 17:25:04 +01:00
dependabot[bot]
5881c82413
chore(deps): bump torch from 2.4.1 to 2.7.1+xpu in /backend/python/bark in the pip group across 1 directory ( #8175 )
...
chore(deps): bump torch
Bumps the pip group with 1 update in the /backend/python/bark directory: torch.
Updates `torch` from 2.4.1 to 2.7.1+xpu
---
updated-dependencies:
- dependency-name: torch
dependency-version: 2.7.1+xpu
dependency-type: direct:production
dependency-group: pip
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-23 15:32:15 +00:00
Ettore Di Giacinto
923ebbb344
feat(qwen-tts): add Qwen-tts backend ( #8163 )
...
* feat(qwen-tts): add Qwen-tts backend
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Update intel deps
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Drop flash-attn for cuda13
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-23 15:18:41 +01:00
LocalAI [bot]
552c62a19c
chore: ⬆️ Update leejet/stable-diffusion.cpp to 5e4579c11d0678f9765463582d024e58270faa9c ( #8166 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-23 08:18:05 +01:00
Ettore Di Giacinto
c0b21a921b
feat: detect thinking support from backend automatically if not explicitly set ( #8167 )
...
detect thinking support from backend automatically if not explicitly set
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-23 00:38:28 +01:00
LocalAI [bot]
b10045adc2
chore: ⬆️ Update ggml-org/llama.cpp to a5eaa1d6a3732bc0f460b02b61c95680bba5a012 ( #8165 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-22 23:32:05 +00:00
Ettore Di Giacinto
0fa0ac4797
fix(videogen): drop incomplete endpoint, add GGUF support for LTX-2 ( #8160 )
...
* Debug
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Drop openai video endpoint (is not complete)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add download button
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-22 14:09:20 +01:00
LocalAI [bot]
c12b310028
chore: ⬆️ Update ggml-org/llama.cpp to c301172f660a1fe0b42023da990bf7385d69adb4 ( #8151 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-22 00:51:22 +01:00
LocalAI [bot]
0447d5564d
chore: ⬆️ Update leejet/stable-diffusion.cpp to 329571131d62d64a4f49e1acbef49ae02544fdcd ( #8152 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-22 00:50:41 +01:00
Ettore Di Giacinto
22c0eb5421
chore(diffusers): add 'av' to requirements.txt ( #8155 )
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2026-01-21 22:35:00 +01:00
LocalAI [bot]
d01c335cf6
chore: ⬆️ Update ggml-org/whisper.cpp to 7aa8818647303b567c3a21fe4220b2681988e220 ( #8146 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-21 17:44:01 +01:00
LocalAI [bot]
5687df4535
chore: ⬆️ Update ggml-org/llama.cpp to ad8d85bd94cc86e89d23407bdebf98f2e6510c61 ( #8145 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-21 15:41:36 +00:00
Ettore Di Giacinto
f6daaa7c35
chore(deps): Bump llama.cpp to '1c7cf94b22a9dc6b1d32422f72a627787a4783a3' ( #8136 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-21 00:12:13 +01:00
LocalAI [bot]
d3525b7509
chore: ⬆️ Update ggml-org/llama.cpp to 959ecf7f234dc0bc0cd6829b25cb0ee1481aa78a ( #8122 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-19 22:50:47 +01:00
LocalAI [bot]
c8aa821e0e
chore: ⬆️ Update leejet/stable-diffusion.cpp to a48b4a3ade9972faf0adcad47e51c6fc03f0e46d ( #8121 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-19 22:27:46 +01:00
LocalAI [bot]
8845186955
chore: ⬆️ Update leejet/stable-diffusion.cpp to 2efd19978dd4164e387bf226025c9666b6ef35e2 ( #8099 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-18 22:40:35 +01:00
LocalAI [bot]
ab8ed24358
chore: ⬆️ Update ggml-org/llama.cpp to 287a33017b32600bfc0e81feeb0ad6e81e0dd484 ( #8100 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-18 22:40:14 +01:00
Ettore Di Giacinto
5f403b1631
chore: drop neutts for l4t ( #8101 )
...
Builds exhausts CI currently, and there are better backends at this
point in time. We will probably deprecate it in the future.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-01-18 21:55:56 +01:00
LocalAI [bot]
16a18a2e55
chore: ⬆️ Update leejet/stable-diffusion.cpp to 9565c7f6bd5fcff124c589147b2621244f2c4aa1 ( #8086 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-17 22:12:21 +01:00
LocalAI [bot]
1cd33047b4
chore: ⬆️ Update ggml-org/llama.cpp to 2fbde785bc106ae1c4102b0e82b9b41d9c466579 ( #8087 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-17 21:10:18 +00:00
LocalAI [bot]
5fe9bf9f84
chore: ⬆️ Update ggml-org/whisper.cpp to f53dc74843e97f19f94a79241357f74ad5b691a6 ( #8074 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-17 08:32:53 +01:00
LocalAI [bot]
d4fd0c0609
chore: ⬆️ Update ggml-org/llama.cpp to 388ce822415f24c60fcf164a321455f1e008cafb ( #8073 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-16 21:22:33 +00:00
Ettore Di Giacinto
d16722ee13
Revert "chore(deps): bump torch from 2.3.1+cxx11.abi to 2.8.0 in /backend/python/rerankers in the pip group across 1 directory" ( #8072 )
...
Revert "chore(deps): bump torch from 2.3.1+cxx11.abi to 2.8.0 in /backend/pyt…"
This reverts commit 1f10ab39a9 .
2026-01-16 20:50:33 +01:00
dependabot[bot]
1f10ab39a9
chore(deps): bump torch from 2.3.1+cxx11.abi to 2.8.0 in /backend/python/rerankers in the pip group across 1 directory ( #8066 )
...
chore(deps): bump torch
Bumps the pip group with 1 update in the /backend/python/rerankers directory: [torch](https://github.com/pytorch/pytorch ).
Updates `torch` from 2.3.1+cxx11.abi to 2.8.0
- [Release notes](https://github.com/pytorch/pytorch/releases )
- [Changelog](https://github.com/pytorch/pytorch/blob/main/RELEASE.md )
- [Commits](https://github.com/pytorch/pytorch/commits/v2.8.0 )
---
updated-dependencies:
- dependency-name: torch
dependency-version: 2.8.0
dependency-type: direct:production
dependency-group: pip
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2026-01-16 19:38:12 +00:00
LocalAI [bot]
cb8616c7d1
chore: ⬆️ Update ggml-org/llama.cpp to 785a71008573e2d84728fb0ba9e851d72d3f8fab ( #8053 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-15 22:53:17 +01:00
LocalAI [bot]
ff31d50488
chore: ⬆️ Update ggml-org/whisper.cpp to 2eeeba56e9edd762b4b38467bab96c2517163158 ( #8052 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2026-01-15 22:52:56 +01:00