LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-06-22 15:49:12 -04:00

Author	SHA1	Message	Date
LocalAI [bot]	05a332cd5f	chore: ⬆️ Update ggml-org/llama.cpp to `bb02f74c612064947e51d23269a1cf810b67c9a7` (#8196 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-24 21:19:43 +00:00
Ettore Di Giacinto	05904c77f5	chore(exllama): drop backend now almost deprecated (#8186 ) exllama2 development has stalled and only old architectures are supported. exllamav3 is still in development, meanwhile cleaning up exllama2 from the gallery. Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-24 08:57:37 +01:00
LocalAI [bot]	17783fa7d9	chore: ⬆️ Update leejet/stable-diffusion.cpp to `fa61ea744d1a87fa26a63f8a86e45587bc9534d6` (#8184 ) ⬆️ Update leejet/stable-diffusion.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-24 08:57:24 +01:00
LocalAI [bot]	4019094111	chore: ⬆️ Update ggml-org/llama.cpp to `557515be1e93ed8939dd8a7c7d08765fdbe8be31` (#8183 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-24 08:57:08 +01:00
Ettore Di Giacinto	ca65fc751a	chore(model gallery): add qwen3-tts to model gallery (#8187 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-23 23:06:50 +01:00
LocalAI [bot]	a1e3acc590	docs: ⬆️ update docs version mudler/LocalAI (#8182 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-23 22:03:47 +01:00
Ettore Di Giacinto	a36960e069	fix(qwen-tts): change icon URL in index.yaml Updated the icon URL for the project. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-01-23 22:00:14 +01:00
Ettore Di Giacinto	58bb6a29ed	Revert "chore(deps): bump torch from 2.4.1 to 2.7.1+xpu in /backend/python/bark in the pip group across 1 directory" (#8180 ) Revert "chore(deps): bump torch from 2.4.1 to 2.7.1+xpu in /backend/python/ba…" This reverts commit `5881c82413`.	2026-01-23 17:25:04 +01:00
dependabot[bot]	5881c82413	chore(deps): bump torch from 2.4.1 to 2.7.1+xpu in /backend/python/bark in the pip group across 1 directory (#8175 ) chore(deps): bump torch Bumps the pip group with 1 update in the /backend/python/bark directory: torch. Updates `torch` from 2.4.1 to 2.7.1+xpu --- updated-dependencies: - dependency-name: torch dependency-version: 2.7.1+xpu dependency-type: direct:production dependency-group: pip ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-01-23 15:32:15 +00:00
Ettore Di Giacinto	923ebbb344	feat(qwen-tts): add Qwen-tts backend (#8163 ) * feat(qwen-tts): add Qwen-tts backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update intel deps Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop flash-attn for cuda13 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> (tag: v3.10.1)	2026-01-23 15:18:41 +01:00
LocalAI [bot]	ea51567b89	chore(model gallery): 🤖 add 1 new models via gallery agent (#8170 ) chore(model gallery): 🤖 add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-23 08:19:39 +01:00
LocalAI [bot]	552c62a19c	chore: ⬆️ Update leejet/stable-diffusion.cpp to `5e4579c11d0678f9765463582d024e58270faa9c` (#8166 ) ⬆️ Update leejet/stable-diffusion.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-01-23 08:18:05 +01:00
Ettore Di Giacinto	c0b21a921b	feat: detect thinking support from backend automatically if not explicitly set (#8167 ) detect thinking support from backend automatically if not explicitly set Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-23 00:38:28 +01:00
LocalAI [bot]	b10045adc2	chore: ⬆️ Update ggml-org/llama.cpp to `a5eaa1d6a3732bc0f460b02b61c95680bba5a012` (#8165 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-01-22 23:32:05 +00:00
Ettore Di Giacinto	61b5e3b629	chore: drop test file Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-22 22:19:38 +00:00
Ettore Di Giacinto	e35d7cb3b3	chore: drop test file the function now was removed Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-22 21:47:52 +00:00
Ettore Di Giacinto	0fa0ac4797	fix(videogen): drop incomplete endpoint, add GGUF support for LTX-2 (#8160 ) * Debug Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop openai video endpoint (is not complete) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add download button Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-22 14:09:20 +01:00
LocalAI [bot]	be7ed85838	chore(model gallery): 🤖 add 1 new models via gallery agent (#8157 ) chore(model gallery): 🤖 add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-22 08:25:40 +01:00
LocalAI [bot]	c12b310028	chore: ⬆️ Update ggml-org/llama.cpp to `c301172f660a1fe0b42023da990bf7385d69adb4` (#8151 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-22 00:51:22 +01:00
LocalAI [bot]	0447d5564d	chore: ⬆️ Update leejet/stable-diffusion.cpp to `329571131d62d64a4f49e1acbef49ae02544fdcd` (#8152 ) ⬆️ Update leejet/stable-diffusion.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-22 00:50:41 +01:00
Ettore Di Giacinto	22c0eb5421	chore(diffusers): add 'av' to requirements.txt (#8155 ) Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-01-21 22:35:00 +01:00
LocalAI [bot]	a0a00fb937	chore(model-gallery): ⬆️ update checksum (#8153 ) ⬆️ Checksum updates in gallery/index.yaml Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-21 21:45:11 +01:00
LocalAI [bot]	6dd44742ea	feat(swagger): update swagger (#8150 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-21 21:44:44 +01:00
Richard Palethorpe	00c72e7d3e	fix(tracing): Create trace buffer on first request to enable tracing at runtime (#8148 ) Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-01-21 18:39:39 +01:00
LocalAI [bot]	d01c335cf6	chore: ⬆️ Update ggml-org/whisper.cpp to `7aa8818647303b567c3a21fe4220b2681988e220` (#8146 ) ⬆️ Update ggml-org/whisper.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-21 17:44:01 +01:00
LocalAI [bot]	5687df4535	chore: ⬆️ Update ggml-org/llama.cpp to `ad8d85bd94cc86e89d23407bdebf98f2e6510c61` (#8145 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-21 15:41:36 +00:00
Ettore Di Giacinto	f5fade97e6	chore: drop noisy logs (#8142 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-21 09:52:20 +01:00
Ettore Di Giacinto	b88ae31e4e	chore(model gallery): add flux 2 and flux 2 klein (#8141 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-21 09:46:33 +01:00
Ettore Di Giacinto	f6daaa7c35	chore(deps): Bump llama.cpp to '1c7cf94b22a9dc6b1d32422f72a627787a4783a3' (#8136 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-21 00:12:13 +01:00
Ettore Di Giacinto	c491c6ca90	feat(openresponses): Support reasoning blocks (#8133 ) * feat(openresponses): support reasoning blocks Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * allow to disable reasoning, refactor common logic Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add option to only strip reasoning Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add configurations for custom reasoning tokens Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-21 00:11:45 +01:00
Ettore Di Giacinto	34e054f607	fix(reasoning): support models with reasoning without starting thinking tag (#8132 ) * chore: extract reasoning to its own package Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * make sure we detect thinking tokens from template Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Allow to override via config, add tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-20 21:07:59 +01:00
LocalAI [bot]	e886bb291a	chore(model gallery): 🤖 add 1 new models via gallery agent (#8128 ) chore(model gallery): 🤖 add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-20 12:58:29 +01:00
Ettore Di Giacinto	4bf2f8bbd8	chore(docs): update docs with Anthropic API and openresponses Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-20 09:25:24 +01:00
LocalAI [bot]	d3525b7509	chore: ⬆️ Update ggml-org/llama.cpp to `959ecf7f234dc0bc0cd6829b25cb0ee1481aa78a` (#8122 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-19 22:50:47 +01:00
LocalAI [bot]	c8aa821e0e	chore: ⬆️ Update leejet/stable-diffusion.cpp to `a48b4a3ade9972faf0adcad47e51c6fc03f0e46d` (#8121 ) ⬆️ Update leejet/stable-diffusion.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-19 22:27:46 +01:00
dependabot[bot]	b3191927ae	chore(deps): bump github.com/mudler/cogito from 0.7.2 to 0.8.1 (#8124 ) Bumps [github.com/mudler/cogito](https://github.com/mudler/cogito) from 0.7.2 to 0.8.1. - [Release notes](https://github.com/mudler/cogito/releases) - [Commits](https://github.com/mudler/cogito/compare/v0.7.2...v0.8.1) --- updated-dependencies: - dependency-name: github.com/mudler/cogito dependency-version: 0.8.1 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-01-19 22:26:26 +01:00
LocalAI [bot]	54c5a2d9ea	docs: ⬆️ update docs version mudler/LocalAI (#8120 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-19 21:18:24 +00:00
Ettore Di Giacinto	0279591fec	Enable reranking for Qwen3-VL-Reranker-8B Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-01-19 15:28:58 +01:00
LocalAI [bot]	8845186955	chore: ⬆️ Update leejet/stable-diffusion.cpp to `2efd19978dd4164e387bf226025c9666b6ef35e2` (#8099 ) ⬆️ Update leejet/stable-diffusion.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-18 22:40:35 +01:00
LocalAI [bot]	ab8ed24358	chore: ⬆️ Update ggml-org/llama.cpp to `287a33017b32600bfc0e81feeb0ad6e81e0dd484` (#8100 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-18 22:40:14 +01:00
LocalAI [bot]	a021df5a88	feat(swagger): update swagger (#8098 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-18 22:10:06 +01:00
Ettore Di Giacinto	5f403b1631	chore: drop neutts for l4t (#8101 ) Builds exhausts CI currently, and there are better backends at this point in time. We will probably deprecate it in the future. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> (tag: v3.10.0)	2026-01-18 21:55:56 +01:00
rampa3	897ad1729e	chore(model gallery): add qwen3-coder-30b-a3b-instruct based on model request (#8082 ) * chore(model gallery): add qwen3-coder-30b-a3b-instruct based on model request Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com> * added missing model config import URL Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com> --------- Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com>	2026-01-18 09:23:07 +01:00
LocalAI [bot]	16a18a2e55	chore: ⬆️ Update leejet/stable-diffusion.cpp to `9565c7f6bd5fcff124c589147b2621244f2c4aa1` (#8086 ) ⬆️ Update leejet/stable-diffusion.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-17 22:12:21 +01:00
Ettore Di Giacinto	3387bfaee0	feat(api): add support for open responses specification (#8063 ) * feat: openresponses Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add ttl settings, fix tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: register cors middleware by default Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * satisfy schema Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Logitbias and logprobs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add grammar Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * SSE compliance Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * tool JSON conversion Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * support background mode Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * swagger Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * drop code. This is handled in the handler Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small refactorings Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * background mode for MCP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-17 22:11:47 +01:00
LocalAI [bot]	1cd33047b4	chore: ⬆️ Update ggml-org/llama.cpp to `2fbde785bc106ae1c4102b0e82b9b41d9c466579` (#8087 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-17 21:10:18 +00:00
Ettore Di Giacinto	1de045311a	chore(ui): add video generation link (#8079 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-17 09:49:47 +01:00
LocalAI [bot]	5fe9bf9f84	chore: ⬆️ Update ggml-org/whisper.cpp to `f53dc74843e97f19f94a79241357f74ad5b691a6` (#8074 ) ⬆️ Update ggml-org/whisper.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-17 08:32:53 +01:00
LocalAI [bot]	d4fd0c0609	chore: ⬆️ Update ggml-org/llama.cpp to `388ce822415f24c60fcf164a321455f1e008cafb` (#8073 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-16 21:22:33 +00:00
Ettore Di Giacinto	d16722ee13	Revert "chore(deps): bump torch from 2.3.1+cxx11.abi to 2.8.0 in /backend/python/rerankers in the pip group across 1 directory" (#8072 ) Revert "chore(deps): bump torch from 2.3.1+cxx11.abi to 2.8.0 in /backend/pyt…" This reverts commit `1f10ab39a9`.	2026-01-16 20:50:33 +01:00

5427 Commits