Commit Graph

  • a6d155b195 fix(sglang): patch CMakeLists.txt instead of CXXFLAGS for AVX-512 feat/sglang Ettore Di Giacinto 2026-04-16 07:36:34 +00:00
  • 7f88a3ba30 chore: ⬆️ Update leejet/stable-diffusion.cpp to c41c5ded7af85e01b7fe442ff7950c720706d53a (#9366) master LocalAI [bot] 2026-04-16 09:04:33 +02:00
  • c4f309388e fix(gallery): correct gemma-4 model URIs returning 404 (#9379) Matt Van Horn 2026-04-16 02:51:20 -04:00
  • ebbe3ee0e0 chore(deps): bump torch dependabot/pip/backend/python/ace-step/pip-f98efa472c dependabot[bot] 2026-04-16 06:25:05 +00:00
  • 146f69980a chore(deps): bump the pip group across 1 directory with 2 updates dependabot/pip/backend/python/coqui/pip-842ffb7a79 dependabot[bot] 2026-04-16 06:24:39 +00:00
  • efae3fd97b chore(deps): bump dompurify dependabot/npm_and_yarn/core/http/react-ui/npm_and_yarn-2a73d1bbcf dependabot[bot] 2026-04-16 06:24:27 +00:00
  • ab326a9c61 chore(deps): bump the npm_and_yarn group across 1 directory with 6 updates (#9373) dependabot[bot] 2026-04-16 08:23:03 +02:00
  • df2d25cee5 chore: ⬆️ Update ikawrakow/ik_llama.cpp to 1163af96cf6bb4a4b819f998f84c153a49768b99 (#9368) LocalAI [bot] 2026-04-16 01:13:08 +02:00
  • 96cd561d9d chore: ⬆️ Update ggml-org/llama.cpp to b3d758750a268bf93f084ccfa3060fb9a203192a (#9370) LocalAI [bot] 2026-04-16 01:12:39 +02:00
  • 08445b1b89 chore(model-gallery): ⬆️ update checksum (#9369) LocalAI [bot] 2026-04-16 01:12:01 +02:00
  • 06b5b93556 fix(sglang): force AVX-512 CXXFLAGS and disable CI e2e job Ettore Di Giacinto 2026-04-15 07:28:21 +00:00
  • d47e2aa93f feat(backends): add sglang Ettore Di Giacinto 2026-04-14 22:38:52 +00:00
  • ad3c8c4832 fix(agents): handle embedding model dim changes on collection upload (#9365) Ettore Di Giacinto 2026-04-15 20:05:28 +02:00
  • 6f0051301b feat(backend): add tinygrad multimodal backend (experimental) (#9364) Ettore Di Giacinto 2026-04-15 19:48:23 +02:00
  • 8487058673 chore(model-gallery): ⬆️ update checksum (#9358) LocalAI [bot] 2026-04-15 01:25:59 +02:00
  • 62862ca06b chore: ⬆️ Update ggml-org/llama.cpp to fae3a28070fe4026f87bd6a544aba1b2d1896566 (#9357) LocalAI [bot] 2026-04-15 01:25:41 +02:00
  • 07e244d869 feat(swagger): update swagger (#9356) LocalAI [bot] 2026-04-15 01:25:24 +02:00
  • 95efb8a562 feat(backend): add turboquant llama.cpp-fork backend (#9355) Ettore Di Giacinto 2026-04-15 01:25:04 +02:00
  • 410d100cc3 chore(ui): improve visibility of forms, color palette Ettore Di Giacinto 2026-04-14 21:53:03 +00:00
  • 833b7e8557 chore(docs): update transcription endpoint Ettore Di Giacinto 2026-04-14 14:14:46 +00:00
  • 87e6de1989 feat: wire transcription for llama.cpp, add streaming support (#9353) Ettore Di Giacinto 2026-04-14 16:13:40 +02:00
  • b361d2ddd6 chore(gallery): add new llama.cpp supported models (qwen-asr, ocr) Ettore Di Giacinto 2026-04-14 10:04:50 +00:00
  • 1e4c4577bb fix(ci): small fixups Ettore Di Giacinto 2026-04-14 09:27:27 +00:00
  • 98fd9d5cc6 chore(deps): bump github.com/charmbracelet/glamour from 0.10.0 to 1.0.0 (#9340) dependabot[bot] 2026-04-14 11:17:05 +02:00
  • 0c725f5702 chore(deps): bump github.com/swaggo/echo-swagger from 1.4.1 to 1.5.2 (#9344) dependabot[bot] 2026-04-14 11:15:37 +02:00
  • 7661a4ffa5 chore(deps): bump github.com/testcontainers/testcontainers-go/modules/nats from 0.41.0 to 0.42.0 (#9341) dependabot[bot] 2026-04-14 11:15:26 +02:00
  • 24ad6e4be1 chore(deps): bump github.com/google/go-containerregistry from 0.21.3 to 0.21.5 (#9343) dependabot[bot] 2026-04-14 11:15:09 +02:00
  • c0648b8836 chore: ⬆️ Update ikawrakow/ik_llama.cpp to 55d3c05bf7b377deaa5dc84d255d9740a345a206 (#9348) LocalAI [bot] 2026-04-14 08:56:25 +02:00
  • a05c7def59 fix(e2e): update to new testcontainers Ettore Di Giacinto 2026-04-14 06:56:04 +00:00
  • 906acba8db chore: ⬆️ Update ggml-org/llama.cpp to e97492369888f5311e4d1f3beb325a36bbed70e9 (#9347) LocalAI [bot] 2026-04-14 08:54:25 +02:00
  • 4226ca4aee chore(deps): bump sentence-transformers from 5.2.3 to 5.4.0 in /backend/python/transformers (#9342) dependabot[bot] 2026-04-14 00:30:27 +02:00
  • c6d5dc3374 chore(model-gallery): ⬆️ update checksum (#9346) LocalAI [bot] 2026-04-13 23:00:13 +02:00
  • 7ce675af21 chore(gallery-agent): extract readme Ettore Di Giacinto 2026-04-13 20:31:49 +00:00
  • be1b8d56c9 fix(gallery): override parameters for flux kontext Ettore Di Giacinto 2026-04-13 22:29:17 +02:00
  • 97f087ed31 chore(deps): bump github.com/testcontainers/testcontainers-go from 0.41.0 to 0.42.0 (#9338) dependabot[bot] 2026-04-13 21:54:02 +02:00
  • 8691bbe663 chore(deps): bump actions/upload-pages-artifact from 4 to 5 (#9337) dependabot[bot] 2026-04-13 21:53:47 +02:00
  • 7998f96f11 chore(deps): bump softprops/action-gh-release from 2 to 3 (#9336) dependabot[bot] 2026-04-13 21:53:28 +02:00
  • cada97ee46 chore(gallery-agent): control bot via PR Ettore Di Giacinto 2026-04-13 19:52:48 +00:00
  • 3375ea1a2c chore(gallery-agent): simplify Ettore Di Giacinto 2026-04-13 19:50:31 +00:00
  • 0e7c0adee4 docs: document tool calling on vLLM and MLX backends Ettore Di Giacinto 2026-04-13 16:58:55 +00:00
  • 016da02845 feat: refactor shared helpers and enhance MLX backend functionality (#9335) Ettore Di Giacinto 2026-04-13 18:44:03 +02:00
  • daa0272f2e docs(agents): capture vllm backend lessons + runtime lib packaging (#9333) Ettore Di Giacinto 2026-04-13 11:09:57 +02:00
  • d67623230f feat(vllm): parity with llama.cpp backend (#9328) Ettore Di Giacinto 2026-04-13 11:00:29 +02:00
  • cd56a05c3e ci(vllm): disable tests-vllm-grpc job (heterogeneous runners) feat/vllm-parity Ettore Di Giacinto 2026-04-13 07:46:57 +00:00
  • 0f90d17aac feat(swagger): update swagger (#9329) LocalAI [bot] 2026-04-13 09:42:36 +02:00
  • ea32b8953f chore: ⬆️ Update ggml-org/llama.cpp to 1e9d771e2c2f1113a5ebdd0dc15bafe57dce64be (#9330) LocalAI [bot] 2026-04-13 09:42:18 +02:00
  • d74cd56b14 feat(vllm): bundle libnuma/libgomp via package.sh Ettore Di Giacinto 2026-04-12 20:20:21 +00:00
  • 017bdee4e4 ci(vllm): install libnuma1 + libgomp1 on bigger-runner Ettore Di Giacinto 2026-04-12 20:18:13 +00:00
  • c4dc495ea1 ci(vllm): install make + build deps on bigger-runner Ettore Di Giacinto 2026-04-12 20:08:09 +00:00
  • ea2bbabffd ci(vllm): use bigger-runner instead of source build Ettore Di Giacinto 2026-04-12 16:02:49 +00:00
  • bc7578bdb1 fix(hipblas): pin down rocm6.4 wheels on whisperx (7.x not supported) Ettore Di Giacinto 2026-04-12 15:27:51 +00:00
  • 329df11989 fix(vllm): build from source on CI to avoid SIGILL on prebuilt wheel Ettore Di Giacinto 2026-04-12 15:14:42 +00:00
  • c7f444d18b ci(test-extra): run vllm e2e tests on CPU Ettore Di Giacinto 2026-04-12 14:53:44 +00:00
  • e7f406169a test(e2e-backends): add tools capability + HF model name support Ettore Di Giacinto 2026-04-12 14:51:58 +00:00
  • 034a60bf76 ci(backend): build cpu-vllm container image Ettore Di Giacinto 2026-04-12 09:43:04 +00:00
  • c99188f106 fix(vllm): tool parser constructor compat + e2e tool calling test Ettore Di Giacinto 2026-04-12 09:15:16 +00:00
  • c2f73a987e fix(vllm): CPU build compatibility with vllm 0.14.1 Ettore Di Giacinto 2026-04-12 08:58:57 +00:00
  • b215843807 feat(vllm): CPU support + shared utils + vllm-omni feature parity Ettore Di Giacinto 2026-04-12 08:19:32 +00:00
  • 6786f05c64 feat(vllm): wire native tool/reasoning parsers + chat deltas + logprobs Ettore Di Giacinto 2026-04-12 08:19:14 +00:00
  • 6cf8263c30 feat(config): add vLLM parser defaults hook and importer auto-detection Ettore Di Giacinto 2026-04-12 08:11:46 +00:00
  • a30719f04a refactor(config): introduce backend hook system and migrate llama-cpp defaults Ettore Di Giacinto 2026-04-12 08:11:38 +00:00
  • 40b1c6f943 fix(schema): serialize ToolCallID and Reasoning in Messages.ToProto Ettore Di Giacinto 2026-04-12 08:11:24 +00:00
  • 9ca03cf9cc feat(backends): add ik-llama-cpp (#9326) Ettore Di Giacinto 2026-04-12 13:51:28 +02:00
  • 151ad271f2 feat(rocm): bump to 7.x (#9323) Ettore Di Giacinto 2026-04-12 08:51:30 +02:00
  • 2865f0f8d3 feat(ux): backend management enhancement (#9325) Ettore Di Giacinto 2026-04-12 00:35:22 +02:00
  • 5fe87cb0d5 feat: upgrade banner with Upgrade All button, detect pre-existing backends feat/backend-versioning Ettore Di Giacinto 2026-04-11 22:11:03 +00:00
  • 6fbda277c5 chore: ⬆️ Update ggml-org/llama.cpp to ff5ef8278615a2462b79b50abdf3cc95cfb31c6f (#9319) LocalAI [bot] 2026-04-11 23:15:23 +02:00
  • 7a0e6ae6d2 feat(qwen3tts.cpp): add new backend (#9316) Ettore Di Giacinto 2026-04-11 23:14:26 +02:00
  • e4bfc42a2d chore: ⬆️ Update leejet/stable-diffusion.cpp to 6b675a5ede9b0edf0a0f44191e8b79d7ef27615a (#9320) LocalAI [bot] 2026-04-11 23:07:30 +02:00
  • 7edd3ea96f chore(model-gallery): ⬆️ update checksum (#9321) LocalAI [bot] 2026-04-11 22:53:48 +02:00
  • b20a2f1cea feat(swagger): update swagger (#9318) LocalAI [bot] 2026-04-11 22:31:36 +02:00
  • 8ab0744458 feat: backend versioning, upgrade detection and auto-upgrade (#9315) Ettore Di Giacinto 2026-04-11 22:31:15 +02:00
  • 6dd37a95c4 test: add e2e tests for backend upgrade API Ettore Di Giacinto 2026-04-11 08:32:18 +00:00
  • ee00a10836 fix: use advisory lock for upgrade checker in distributed mode Ettore Di Giacinto 2026-04-11 08:24:03 +00:00
  • 948f3bfaa4 feat: add upgrade checker service, API endpoints, and CLI command Ettore Di Giacinto 2026-04-11 08:11:06 +00:00
  • 1e083cd870 feat(ui): add backend version display and upgrade support Ettore Di Giacinto 2026-04-11 08:02:49 +00:00
  • b19e60d03a feat: add AutoUpgradeBackends config and runtime settings Ettore Di Giacinto 2026-04-11 07:58:14 +00:00
  • 4d463e9f0d feat: add backend upgrade detection and execution logic Ettore Di Giacinto 2026-04-11 07:51:39 +00:00
  • ae4ae5f425 feat: add backend versioning data model foundation Ettore Di Giacinto 2026-04-11 07:45:12 +00:00
  • 7c1865b307 Fix load of z-image-turbo (#9264) thelittlefireman 2026-04-11 08:42:13 +02:00
  • 62a674ce12 chore: ⬆️ Update ggml-org/llama.cpp to e62fa13c2497b2cd1958cb496e9489e86bbd5182 (#9312) LocalAI [bot] 2026-04-11 08:39:10 +02:00
  • c39213443b feat(swagger): update swagger (#9310) LocalAI [bot] 2026-04-11 08:38:55 +02:00
  • 606f462da4 chore: ⬆️ Update PABannier/sam3.cpp to 01832ef85fcc8eb6488f1d01cd247f07e96ff5a9 (#9311) LocalAI [bot] 2026-04-11 08:38:30 +02:00
  • 5c35e85fe2 feat: allow to pin models and skip from reaping (#9309) Ettore Di Giacinto 2026-04-11 08:38:17 +02:00
  • 062e0d0d00 feat: Add toggle mechanism to enable/disable models from loading on demand (#9304) Leigh Phillips 2026-04-10 09:17:41 -07:00
  • d4cd6c284f chore: ⬆️ Update ggml-org/llama.cpp to d132f22fc92f36848f7ccf2fc9987cd0b0120825 (#9302) LocalAI [bot] 2026-04-10 08:46:45 +02:00
  • 3bb8b65d31 chore(qwen3-asr): pass prompt as context to transcribe (#9301) Ettore Di Giacinto 2026-04-10 08:45:59 +02:00
  • 9748a1cbc6 fix(streaming): skip chat deltas for role-init elements to prevent first token duplication (#9299) Ettore Di Giacinto 2026-04-10 08:45:47 +02:00
  • 6bc76dda6d feat(swagger): update swagger (#9300) LocalAI [bot] 2026-04-10 01:05:53 +02:00
  • e1a6010874 fix(streaming): deduplicate tool call emissions during streaming (#9292) Ettore Di Giacinto 2026-04-10 00:44:25 +02:00
  • 706cf5d43c feat(sam.cpp): add sam.cpp detection backend (#9288) Ettore Di Giacinto 2026-04-09 21:49:11 +02:00
  • 13a6ed709c fix: thinking models with tools returning empty content (reasoning-only retry loop) (#9290) Ettore Di Giacinto 2026-04-09 18:30:31 +02:00
  • 85be4ff03c feat(api): add ollama compatibility (#9284) Ettore Di Giacinto 2026-04-09 14:15:14 +02:00
  • b0d9ce4905 Remove header from OpenAI Realtime API documentation Ettore Di Giacinto 2026-04-09 09:00:28 +02:00
  • 7081b54c09 chore: ⬆️ Update leejet/stable-diffusion.cpp to e8323cabb0e4511ba18a50b1cb34cf1f87fc71ef (#9281) LocalAI [bot] 2026-04-09 08:12:23 +02:00
  • 2b05420f95 chore(llama.cpp): bump to 'd12cc3d1ca6bba741cd77887ac9c9ee18c8415c7' (#9282) Ettore Di Giacinto 2026-04-09 08:12:05 +02:00
  • b64347b6aa chore: add gemma4 to the gallery Ettore Di Giacinto 2026-04-08 23:44:16 +00:00
  • e00ce981f0 fix: try to add whisperx and faster-whisper for more variants (#9278) Ettore Di Giacinto 2026-04-08 21:23:38 +02:00
  • 285f7d4340 chore: add embeddingemma Ettore Di Giacinto 2026-04-08 17:40:55 +00:00
  • ea6e850809 feat: Add Kokoros backend (#9212) Richard Palethorpe 2026-04-08 18:23:16 +01:00