LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-07-02 12:26:49 -04:00

Author	SHA1	Message	Date
Ettore Di Giacinto	4ea461c330	fix(ui): correctly map watchdog fields (#9022) Fixes: https://github.com/mudler/LocalAI/issues/9018 Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-15 22:12:24 +01:00
Ettore Di Giacinto	042a9b8ef6	Remove Table of Contents from README Removed the Table of Contents section from the README. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-15 21:49:13 +01:00
Ettore Di Giacinto	65f1a4154a	Revise README structure and content Updated the README to include new sections and remove outdated content. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-15 21:47:08 +01:00
LocalAI [bot]	c6a51289b0	fix: Automatically disable mmap for Intel SYCL backends (#9012 ) (#9015 ) * fix: Automatically disable mmap for Intel SYCL backends Fixes issue #9012 where Qwen3.5 models fail to load on Intel Arc GPU with RPC EOF error. The Intel SYCL backend has a known issue where mmap enabled causes the backend to hang. This change automatically disables mmap when detecting Intel or SYCL backends. References: - https://github.com/mudler/LocalAI/issues/9012 - Documentation mentions: SYCL hangs when mmap: true is set * feat: Add logging for mmap auto-disable on Intel SYCL backends As requested in PR review, add xlog.Info call to log when mmap is automatically disabled for Intel SYCL backends. This helps with debugging and confirms the auto-disable logic is working. --------- Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-03-15 21:06:35 +01:00
LocalAI [bot]	87525109f1	chore: ⬆️ Update ggml-org/llama.cpp to `3a6f059909ed5dab8587df5df4120315053d57a4` (#9009) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-15 09:46:45 +01:00
Ettore Di Giacinto	c596d8a5d9	fix: Change baseDir assignment to use ModelPath (#9010) Fixes: https://github.com/mudler/LocalAI/issues/9005 Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-15 09:45:58 +01:00
LocalAI [bot]	d79ad76e48	docs: ⬆️ update docs version mudler/LocalAI (#9008) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-15 08:39:49 +01:00
Ettore Di Giacinto	dde0353432	chore(api): add path to expose collection raw files Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-14 22:31:45 +00:00
Ettore Di Giacinto	8e8b7df715	fix(ui): do not let from button to trigger Signed-off-by: Ettore Di Giacinto <mudler@localai.io> v4.0.0	2026-03-14 17:35:04 +00:00
Ettore Di Giacinto	5affb747a9	chore: drop AIO images (#9004) AIO images are behind and take effort to maintain. The wizard and model installation have been simplified massively, so AIO images lost their purpose. This allows us to be more laser-focused on the main images and relieves stress on CI. Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-14 17:49:36 +01:00
Ettore Di Giacinto	0ac4ac5bdd	chore: configure data volume Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-14 14:46:52 +00:00
Ettore Di Giacinto	0725b6f334	chore: bump dependencies Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-14 10:14:42 +00:00
Ettore Di Giacinto	0a856c9bae	chore: bump dependencies Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-14 10:06:05 +00:00
LocalAI [bot]	977063c4ba	chore: ⬆️ Update ggml-org/llama.cpp to `e30f1fdf74ea9238ff562901aa974c75aab6619b` (#8997) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-14 01:16:42 +01:00
LocalAI [bot]	0ec3ea4a46	fix(acestep-cpp): resolve relative model paths in options (#8993 ) * fix(acestep-cpp): resolve relative model paths in options The acestep-cpp backend was failing to load models because the model paths in options (text_encoder_model, dit_model, vae_model) were being passed to the C++ code without resolving their relative paths. When a user configures acestep-cpp-turbo-4b, the model paths are specified as relative paths like 'acestep-cpp/acestep-v15-turbo-Q8_0.gguf'. The backend was passing these paths directly to the C++ code without joining them with the model directory. This fix: 1. Gets the base directory from the ModelFile path 2. Resolves all relative paths in options to be absolute paths 3. Adds debug logging to show resolved paths for troubleshooting Fixes #8991 Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * test: fix acestep tests to not join modeldir in options According to code review feedback, the Options array in TestLoadModel and TestSoundGeneration should contain just the model filenames without filepath.Join with modelDir. The model paths are handled internally by the backend. 
* fix: change bpm parameter type to float32 to match C++ API signature * test: fix TestLoadModel and TestSoundGeneration to use baseDir for model paths - Modified TestLoadModel to compute baseDir from main model path and use it for relative model paths - Modified TestSoundGeneration similarly to use baseDir for model paths - Changed bpm parameter type from int32 to float32 to match C++ API * Apply suggestions from code review Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-14 01:16:13 +01:00
Richard Palethorpe	87b3e10024	fix(flux.2-klein-9b): Use Qwen3-8b to avoid GGML assertion failure on tensor mismatch (#8995) Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-03-13 21:39:31 +01:00
Richard Palethorpe	e4c1c39727	fix(conf): Don't overwrite env provided galleries with runtime conf (#8994) Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-03-13 21:39:03 +01:00
Richard Palethorpe	ed2c6da4bf	fix(ui): Move routes to /app to avoid conflict with API endpoints (#8978) Also test for regressions in HTTP GET API key exempted endpoints because this list can get out of sync with the UI routes. Also fix support for proxying on a different prefix both server and client side. Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-03-13 21:38:18 +01:00
Richard Palethorpe	f9a850c02a	feat(realtime): WebRTC support (#8790) * feat(realtime): WebRTC support Signed-off-by: Richard Palethorpe <io@richiejp.com> * fix(tracing): Show full LLM opts and deltas Signed-off-by: Richard Palethorpe <io@richiejp.com> --------- Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-03-13 21:37:15 +01:00
Ettore Di Giacinto	4e3bf2752d	fix(tests): Remove 'huggingface' substring expectation Remove check for 'huggingface' in response body. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-13 13:12:04 +01:00
Ettore Di Giacinto	0b53bc5061	fix(ci): Delete leftovers huggingface backend from backend.yml Removed huggingface backend configuration from the workflow. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-13 09:59:43 +01:00
LocalAI [bot]	ff21bc6cbb	chore: ⬆️ Update ace-step/acestep.cpp to `5aa065445541094cba934299cd498bbb9fa5c434` (#8984) ⬆️ Update ace-step/acestep.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-13 07:59:34 +01:00
LocalAI [bot]	46a8941a2c	chore: ⬆️ Update ggml-org/llama.cpp to `57819b8d4b39d893408e51520dff3d47d1ebb757` (#8983) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-13 07:59:15 +01:00
LocalAI [bot]	c0351b8e6a	Remove HuggingFace backend support (#8971 ) * Remove HuggingFace backend support, restore other backends - Removed backend/go/huggingface directory and all related files - Removed pkg/langchain/huggingface.go - Removed LCHuggingFaceBackend from pkg/model/initializers.go - Removed huggingface backend entries from backend/index.yaml - Updated backend/README.md to remove HuggingFace backend reference - Restored kitten-tts, local-store, silero-vad, piper backends that were incorrectly removed This change removes only HuggingFace backend support from LocalAI as per the P0 priority request in issue #8963, while preserving other backends (kitten-tts, local-store, silero-vad, piper). Signed-off-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> * Remove huggingface backend from test.yml build command The tests-linux CI job was failing because it was trying to build the huggingface backend which no longer exists after the backend removal. This removes huggingface from the build command in test.yml. * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-13 01:09:30 +01:00
Ettore Di Giacinto	ec91c477dc	Remove model descriptions from index.yaml Removed description fields from multiple model entries in index.yaml. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-12 21:43:21 +01:00
LocalAI [bot]	3b9abffdc8	chore(model-gallery): ⬆️ update checksum (#8985) ⬆️ Checksum updates in gallery/index.yaml Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-12 21:34:27 +01:00
Ettore Di Giacinto	6c11c54a3b	fix: avoid race condition Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-12 18:46:22 +00:00
Ettore Di Giacinto	13bd0d9944	fix(collections): start agent pool after http server (#8981) Otherwise if using collections with postgresql we create a deadlock, as we need embeddings to be up Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-12 19:25:49 +01:00
Ettore Di Giacinto	a738f8b0e4	feat(backends): add ace-step.cpp (#8965) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-12 18:56:26 +01:00
Ettore Di Giacinto	8f3efaed15	Update model entry in index.yaml Removed description and license fields from model entry. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-12 18:51:29 +01:00
LocalAI [bot]	c4cccb728e	chore(model gallery): 🤖 add 1 new models via gallery agent (#8980) chore(model gallery): 🤖 add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-12 18:51:00 +01:00
Ettore Di Giacinto	b209947e81	Remove Qwen3.5-35B model and update model path Removed deprecated Qwen3.5-35B-A3B model configuration and updated model path for Qwen3. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-12 18:19:55 +01:00
Ettore Di Giacinto	14e82d76f9	chore(ui): improve errors and reporting during model installation (#8979) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-12 18:19:06 +01:00
LocalAI [bot]	f73a158153	docs: Document GPU auto-fit mode limitations and trade-offs (closes #8562 ) (#8954 ) * docs: Add documentation about GPU auto-fit mode limitations (closes #8562) - Document the default gpu_layers behavior (9999999) that disables auto-fit - Explain the trade-off between auto-fit and VRAM threshold unloading - Add recommendations for users who want to enable gpu_layers: -1 - Note known issues with tensor_buft_override buffer errors - Link to issue #8562 for future improvements Signed-off-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-12 13:35:31 +01:00
Richard Palethorpe	b24ca51287	fix(llama-cpp): Set enable_thinking in the correct place (#8973) Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-03-12 13:32:29 +01:00
Sertaç Özercan	45d18813bd	fix: gate CUDA directory checks on GPU vendor to prevent false CUDA detection (#8942) Container images that install CUDA runtime libraries (e.g., cuda-cudart-12-5 via apt) create /usr/local/cuda-12 directories as a side effect. The previous code checked for these directories before checking whether a GPU was present, causing CPU-only hosts to select a CUDA backend that crashes because libcuda.so.1 is absent. Reorder checks so CUDA directory existence only refines the capability when an NVIDIA GPU is actually detected, consistent with the arm64 L4T code path. Signed-off-by: Sertac Ozercan <sozercan@gmail.com>	2026-03-12 07:53:39 +01:00
Ettore Di Giacinto	7dc691c171	feat: add fish-speech backend (#8962) * feat: add fish-speech backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * drop portaudio Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-12 07:48:23 +01:00
Ettore Di Giacinto	17f36e73b5	Remove model name entries from index.yaml Removed 'name' entries for various models in the index file. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-12 01:13:58 +01:00
Ettore Di Giacinto	031909d85a	Clean up gallery index by removing obsolete models Removed multiple models and their associated metadata from the gallery index. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-12 00:55:19 +01:00
LocalAI [bot]	996dd7652f	feat(swagger): update swagger (#8961) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-11 22:01:55 +01:00
LocalAI [bot]	bf4f8da266	fix: include model name in mmproj file path to prevent model isolation (#8937 ) (#8940 ) * fix: include model name in mmproj file path to prevent model isolation issues This fix addresses issue #8937 where different models with mmproj files having the same filename (e.g., mmproj-F32.gguf) would overwrite each other. By including the model name in the path (llama-cpp/mmproj/<model-name>/<filename>), each model's mmproj files are now stored in separate directories, preventing the collision that caused conversations to fail when switching between models. Fixes #8937 Signed-off-by: LocalAI Bot <localai-bot@example.com> * test: update test expectations for model name in mmproj path The test file had hardcoded expectations for the old mmproj path format. Updated the test expectations to include the model name subdirectory to match the new path structure introduced in the fix. Fixes CI failures on tests-apple and tests-linux * fix: add model name to model path for consistency with mmproj path This change makes the model path consistent with the mmproj path by including the model name subdirectory in both paths: - mmproj: llama-cpp/mmproj/<model-name>/<filename> - model: llama-cpp/models/<model-name>/<filename> This addresses the reviewer's feedback that the model config generation needs to correctly reference the mmproj file path. Fixes the issue where the model path didn't include the model name subdirectory while the mmproj path did. Signed-off-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> --------- Signed-off-by: LocalAI Bot <localai-bot@example.com> Signed-off-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> Co-authored-by: team-coding-agent-1 <team-coding-agent-1@localai.dev>	2026-03-11 10:28:37 +01:00
Attila Györffy	5a67b5d73c	Fix image upload processing and img2img pipeline in diffusers backend (#8879 ) * fix: add missing bufio.Flush in processImageFile The processImageFile function writes decoded image data (from base64 or URL download) through a bufio.NewWriter but never calls Flush() before closing the underlying file. Since bufio's default buffer is 4096 bytes, small images produce 0-byte files and large images are truncated — causing PIL to fail with "cannot identify image file". This breaks all image input paths: file, files, and ref_images parameters in /v1/images/generations, making img2img, inpainting, and reference image features non-functional. Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> * fix: merge options into kwargs in diffusers GenerateImage The GenerateImage method builds a local `options` dict containing the source image (PIL), negative_prompt, and num_inference_steps, but never merges it into `kwargs` before calling self.pipe(*kwargs). This causes img2img to fail with "Input is in incorrect format" because the pipeline never receives the image parameter. Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> test: add unit test for processImageFile base64 decoding Verifies that a base64-encoded PNG survives the write path (encode → decode → bufio.Write → Flush → file on disk) with byte-for-byte fidelity. The test image is small enough to fit entirely in bufio's 4096-byte buffer, which is the exact scenario where the missing Flush() produced a 0-byte file. Also tests that invalid base64 input is handled gracefully. Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> * test: verify GenerateImage merges options into pipeline kwargs Mocks the diffusers pipeline and calls GenerateImage with a source image and negative prompt. Asserts that the pipeline receives the image, negative_prompt, and num_inference_steps via kwargs — the exact parameters that were silently dropped before the fix. 
Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> * fix: move kwargs.update(options) earlier in GenerateImage Move the options merge right after self.options merge (L742) so that image, negative_prompt, and num_inference_steps are available to all downstream code paths including img2vid and txt2vid. Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> * test: convert processImageFile tests to ginkgo Replace standard testing with ginkgo/gomega to be consistent with the rest of the test suites in the project. Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> --------- Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-11 08:05:50 +01:00
LocalAI [bot]	270eb956c7	chore: ⬆️ Update ggml-org/llama.cpp to `10e5b148b061569aaee8ae0cf72a703129df0eab` (#8946) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-11 08:04:09 +01:00
Ettore Di Giacinto	8818452d85	feat(ui): MCP Apps, mcp streaming and client-side support (#8947 ) * Revert "fix: Add timeout-based wait for model deletion completion (#8756)" This reverts commit `9e1b0d0c82`. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: add mcp prompts and resources Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(ui): add client-side MCP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(ui): allow to authenticate MCP servers Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(ui): add MCP Apps Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: update AGENTS Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: allow to collapse navbar, save state in storage Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(ui): add MCP button also to home page Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(chat): populate string content Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-11 07:30:49 +01:00
LocalAI [bot]	79f90de935	chore(model-gallery): ⬆️ update checksum (#8945) ⬆️ Checksum updates in gallery/index.yaml Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-10 21:43:52 +01:00
LocalAI [bot]	bda826d005	chore(model gallery): 🤖 add 1 new models via gallery agent (#8939) chore(model gallery): 🤖 add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-10 18:09:11 +01:00
Ettore Di Giacinto	89076bab92	fix(ui): wrap struct to pass metadata fields Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-10 08:48:25 +00:00
LocalAI [bot]	199fe89cfe	feat: Expand section index pages with comprehensive navigation (M7) (#8929) feat: expand section index pages with comprehensive navigation (M7) Co-authored-by: localai-bot <localai-bot@noreply.github.com>	2026-03-10 07:34:44 +01:00
LocalAI [bot]	de55ff3725	fix: correct grammar in CONTRIBUTING.md documentation section (#8932) Co-authored-by: localai-bot <localai-bot@noreply.github.com>	2026-03-10 07:33:11 +01:00
dependabot[bot]	6bdfefda96	chore(deps): bump node from 22-slim to 25-slim (#8922) Bumps node from 22-slim to 25-slim. --- updated-dependencies: - dependency-name: node dependency-version: 25-slim dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-03-10 07:31:03 +01:00


5797 Commits
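
Note on commit 0ec3ea4a46 (relative model paths in acestep-cpp options): the three steps listed in that message (take the base directory from the model file path, resolve relative option paths against it, leave the rest alone) can be sketched roughly as below. This is an assumption-laden illustration, not the backend's code: resolveOptionPaths is a hypothetical helper, the map-based options shape is assumed, and /models/main.gguf and /abs/dit.gguf are made-up paths. Only the option names and the relative path come from the commit message.

```go
package main

import (
	"fmt"
	"path/filepath"
)

// resolveOptionPaths (hypothetical helper) joins every relative path in opts
// with the directory that contains modelFile. Absolute paths and empty values
// are passed through unchanged.
func resolveOptionPaths(modelFile string, opts map[string]string) map[string]string {
	baseDir := filepath.Dir(modelFile)
	resolved := make(map[string]string, len(opts))
	for key, p := range opts {
		if p != "" && !filepath.IsAbs(p) {
			p = filepath.Join(baseDir, p)
		}
		resolved[key] = p
	}
	return resolved
}

func main() {
	// Option names and the relative path are taken from the commit message;
	// the other paths are illustrative assumptions.
	opts := map[string]string{
		"vae_model": "acestep-cpp/acestep-v15-turbo-Q8_0.gguf",
		"dit_model": "/abs/dit.gguf",
	}
	out := resolveOptionPaths("/models/main.gguf", opts)
	fmt.Println(out["vae_model"])
	fmt.Println(out["dit_model"])
}
```

Commit c596d8a5d9 later changed the baseDir assignment to use ModelPath, so the choice of base directory in this sketch should not be read as the current behavior.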