LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-07-05 13:57:28 -04:00

Author	SHA1	Message	Date
LocalAI [bot]	9297074caa	docs: expand GPU acceleration guide with L4T, multi-GPU, monitoring, and troubleshooting (#8858 ) - Expand multi-GPU section to cover llama.cpp (CUDA_VISIBLE_DEVICES, HIP_VISIBLE_DEVICES) in addition to diffusers - Add NVIDIA L4T/Jetson section with quick start commands and cross-reference to the dedicated ARM64 page - Add GPU monitoring section with vendor-specific tools (nvidia-smi, rocm-smi, intel_gpu_top) - Add troubleshooting section covering common issues: GPU not detected, CPU fallback, OOM errors, unsupported ROCm targets, SYCL mmap hang - Replace "under construction" warning with useful cross-references to related docs (container images, VRAM management) Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@noreply.github.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-08 21:59:57 +01:00
LocalAI [bot]	2133031b47	feat: Create comprehensive troubleshooting guide (M1 task) (#8856 ) * feat: create comprehensive troubleshooting guide (M1 task) - Consolidates troubleshooting information from scattered documentation - Covers installation, model loading, GPU/memory, API, performance, Docker, and network issues - Includes diagnostic commands and step-by-step solutions - Organized by category for easy navigation * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-08 21:58:32 +01:00
LocalAI [bot]	e026b513b2	feat: add MIT license badge to README.md (#8871 ) feat: add MIT license badge to README.md header - Add Shields.io license badge showing 'License: MIT' - Place badge in header section with other badges - Link badge to LICENSE file - Follows existing badge format (for-the-badge style) Co-authored-by: localai-bot <localai-bot@noreply.github.com>	2026-03-08 21:17:36 +01:00
LocalAI [bot]	2334556a8f	feat(cli): add configurable backend image fallback tags via CLI options (#8817 ) * feat(cli): add configurable backend image fallback tags via CLI options - Add three new CLI flags: --backend-images-release-tag, --backend-images-branch-tag, --backend-dev-suffix - Add corresponding fields to SystemState for passing configuration - Add WithBackendImagesReleaseTag, WithBackendImagesBranchTag, WithBackendDevSuffix options - Modify getFallbackTagValues to use SystemState instead of environment variables - Pass CLI options through to SystemState in run.go Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> * fix: add missing os import in core/gallery/backends.go Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> --------- Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-03-08 21:16:37 +01:00
LocalAI [bot]	05b7cce633	feat: add Events column to Agents list page (#8870 ) - Add 'Events' column header between 'Status' and 'Actions' - Fetch observable counts for each agent using /api/agents/<name>/observables - Display events count as clickable link navigating to agent status page - Events count updates every 5 seconds with agent refresh interval - Shows '0' if API call fails for an agent Co-authored-by: localai-bot <localai-bot@noreply.github.com>	2026-03-08 21:15:29 +01:00
LocalAI [bot]	ce33b7e6f8	docs: add comprehensive development setup instructions to CONTRIBUTING.md (H7) (#8860 ) * docs: add comprehensive development setup instructions to CONTRIBUTING.md - Expand prerequisites with Go version requirements and installation links - Add system dependencies for Ubuntu/Debian, CentOS/RHEL/Fedora, macOS, and Windows - Document build commands with explanations and key build variables - Add environment variables section with useful development env vars - Include development workflow guidelines (branch naming, commit format, PR process) - Enhance testing section with per-package and focused test instructions * Apply suggestions from code review Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-08 18:27:00 +01:00
LocalAI [bot]	9090bca920	feat: Add documentation for undocumented API endpoints (#8852 ) * feat: add documentation for undocumented API endpoints Creates comprehensive documentation for 8 previously undocumented endpoints: - Voice Activity Detection (/v1/vad) - Video Generation (/video) - Sound Generation (/v1/sound-generation) - Backend Monitor (/backend/monitor, /backend/shutdown) - Token Metrics (/tokenMetrics) - P2P endpoints (/api/p2p/* - 5 sub-endpoints) - System Info (/system, /version) Each documentation file includes HTTP method, request/response schemas, curl examples, sample JSON responses, and error codes. * docs: remove token-metrics endpoint documentation per review feedback The token-metrics endpoint is not wired into the HTTP router and should not be documented per reviewer request. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * docs: move system-info documentation to reference section Per review feedback, system-info endpoint docs are better suited for the reference section rather than features. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: localai-bot <localai-bot@noreply.github.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-08 17:59:33 +01:00
LocalAI [bot]	ec8f2d7683	fix: Correct Talk Interface screenshot reference in README.md (H6) (#8857 ) fix: correct Talk Interface screenshot reference in README.md (H6) Signed-off-by: localai-bot <localai-bot@noreply.github.com> Co-authored-by: localai-bot <localai-bot@noreply.github.com>	2026-03-08 17:58:59 +01:00
LocalAI [bot]	ff02e7ff5b	docs: clarify SECURITY.md version support table with specific ranges and EOL dates (#8861 ) * docs: clarify SECURITY.md version support table with specific ranges and EOL dates - Add detailed version support table with 3.x (actively supported), 2.x (security fixes until Dec 31, 2026), and 1.x (EOL since Jan 1, 2024) - Define what each support level means for users - Add migration guidance for users on older versions - Replace vague version ranges with specific, actionable information Signed-off-by: localai-bot <localai-bot@noreply.github.com> * Apply suggestions from code review Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: localai-bot <localai-bot@noreply.github.com> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-08 17:58:19 +01:00
LocalAI [bot]	85e4871d4d	chore: ⬆️ Update leejet/stable-diffusion.cpp to c8fb3d245858d495be1f140efdcfaa0d49de41e5 (#8841 ) * chore: ⬆️ update stable-diffusion.cpp to `c8fb3d245858d495be1f140efdcfaa0d49de41e5` Update stablediffusion-ggml to include fix for SD1 Pix2Pix issue (leejet/stable-diffusion.cpp#1329). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> Signed-off-by: localai-bot <localai-bot@noreply.github.com> * fix: address CI failures in stablediffusion update Signed-off-by: localai-bot <localai-bot@noreply.github.com> * fix: resolve remaining CI failures in stablediffusion update - Move flow_shift to global scope so gen_image() can access the value set during load_model() (was causing compilation error) - Fix sd_type_str array: TQ1_0 should be at index 34, TQ2_0 at index 35 to match upstream SD_TYPE_TQ1_0=34, SD_TYPE_TQ2_0=35 enum values Signed-off-by: localai-bot <localai-bot@noreply.github.com> Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Signed-off-by: localai-bot <localai-bot@noreply.github.com> Co-authored-by: localai-bot <localai-bot@noreply.github.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-08 09:53:08 +01:00
LocalAI [bot]	364ad30a2f	feat(downloader): add HF_MIRROR environment variable support (#8847 ) - Added HF_MIRROR env var to configure HuggingFace mirror URLs - HF_MIRROR takes precedence over HF_ENDPOINT for simpler mirror config - Supports both full URLs (https://hf-mirror.com) and simple hostnames (hf-mirror.com) - Auto-adds https:// if no scheme is provided - Also supports HF env var as an alias for HF_MIRROR Closes #8414 Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-03-08 09:34:44 +01:00
Ettore Di Giacinto	d21369ad7b	Update shell completion documentation URL Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-08 09:33:36 +01:00
LocalAI [bot]	efd402207c	feat: Add shell completion support for bash, zsh, and fish (#8851 ) feat: add shell completion support for bash, zsh, and fish - Add core/cli/completion.go with dynamic completion script generation - Add core/cli/completion_test.go with unit tests - Modify cmd/local-ai/main.go to support completion command - Modify core/cli/cli.go to add Completion subcommand - Add docs/content/features/shell-completion.md with installation instructions The completion scripts are generated dynamically from the Kong CLI model, so they automatically include all commands, subcommands, and flags. Co-authored-by: localai-bot <localai-bot@noreply.github.com>	2026-03-08 09:32:39 +01:00
LocalAI [bot]	6a928e70bc	docs: add Table of Contents to README.md (#8846 ) docs: add Table of Contents to README.md for easier navigation - Add collapsible TOC with anchor links to all major sections - Include H2 sections and important H3 subsections - Place TOC after main description, before Local Stack Family - Use proper markdown anchor link format Signed-off-by: localai-bot <localai-bot@noreply.github.com> Co-authored-by: localai-bot <localai-bot@noreply.github.com>	2026-03-08 09:32:26 +01:00
LocalAI [bot]	36c184175a	docs: Add comprehensive API error reference documentation (#8848 ) docs: add comprehensive API error reference documentation Document all error response formats (OpenAI, Anthropic, Open Responses), HTTP status codes, per-endpoint error scenarios, and client error handling examples based on actual error handling code in the codebase. Signed-off-by: localai-bot <localai-bot@noreply.github.com> Co-authored-by: localai-bot <localai-bot@noreply.github.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-08 08:54:51 +01:00
LocalAI [bot]	970079e68a	fix: Remove debug print statement from soundgeneration.go (C2) (#8843 ) fix: remove debug fmt.Printf statement from soundgeneration.go (#C2) Signed-off-by: localai-bot <localai-bot@noreply.github.com> Co-authored-by: localai-bot <localai-bot@noreply.github.com>	2026-03-08 08:49:29 +01:00
Weathercold	f347495de9	fix(qwen-tts): duplicate instruct argument in voice design mode (#8842 ) Don't pass instruct because it is added to kwargs Fixes the error `qwen_tts.inference.qwen3_tts_model.Qwen3TTSModel.generate_voice_design() got multiple values for keyword argument 'instruct'` Signed-off-by: Weathercold <weathercold.scr@proton.me>	2026-03-08 08:48:22 +01:00
LocalAI [bot]	23aa7aefed	chore(docs): Populate coding guidelines in CONTRIBUTING.md (#8840 ) docs: populate coding guidelines in CONTRIBUTING.md Signed-off-by: localai-bot <localai-bot@noreply.github.com> Co-authored-by: localai-bot <localai-bot@noreply.github.com>	2026-03-08 08:20:39 +01:00
LocalAI [bot]	1296167f84	chore: ⬆️ Update ggml-org/llama.cpp to `c5a778891ba0ddbd4cbb507c823f970595b1adc2` (#8837 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-07 23:28:06 +01:00
Ettore Di Giacinto	326f4bf4bc	chore(ci): drop voxcpm-cpu on aarm64 (torchcodec is not supported) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-07 22:24:56 +00:00
LocalAI [bot]	06125c338a	feat: update descriptions for first 9 models in gallery/index.yaml (#8831 ) feat: update descriptions for first 9 models in gallery/index.yaml from HuggingFace model cards - Updated qwen3.5-27b-claude-4.6-opus-reasoning-distilled-i1 with reasoning capabilities - Updated qwen3.5-4b-claude-4.6-opus-reasoning-distilled with reasoning capabilities - Updated q3.5-bluestar-27b with fine-tuned variant description - Updated qwen3.5-9b with multimodal capabilities - Updated qwen3.5-397b-a17b with large-scale model description - Updated qwen3.5-27b with performance-efficiency balance - Updated qwen3.5-122b-a10b with MoE architecture description - Updated qwen3.5-35b-a3b with MoE architecture description - Updated qwen3-next-80b-a3b-thinking with next-gen model description Descriptions sourced from HuggingFace model API metadata. Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-03-07 12:34:29 +01:00
LocalAI [bot]	73158600c8	chore(model gallery): 🤖 add 1 new models via gallery agent (#8830 ) chore(model gallery): 🤖 add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-07 09:17:30 +01:00
LocalAI [bot]	40b8f6270e	chore(model gallery): 🤖 add 1 new models via gallery agent (#8828 ) chore(model gallery): 🤖 add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-07 08:46:57 +01:00
Ettore Di Giacinto	ac48867b7d	feat: add agentic management (#8820 ) * feat: add standalone and agentic functionalities Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * expose agents via responses api Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-07 00:03:08 +01:00
LocalAI [bot]	e1df6807dc	chore: ⬆️ Update ggml-org/llama.cpp to `566059a26b0ce8faec4ea053605719d399c64cc5` (#8822 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-06 23:53:23 +01:00
LocalAI [bot]	ab315f2725	feat: Add LOCALAI_DISABLE_MCP environment variable to disable MCP support (#8816 ) * feat: Add LOCALAI_DISABLE_MCP environment variable to disable MCP support - Added DisableMCP field to RunCMD struct in core/cli/run.go - Added LOCALAI_DISABLE_MCP environment variable support - Added DisableMCP field to ApplicationConfig struct - Added DisableMCP AppOption function - Updated MCP endpoint routing to check appConfig.DisableMCP - When LOCALAI_DISABLE_MCP is set to true/1/yes, MCP endpoints are not registered When set, all MCP functionality is disabled and appropriate error messages are returned to users. Use Cases: - Security-conscious deployments where MCP is not needed - Reducing attack surface - Compliance requirements that prohibit certain protocol support Environment variable: LOCALAI_DISABLE_MCP=true Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> * docs: Add documentation for LOCALAI_DISABLE_MCP environment variable - Add section explaining how to disable MCP support using environment variable - Document use cases for disabling MCP - Provide examples for CLI and Docker usage Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> --------- Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-03-06 20:44:03 +01:00
BitToby	96efa4fce0	feat: add WebSocket mode support for the response api (#8676 ) * feat: add WebSocket mode support for the response api Signed-off-by: bittoby <218712309+bittoby@users.noreply.github.com> * test: add e2e tests for WebSocket Responses API Signed-off-by: bittoby <218712309+bittoby@users.noreply.github.com> --------- Signed-off-by: bittoby <218712309+bittoby@users.noreply.github.com>	2026-03-06 10:36:59 +00:00
Ettore Di Giacinto	e82b861961	fix(ui): do not lock all components during load Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-06 09:35:01 +01:00
LocalAI [bot]	6a15be377a	chore: Add LTX-2.3 model to gallery (#8805 ) feat: Add LTX-2.3 model to gallery - Add new entry for LTX-2.3 from Lightricks - Follows the same structure as existing LTX-2 entry - References: https://huggingface.co/Lightricks/LTX-2.3 Co-authored-by: localai-bot <localai-bot@example.com>	2026-03-06 09:22:31 +01:00
LocalAI [bot]	9e1b0d0c82	fix: Add timeout-based wait for model deletion completion (#8756 ) * fix: Add timeout-based wait for model deletion completion - Replace simple polling loop with context-based timeout (5 minutes) - Use select statement for cleaner timeout handling - Added proper logging for timeout case - This addresses the code review comment about using context with timeout instead of dangerous polling approach * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * fix: replace goto statements with break in model deletion loop (fixes CI compilation error) Signed-off-by: LocalAI [bot] <localai-bot@noreply.github.com> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Signed-off-by: LocalAI [bot] <localai-bot@noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: LocalAI [bot] <localai-bot@noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-06 01:07:15 +01:00
Ettore Di Giacinto	580517f9db	feat: pass-by metadata to predict options (#8795 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-05 22:50:10 +01:00
LocalAI [bot]	0cf7c18177	chore: ⬆️ Update ggml-org/llama.cpp to `a0ed91a442ea6b013bd42ebc3887a81792eaefa1` (#8797 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-05 22:49:45 +01:00
LocalAI [bot]	ac91413eb2	chore: ⬆️ Update ggml-org/whisper.cpp to `30c5194c9691e4e9a98b3dea9f19727397d3f46e` (#8796 ) ⬆️ Update ggml-org/whisper.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-05 22:49:32 +01:00
Ettore Di Giacinto	86680ff8bc	fix(ui): fix /app redirect Do not handle redirect individually, but serve the app directly in / Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-05 21:43:46 +00:00
Ettore Di Giacinto	09ddaf94b2	feat(ui): move to React for frontend (#8772 ) * feat(ui): move to React Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add import model Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * syntax highlight Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Minor fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-05 21:47:12 +01:00
LocalAI [bot]	61c139fa7d	feat: Rename 'Whisper' model type to 'STT' in UI (#8785 ) * feat: Rename 'Whisper' model type to 'STT' in UI - Updated models.html: Changed 'Whisper' filter button to 'STT' - Updated talk.html: Changed 'Whisper Model' to 'STT Model' - Updated backends.html: Changed 'Whisper' to 'STT' - Updated talk.js: Renamed getWhisperModel() to getSTTModel(), sendAudioToWhisper() to sendAudioToSTT(), and whisperModelSelect to sttModelSelect This change makes the UI more consistent with the model category naming, where all speech-to-text models (including Whisper, Parakeet, Moonshine, WhisperX, etc.) are grouped under the 'STT' (Speech-to-Text) category. Fixes #8776 Signed-off-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> * Rename whisperModelSelect to sttModelSelect in talk.html As requested by maintainer mudler in PR review, replacing all whisperModelSelect occurrences with sttModelSelect since the model type was renamed from Whisper to STT. Signed-off-by: LocalAI [bot] <localai-bot@users.noreply.github.com> --------- Signed-off-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> Signed-off-by: LocalAI [bot] <localai-bot@users.noreply.github.com> Co-authored-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> Co-authored-by: LocalAI [bot] <localai-bot@users.noreply.github.com>	2026-03-05 09:51:47 +01:00
LocalAI [bot]	9fc77909e0	fix: Add vllm-omni backend to video generation model detection (#8659 ) (#8781 ) fix: Add vllm-omni backend to video generation model detection - Include vllm-omni in the list of backends that support FLAG_VIDEO - This allows models like vllm-omni-wan2.2-t2v to appear in the video model selector UI - Fixes issue #8659 where video generation models using vllm-omni backend were not showing in the dropdown Co-authored-by: team-coding-agent-1 <team-coding-agent-1@localai.dev>	2026-03-05 01:04:47 +01:00
LocalAI [bot]	3dce20b026	docs: add autonomous development team section to README (#8780 ) * docs: add autonomous development team section to README - Add blog post link to Media, Blogs, Social section - Add new section about autonomous AI agent maintenance team - Include links to reports.localai.io and project board - Reference the experiment blog post * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-04 23:52:36 +01:00
LocalAI [bot]	f25e450414	chore: ⬆️ Update ggml-org/llama.cpp to `24d2ee052795063afffc9732465ca1b1c65f4a28` (#8777 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-04 23:25:48 +01:00
LocalAI [bot]	ee06892cd5	chore(model-gallery): ⬆️ update checksum (#8778 ) ⬆️ Checksum updates in gallery/index.yaml Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-04 22:02:14 +01:00
Ettore Di Giacinto	c25dfcc9b4	Update model and OPENAI_MODE in gallery-agent.yaml Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-04 18:28:53 +01:00
Ettore Di Giacinto	016738a787	Remove descriptions from model entries in index.yaml Removed model descriptions for several entries in the gallery. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-04 18:26:45 +01:00
LocalAI [bot]	2938fe5cad	chore(model gallery): 🤖 add 1 new models via gallery agent (#8770 ) chore(model gallery): 🤖 add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-04 15:55:08 +01:00
Andres	454d8adc76	feat(qwen-tts): Support using multiple voices (#8757 ) * Add support for multiple voice clones in Qwen TTS Signed-off-by: Andres Smith <andressmithdev@pm.me> * Add voice prompt caching and generation logs to see generation time --------- Signed-off-by: Andres Smith <andressmithdev@pm.me> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-04 09:47:21 +01:00
LocalAI [bot]	6002c940a9	chore: ⬆️ Update ggml-org/llama.cpp to `ecd99d6a9acbc436bad085783bcd5d0b9ae9e9e9` (#8762 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-04 08:08:37 +01:00
Ettore Di Giacinto	8e6fe4531e	chore(ci): update environment variable for external backend Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-03 22:12:37 +01:00
Ettore Di Giacinto	5203fb37a6	fix(ci): remove erroneus abspath call Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-03 19:07:39 +01:00
LocalAI [bot]	eb2a656575	fix: return full embedding dimensions instead of truncating trailing zeros (#8721 ) (#8755 ) fix: return full embedding dimensions instead of truncating trailing zeros - Remove the logic that strips trailing zeros from embeddings - Trailing zeros may be valid values in some embedding models - This fixes the issue where embeddings like jina-v3 returned only 1/4 of their native dimensions (256 instead of 1024) - The truncation was causing vector database dimension mismatch errors - Fixes issue #8721 Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-03-03 17:08:16 +01:00
LocalAI [bot]	6e5a58ca70	feat: Add Free RPC to backend.proto for VRAM cleanup (#8751 ) * fix: Add VRAM cleanup when stopping models - Add Free() method to AIModel interface for proper GPU resource cleanup - Implement Free() in llama backend to release llama.cpp model resources - Add Free() stub implementations in base and SingleThread backends - Modify deleteProcess() to call Free() before stopping the process to ensure VRAM is properly released when models are unloaded Fixes issue where VRAM was not freed when stopping models, which could lead to memory exhaustion when running multiple models sequentially. * feat: Add Free RPC to backend.proto for VRAM cleanup\n\n- Add rpc Free(HealthMessage) returns (Result) {} to backend.proto\n- This RPC is required to properly expose the Free() method\n through the gRPC interface for VRAM resource cleanup\n\nRefs: PR #8739 * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-03 12:39:06 +01:00
Ettore Di Giacinto	1c8db3846d	chore(faster-qwen3-tts): Add anyio to requirements.txt Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-03 09:43:29 +01:00

1 2 3 4 5 ...

5718 Commits