LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-07-01 03:46:41 -04:00

Author	SHA1	Message	Date
Ettore Di Giacinto	b2f81bfa2e	feat(functions): add peg-based parsing and allow backends to return tool calls directly (#8838 ) * feat(functions): add peg-based parsing Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: support returning toolcalls directly from backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: do run PEG only if backend didn't send deltas Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-08 22:21:57 +01:00
LocalAI [bot]	05b7cce633	feat: add Events column to Agents list page (#8870 ) - Add 'Events' column header between 'Status' and 'Actions' - Fetch observable counts for each agent using /api/agents/<name>/observables - Display events count as clickable link navigating to agent status page - Events count updates every 5 seconds with agent refresh interval - Shows '0' if API call fails for an agent Co-authored-by: localai-bot <localai-bot@noreply.github.com>	2026-03-08 21:15:29 +01:00
Ettore Di Giacinto	ac48867b7d	feat: add agentic management (#8820 ) * feat: add standalone and agentic functionalities Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * expose agents via responses api Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-07 00:03:08 +01:00
LocalAI [bot]	ab315f2725	feat: Add LOCALAI_DISABLE_MCP environment variable to disable MCP support (#8816 ) * feat: Add LOCALAI_DISABLE_MCP environment variable to disable MCP support - Added DisableMCP field to RunCMD struct in core/cli/run.go - Added LOCALAI_DISABLE_MCP environment variable support - Added DisableMCP field to ApplicationConfig struct - Added DisableMCP AppOption function - Updated MCP endpoint routing to check appConfig.DisableMCP - When LOCALAI_DISABLE_MCP is set to true/1/yes, MCP endpoints are not registered When set, all MCP functionality is disabled and appropriate error messages are returned to users. Use Cases: - Security-conscious deployments where MCP is not needed - Reducing attack surface - Compliance requirements that prohibit certain protocol support Environment variable: LOCALAI_DISABLE_MCP=true Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> * docs: Add documentation for LOCALAI_DISABLE_MCP environment variable - Add section explaining how to disable MCP support using environment variable - Document use cases for disabling MCP - Provide examples for CLI and Docker usage Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> --------- Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-03-06 20:44:03 +01:00
BitToby	96efa4fce0	feat: add WebSocket mode support for the response api (#8676 ) * feat: add WebSocket mode support for the response api Signed-off-by: bittoby <218712309+bittoby@users.noreply.github.com> * test: add e2e tests for WebSocket Responses API Signed-off-by: bittoby <218712309+bittoby@users.noreply.github.com> --------- Signed-off-by: bittoby <218712309+bittoby@users.noreply.github.com>	2026-03-06 10:36:59 +00:00
Ettore Di Giacinto	e82b861961	fix(ui): do not lock all components during load Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-06 09:35:01 +01:00
LocalAI [bot]	9e1b0d0c82	fix: Add timeout-based wait for model deletion completion (#8756 ) * fix: Add timeout-based wait for model deletion completion - Replace simple polling loop with context-based timeout (5 minutes) - Use select statement for cleaner timeout handling - Added proper logging for timeout case - This addresses the code review comment about using context with timeout instead of dangerous polling approach * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * fix: replace goto statements with break in model deletion loop (fixes CI compilation error) Signed-off-by: LocalAI [bot] <localai-bot@noreply.github.com> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Signed-off-by: LocalAI [bot] <localai-bot@noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: LocalAI [bot] <localai-bot@noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-06 01:07:15 +01:00
Ettore Di Giacinto	580517f9db	feat: pass-by metadata to predict options (#8795 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-05 22:50:10 +01:00
Ettore Di Giacinto	86680ff8bc	fix(ui): fix /app redirect Do not handle redirect individually, but serve the app directly in / Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-05 21:43:46 +00:00
Ettore Di Giacinto	09ddaf94b2	feat(ui): move to React for frontend (#8772 ) * feat(ui): move to React Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add import model Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * syntax highlight Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Minor fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-05 21:47:12 +01:00
LocalAI [bot]	61c139fa7d	feat: Rename 'Whisper' model type to 'STT' in UI (#8785 ) * feat: Rename 'Whisper' model type to 'STT' in UI - Updated models.html: Changed 'Whisper' filter button to 'STT' - Updated talk.html: Changed 'Whisper Model' to 'STT Model' - Updated backends.html: Changed 'Whisper' to 'STT' - Updated talk.js: Renamed getWhisperModel() to getSTTModel(), sendAudioToWhisper() to sendAudioToSTT(), and whisperModelSelect to sttModelSelect This change makes the UI more consistent with the model category naming, where all speech-to-text models (including Whisper, Parakeet, Moonshine, WhisperX, etc.) are grouped under the 'STT' (Speech-to-Text) category. Fixes #8776 Signed-off-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> * Rename whisperModelSelect to sttModelSelect in talk.html As requested by maintainer mudler in PR review, replacing all whisperModelSelect occurrences with sttModelSelect since the model type was renamed from Whisper to STT. Signed-off-by: LocalAI [bot] <localai-bot@users.noreply.github.com> --------- Signed-off-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> Signed-off-by: LocalAI [bot] <localai-bot@users.noreply.github.com> Co-authored-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> Co-authored-by: LocalAI [bot] <localai-bot@users.noreply.github.com>	2026-03-05 09:51:47 +01:00
Ettore Di Giacinto	8e6fe4531e	chore(ci): update environment variable for external backend Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-03 22:12:37 +01:00
LocalAI [bot]	eca2c6e01c	fix: Implement responsive line wrapping for model names (#8209 ) (#8720 ) fix: Implement responsive line wrapping for model names on home page - Changed model name display from truncate to break-words - Increased max-width from 100px to 200px to allow more text - This fixes issue #8209 for responsive text wrapping on smaller screens Fixes: #8209 Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-03-02 13:54:58 +01:00
Ettore Di Giacinto	c7c4a20a9e	fix: retry when LLM returns empty messages (#8704 ) * debug Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * retry instead of re-computing a response Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-01 21:32:38 +01:00
Ettore Di Giacinto	983db7bedc	feat(ui): add model size estimation (#8684 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-28 23:03:47 +01:00
LocalAI [bot]	b10443ab5a	feat(models): add model storage size display and RAM warning (#8675 ) Add model storage size display and RAM warning in Models tab - Backend (ui_api.go): - Added getDirectorySize() helper function to calculate total size of model files - Added storageSize, ramTotal, ramUsed, ramUsagePercent to /api/models endpoint response - Uses xsysinfo.GetSystemRAMInfo() for RAM information - Frontend (models.html): - Added storageSize, ramTotal, ramUsed, ramUsagePercent to Alpine.js data object - Added formatBytes() helper for human-readable byte formatting - Display storage size in hero header with blue indicator - Show warning banner when storage exceeds RAM (model too large for system) Addresses: https://github.com/mudler/LocalAI/issues/6251 Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-02-28 22:05:01 +01:00
LocalAI [bot]	b647b6caf1	fix: properly sync model selection dropdown in video generation UI (#8680 ) fix(video): initialize model selection dropdown with current model value The Alpine.js link variable was starting empty, causing the dropdown selection to not reflect the currently selected model. This fix initializes the link variable with the current model value from the template (e.g., video/{{.Model}}), following the same pattern used in image.html. Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-02-28 13:11:33 +01:00
Ettore Di Giacinto	657ba8cdad	fix(chat): do not send thinking/reasoning messages to the LLM (#8656 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-26 00:06:35 +01:00
LocalAI [bot]	1027c487a6	fix: reload model after editing YAML config (issue #8647 ) (#8652 ) fix: reload model configuration after editing (issue #8647) - Add *model.ModelLoader parameter to EditModelEndpoint - Call ml.ShutdownModel() after saving config to unload the running model - Model will be reloaded on next inference request with new settings (e.g., context_size) - Update route registration to pass ml to EditModelEndpoint Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-02-25 22:18:42 +01:00
Copilot	3ac7301f31	Add `sample_rate` support to TTS API via post-processing resampling (#8650 ) * Initial plan * Add TTS sample_rate support via AudioResample post-processing Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-02-25 16:36:27 +01:00
LocalAI [bot]	36ff2a0138	fix(webui): use different icon for System nav item (#8648 ) Change the System nav item icon from fas fa-server to fas fa-desktop to distinguish it from the Backends nav item which still uses fa-server. Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-02-24 17:10:58 +01:00
Richard Palethorpe	be84b1d258	feat(traces): Use accordian instead of pop-ups (#8626 ) Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-02-23 13:07:41 +01:00
Andres	cbedcc9091	fix(api): Downgrade health/readiness check to debug (#8625 ) Downgrade health/readiness check to debug Signed-off-by: Andres Smith <andressmithdev@pm.me>	2026-02-23 11:58:04 +01:00
Richard Palethorpe	b1b67b973e	fix(realtime): Add functions to conversation history (#8616 ) Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-02-21 19:03:49 +01:00
Ettore Di Giacinto	51902df7ba	fix: merge openresponses messages (#8615 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-21 09:56:43 +01:00
Richard Palethorpe	51eec4e6b8	feat(traces): Add backend traces (#8609 ) Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-02-20 23:47:33 +01:00
Ettore Di Giacinto	352b8aaa1b	fix(ui): pass by needed values to unbreak model editor Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-20 09:06:17 +01:00
Ettore Di Giacinto	df792d6243	chore(ui): improve navigation and buttons placement (#8608 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-19 23:41:05 +01:00
Ettore Di Giacinto	b471619ad9	chore(deps): bump cogito and add new options to the agent config (#8601 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-18 22:10:26 +01:00
Ettore Di Giacinto	a2228f1418	fix(ui): improve view on mobile (#8598 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-18 19:50:59 +01:00
Ettore Di Giacinto	7dd9a155a3	fix(ui): drop duplicated footer import Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-18 14:38:53 +01:00
Richard Palethorpe	4fe830ff58	fix(realtime): Limit buffer sizes to prevent DoS (#8596 ) Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-02-18 14:36:43 +01:00
Richard Palethorpe	86b3bc9313	fix(realtime): Better support for thinking models and setting model parameters (#8595 ) * fix(realtime): Wrap functions in OpenAI chat completions format Signed-off-by: Richard Palethorpe <io@richiejp.com> * feat(realtime): Set max tokens from session object Signed-off-by: Richard Palethorpe <io@richiejp.com> * fix(realtime): Find thinking start tag for thinking extraction Signed-off-by: Richard Palethorpe <io@richiejp.com> * fix(realtime): Don't send buffer cleared message when we automatically drop it Signed-off-by: Richard Palethorpe <io@richiejp.com> --------- Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-02-18 14:36:16 +01:00
Ettore Di Giacinto	2fabdc08e6	feat(ui): left navbar, dark/light theme (#8594 ) * feat(ui): left navbar, dark/light theme Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * darker background Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-18 00:14:39 +01:00
Ettore Di Giacinto	1c4e5aa5c0	chore: bump cogito (#8568 ) Adapt to new API and drop call to Ask() Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-14 22:52:22 +01:00
Richard Palethorpe	5bdbb10593	fix(realtime): Send proper image data to backend (#8547 ) * fix(realtime): Allow empty parameters Signed-off-by: Richard Palethorpe <io@richiejp.com> * fix(realtime): Just pass base64 string to backend Signed-off-by: Richard Palethorpe <io@richiejp.com> --------- Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-02-13 18:01:07 +01:00
Richard Palethorpe	f6c80a6987	feat(realtime): Allow sending text, image and audio conversation items" (#8524 ) feat(realtime): Allow sending text and image conversation items Signed-off-by: Richard Palethorpe <io@richiejp.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-02-12 19:33:46 +00:00
LocalAI [bot]	b10b85de52	chore: improve log levels verbosity (#8528 ) * chore: init for PR * feat: improve log verbosity per #8449 - demote /api/resources to DEBUG, elevate job events to INFO --------- Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-02-12 16:24:46 +01:00
Richard Palethorpe	1479bee894	fix(realtime): Sampling and websocket locking (#8521 ) * fix(realtime): Use locked websocket for concurrent access Signed-off-by: Richard Palethorpe <io@richiejp.com> * fix(realtime): Use sample rate set in session Signed-off-by: Richard Palethorpe <io@richiejp.com> * fix(config): Allow pipelines to have no model parameters Signed-off-by: Richard Palethorpe <io@richiejp.com> --------- Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-02-12 13:57:34 +01:00
Richard Palethorpe	7270a98ce5	fix(realtime): Use user provided voice and allow pipeline models to have no backend (#8415 ) * fix(realtime): Use the voice provided by the user or none at all Signed-off-by: Richard Palethorpe <io@richiejp.com> * fix(ui,config): Allow pipeline models to have no backend and use same validation in frontend Signed-off-by: Richard Palethorpe <io@richiejp.com> --------- Signed-off-by: Richard Palethorpe <io@richiejp.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-02-11 14:18:05 +01:00
Kolega.dev	780877d1d0	security: validate URLs to prevent SSRF in content fetching endpoints (#8476 ) User-supplied URLs passed to GetContentURIAsBase64() and downloadFile() were fetched without validation, allowing SSRF attacks against internal services. Added URL validation that blocks private IPs, loopback, link-local, and cloud metadata endpoints before fetching. Co-authored-by: kolega.dev <faizan@kolega.ai>	2026-02-10 15:14:14 +01:00
Ettore Di Giacinto	a849f285a5	chore(tests): add audio/wav to expected wav file	2026-02-05 20:27:06 +00:00
Ettore Di Giacinto	697f6aa71c	feat(audio): set audio content type (#8416 ) * feat(audio): set audio content type Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: add tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-05 19:14:12 +01:00
Ettore Di Giacinto	53276d28e7	feat(musicgen): add ace-step and UI interface (#8396 ) * feat(musicgen): add ace-step and UI interface Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Correctly handle model dir Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop auto-download Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add to models, fixup UIs icons Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * l4t13 is incompatbile Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * avoid pinning version for cuda12 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop l4t12 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-05 12:04:53 +01:00
Richard Palethorpe	5195062e12	fix(realtime): Include noAction function in prompt template and handle tool_choice (#8372 ) The realtime endpoint was not passing the noAction "answer" function to the model in the prompt template, causing the model to always call user-provided tools even when a direct response was appropriate. Root cause: - User tools were added to the funcs list - TemplateMessages() was called to generate the prompt - noAction function was only added AFTER templating - This meant the prompt didn't include the "answer" function, even though the grammar did Fix: - Move noAction function creation before TemplateMessages() call so it's included in both the prompt and grammar - Add proper tool_choice parameter handling to support "auto", "required", "none", and specific function selection - Match behavior of the standard chat endpoint 💘 Generated with Crush Assisted-by: Claude Sonnet 4.5 via Crush <crush@charm.land> Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-02-03 14:30:37 +01:00
Alex O'Connell	b7585ca738	fix(api): Add missing field in initial OpenAI streaming response (#8341 ) Add missing field in initial OpenAI streaming response Signed-off-by: Alex O'Connell <35843486+acon96@users.noreply.github.com>	2026-02-02 08:30:04 +01:00
Andres	b6459ddd57	feat(api): Add transcribe response format request parameter & adjust STT backends (#8318 ) * WIP response format implementation for audio transcriptions (cherry picked from commit e271dd764bbc13846accf3beb8b6522153aa276f) Signed-off-by: Andres Smith <andressmithdev@pm.me> * Rework transcript response_format and add more formats (cherry picked from commit 6a93a8f63e2ee5726bca2980b0c9cf4ef8b7aeb8) Signed-off-by: Andres Smith <andressmithdev@pm.me> * Add test and replace go-openai package with official openai go client (cherry picked from commit f25d1a04e46526429c89db4c739e1e65942ca893) Signed-off-by: Andres Smith <andressmithdev@pm.me> * Fix faster-whisper backend and refactor transcription formatting to also work on CLI Signed-off-by: Andres Smith <andressmithdev@pm.me> (cherry picked from commit 69a93977d5e113eb7172bd85a0f918592d3d2168) Signed-off-by: Andres Smith <andressmithdev@pm.me> --------- Signed-off-by: Andres Smith <andressmithdev@pm.me> Co-authored-by: nanoandrew4 <nanoandrew4@gmail.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-02-01 17:33:17 +01:00
Ettore Di Giacinto	397f7f0862	fix(ui): take account of reasoning in token count calculation (#8324 ) We were skipping reasoning traces when counting tokens, yielding to a wrong sum count. Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-01 10:48:31 +01:00
Ettore Di Giacinto	4077aaf978	chore: re-enable e2e tests, fixups anthropic API tools support (#8296 ) * chore(tests): add mock backend e2e tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixup anthropic tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * prepare e2e tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop repetitive tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop specific CI workflow Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixup anthropic issues, move all e2e tests to use mocked backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-30 12:41:50 +01:00
Ettore Di Giacinto	68dd9765a0	feat(tts): add support for streaming mode (#8291 ) * feat(tts): add support for streaming mode Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Send first audio, make sure it's 16 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-30 11:58:01 +01:00

1 2 3 4 5 ...

392 Commits