* fix: Add timeout-based wait for model deletion completion
- Replace the simple polling loop with a context-based timeout (5 minutes)
- Use a select statement for cleaner timeout handling
- Add proper logging for the timeout case
- Addresses the code review comment asking for a context with timeout instead of the dangerous polling approach (see the sketch below)
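A minimal sketch of the resulting pattern, assuming a hypothetical modelStillExists check (the real deletion check lives in LocalAI's model loader):

    package main

    import (
        "context"
        "log"
        "time"
    )

    // waitForModelDeletion polls until the model is gone or the
    // 5-minute context deadline fires; modelStillExists is a
    // hypothetical stand-in for the real deletion check.
    func waitForModelDeletion(name string, modelStillExists func(string) bool) {
        ctx, cancel := context.WithTimeout(context.Background(), 5*time.Minute)
        defer cancel()

        ticker := time.NewTicker(time.Second)
        defer ticker.Stop()

        for {
            select {
            case <-ctx.Done():
                log.Printf("timed out waiting for model %q deletion", name)
                return
            case <-ticker.C:
                if !modelStillExists(name) {
                    return
                }
            }
        }
    }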
* Apply suggestion from @mudler
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
* fix: replace goto statements with break in model deletion loop (fixes CI compilation error)
Signed-off-by: LocalAI [bot] <localai-bot@noreply.github.com>
* Apply suggestion from @mudler
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
---------
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
Signed-off-by: LocalAI [bot] <localai-bot@noreply.github.com>
Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>
Co-authored-by: LocalAI [bot] <localai-bot@noreply.github.com>
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
* feat: Rename 'Whisper' model type to 'STT' in UI
- Updated models.html: Changed 'Whisper' filter button to 'STT'
- Updated talk.html: Changed 'Whisper Model' to 'STT Model'
- Updated backends.html: Changed 'Whisper' to 'STT'
- Updated talk.js: Renamed getWhisperModel() to getSTTModel(),
sendAudioToWhisper() to sendAudioToSTT(), and whisperModelSelect to sttModelSelect
This change makes the UI more consistent with the model category naming,
where all speech-to-text models (including Whisper, Parakeet, Moonshine,
WhisperX, etc.) are grouped under the 'STT' (Speech-to-Text) category.
Fixes #8776
Signed-off-by: team-coding-agent-1 <team-coding-agent-1@localai.dev>
* Rename whisperModelSelect to sttModelSelect in talk.html
As requested by maintainer mudler in the PR review, replace all
whisperModelSelect occurrences with sttModelSelect, since the
model type was renamed from Whisper to STT.
Signed-off-by: LocalAI [bot] <localai-bot@users.noreply.github.com>
---------
Signed-off-by: team-coding-agent-1 <team-coding-agent-1@localai.dev>
Signed-off-by: LocalAI [bot] <localai-bot@users.noreply.github.com>
Co-authored-by: team-coding-agent-1 <team-coding-agent-1@localai.dev>
Co-authored-by: LocalAI [bot] <localai-bot@users.noreply.github.com>
fix: Add vllm-omni backend to video generation model detection
- Include vllm-omni in the list of backends that support FLAG_VIDEO
- This allows models like vllm-omni-wan2.2-t2v to appear in the video model selector UI
- Fixes issue #8659 where video generation models using vllm-omni backend were not showing in the dropdown
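The change boils down to extending a backend allow-list; a simplified sketch (identifiers are illustrative, not the actual LocalAI names):

    package main

    // videoBackends lists backends assumed to support FLAG_VIDEO;
    // the fix adds "vllm-omni" to the equivalent list in LocalAI.
    var videoBackends = []string{"diffusers", "vllm-omni"}

    func supportsVideo(backend string) bool {
        for _, b := range videoBackends {
            if backend == b {
                return true
            }
        }
        return false
    }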
Co-authored-by: team-coding-agent-1 <team-coding-agent-1@localai.dev>
fix: return full embedding dimensions instead of truncating trailing zeros
- Remove the logic that strips trailing zeros from embeddings
- Trailing zeros may be valid values in some embedding models
- This fixes the issue where embeddings like jina-v3 returned
only 1/4 of their native dimensions (256 instead of 1024)
- The truncation was causing vector database dimension mismatch errors
- Fixes issue #8721
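For context, the removed trimming behaved roughly like this sketch; on a 1024-dimensional embedding whose tail happens to be zero it silently returns a shorter vector:

    package main

    import "fmt"

    // trimTrailingZeros reproduces the kind of logic that was removed:
    // cutting trailing zeros silently shrinks the embedding dimension.
    func trimTrailingZeros(emb []float32) []float32 {
        i := len(emb)
        for i > 0 && emb[i-1] == 0 {
            i--
        }
        return emb[:i]
    }

    func main() {
        emb := make([]float32, 1024) // e.g. jina-v3 native dimension
        copy(emb, []float32{0.1, 0.2, 0.3})
        fmt.Println(len(emb), len(trimTrailingZeros(emb))) // 1024 vs 3
    }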
Signed-off-by: localai-bot <localai-bot@users.noreply.github.com>
Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>
When a model is configured with 'known_usecases: [rerank]' in the YAML
config, the reranking endpoint was not being matched because:
1. The GuessUsecases function only checked for backend == 'rerankers'
2. The syncKnownUsecasesFromString() was not being called when loading
configs via yaml.Unmarshal in readModelConfigsFromFile
This fix:
1. Updates GuessUsecases to also check if Reranking is explicitly set to
true in the model config (in addition to checking backend type)
2. Adds syncKnownUsecasesFromString() calls after yaml.Unmarshal in
readModelConfigsFromFile to ensure known_usecases are properly parsed
Fixes #8658
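A simplified sketch of the updated check (struct and field names are illustrative, not LocalAI's actual config types):

    package main

    type ModelConfig struct {
        Backend   string
        Reranking bool
    }

    // supportsRerank now honours an explicit reranking flag in the
    // model config in addition to the backend name.
    func supportsRerank(c ModelConfig) bool {
        return c.Backend == "rerankers" || c.Reranking
    }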
Signed-off-by: localai-bot <localai-bot@users.noreply.github.com>
Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>
fix: Implement responsive line wrapping for model names on home page
- Changed model name display from truncate to break-words
- Increased max-width from 100px to 200px to allow more text
- This fixes issue #8209 for responsive text wrapping on smaller screens
Fixes: #8209
Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>
* debug
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* retry instead of re-computing a response
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Add model storage size display and RAM warning in Models tab
- Backend (ui_api.go):
- Added getDirectorySize() helper function to calculate total size of model files
- Added storageSize, ramTotal, ramUsed, ramUsagePercent to /api/models endpoint response
- Uses xsysinfo.GetSystemRAMInfo() for RAM information
- Frontend (models.html):
- Added storageSize, ramTotal, ramUsed, ramUsagePercent to Alpine.js data object
- Added formatBytes() helper for human-readable byte formatting
- Display storage size in hero header with blue indicator
- Show warning banner when storage exceeds RAM (model too large for system)
Addresses: https://github.com/mudler/LocalAI/issues/6251
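A plausible shape for the new helper (sketch only; the real implementation is in ui_api.go):

    package main

    import (
        "io/fs"
        "path/filepath"
    )

    // getDirectorySize sums the sizes of all regular files under root;
    // error handling is simplified for this sketch.
    func getDirectorySize(root string) (int64, error) {
        var total int64
        err := filepath.WalkDir(root, func(path string, d fs.DirEntry, walkErr error) error {
            if walkErr != nil {
                return walkErr
            }
            if d.Type().IsRegular() {
                info, err := d.Info()
                if err != nil {
                    return err
                }
                total += info.Size()
            }
            return nil
        })
        return total, err
    }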
Signed-off-by: localai-bot <localai-bot@users.noreply.github.com>
Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>
fix(video): initialize model selection dropdown with current model value
The Alpine.js link variable was starting empty, causing the dropdown
selection to not reflect the currently selected model. This fix initializes
the link variable with the current model value from the template (e.g.,
video/{{.Model}}), following the same pattern used in image.html.
Signed-off-by: localai-bot <localai-bot@users.noreply.github.com>
Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>
When a backend download fails (e.g., on macOS with port conflicts causing
connection issues), the backend directory is left with partial files.
This causes subsequent installation attempts to fail with 'run file not
found' because the sanity check runs on an empty/partial directory.
This fix cleans up the backend directory when the initial download fails
before attempting fallback URIs or mirrors. This ensures a clean state
for retry attempts.
Fixes: #8016
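The flow, roughly (download and tryFallbacks are hypothetical stand-ins for the actual gallery functions):

    package main

    import (
        "fmt"
        "os"
    )

    func installBackend(dir string, download, tryFallbacks func(string) error) error {
        if err := download(dir); err != nil {
            // Remove partial files so the sanity check on a retry does
            // not run against a half-written directory.
            if rmErr := os.RemoveAll(dir); rmErr != nil {
                return fmt.Errorf("download failed (%v), cleanup failed: %w", err, rmErr)
            }
            return tryFallbacks(dir)
        }
        return nil
    }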
Signed-off-by: localai-bot <localai-bot@users.noreply.github.com>
Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>
* fix(gallery): add fallback URI resolution for backend installation
When a backend installation fails (e.g., due to missing 'latest-' tag),
try fallback URIs in order:
1. Replace 'latest-' with 'master-' in the URI
2. If that fails, append '-development' to the backend name
This fixes the issue where backend index entries don't match the
repository tags. For example, installing 'ace-step' tries to download
'latest-gpu-nvidia-cuda-13-ace-step' but only 'master-gpu-nvidia-cuda-13-ace-step'
exists in the quay.io registry.
Fixes: #8437
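A sketch of the fallback derivation (the helper name is hypothetical; per the follow-up commit below, the actual patterns are configurable via env vars):

    package main

    import "strings"

    // fallbackURIs derives alternative image URIs to try when the
    // primary tag is missing from the registry.
    func fallbackURIs(uri, backendName string) []string {
        var out []string
        if strings.Contains(uri, "latest-") {
            out = append(out, strings.Replace(uri, "latest-", "master-", 1))
        }
        out = append(out, strings.Replace(uri, backendName, backendName+"-development", 1))
        return out
    }

For the ace-step example above, this yields the master- variant first, then the -development variant.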
Signed-off-by: localai-bot <139863280+localai-bot@users.noreply.github.com>
* chore(gallery): make fallback URI patterns configurable via env vars
---------
Signed-off-by: localai-bot <139863280+localai-bot@users.noreply.github.com>
Closes #8119
When installing models from the gallery, files are created with 0600
permissions (owner read/write only), making them unreadable by the
LocalAI server when running as a different user.
This fix changes the permissions to 0644 (owner read/write, group/others
read), allowing the server to read model files regardless of the user
it runs as.
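In Go terms the change is a one-constant fix when creating the downloaded files (sketch):

    package main

    import "os"

    // createModelFile opens gallery downloads with 0644 instead of
    // 0600 so other users (e.g. the server process) can read them.
    func createModelFile(path string) (*os.File, error) {
        return os.OpenFile(path, os.O_CREATE|os.O_WRONLY|os.O_TRUNC, 0o644)
    }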
Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>
fix: reload model configuration after editing (issue #8647)
- Add *model.ModelLoader parameter to EditModelEndpoint
- Call ml.ShutdownModel() after saving config to unload the running model
- Model will be reloaded on next inference request with new settings (e.g., context_size)
- Update route registration to pass ml to EditModelEndpoint
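Sketched flow of the endpoint after the change (the interface is a stand-in for *model.ModelLoader; ShutdownModel is the method named above):

    package main

    // ModelLoader is a stand-in for LocalAI's *model.ModelLoader.
    type ModelLoader interface {
        ShutdownModel(name string) error
    }

    func editModel(ml ModelLoader, name string, saveConfig func() error) error {
        if err := saveConfig(); err != nil {
            return err
        }
        // Unload the running instance so the next inference request
        // reloads it with the new settings (e.g. context_size).
        return ml.ShutdownModel(name)
    }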
Signed-off-by: localai-bot <localai-bot@users.noreply.github.com>
Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>
* fix(realtime): Wrap functions in OpenAI chat completions format
Signed-off-by: Richard Palethorpe <io@richiejp.com>
* feat(realtime): Set max tokens from session object
Signed-off-by: Richard Palethorpe <io@richiejp.com>
* fix(realtime): Find thinking start tag for thinking extraction
Signed-off-by: Richard Palethorpe <io@richiejp.com>
* fix(realtime): Don't send buffer cleared message when we automatically drop it
Signed-off-by: Richard Palethorpe <io@richiejp.com>
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com>
* fix: ensure proper watchdog shutdown and state passing between restarts
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* fix: add missing watchdog settings
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* fix: untrack model if we shut it down successfully
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
feat(realtime): Allow sending text and image conversation items
Signed-off-by: Richard Palethorpe <io@richiejp.com>
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
* fix(realtime): Use locked websocket for concurrent access
Signed-off-by: Richard Palethorpe <io@richiejp.com>
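Gorilla-style websocket connections tolerate only one concurrent writer, so the usual fix is a mutex-guarded wrapper; a sketch under that assumption (Conn is a stand-in interface, not the actual type used here):

    package main

    import "sync"

    type Conn interface {
        WriteJSON(v any) error
    }

    // lockedConn serializes writes so multiple goroutines can share
    // one websocket connection safely.
    type lockedConn struct {
        mu sync.Mutex
        c  Conn
    }

    func (l *lockedConn) WriteJSON(v any) error {
        l.mu.Lock()
        defer l.mu.Unlock()
        return l.c.WriteJSON(v)
    }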
* fix(realtime): Use sample rate set in session
Signed-off-by: Richard Palethorpe <io@richiejp.com>
* fix(config): Allow pipelines to have no model parameters
Signed-off-by: Richard Palethorpe <io@richiejp.com>
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com>
* fix(realtime): Use the voice provided by the user or none at all
Signed-off-by: Richard Palethorpe <io@richiejp.com>
* fix(ui,config): Allow pipeline models to have no backend and use same validation in frontend
Signed-off-by: Richard Palethorpe <io@richiejp.com>
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com>
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
User-supplied URLs passed to GetContentURIAsBase64() and downloadFile()
were fetched without validation, allowing SSRF attacks against internal
services. Added URL validation that blocks private IPs, loopback,
link-local, and cloud metadata endpoints before fetching.
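A condensed sketch of such a check (hypothetical helper; the link-local test is what covers metadata endpoints like 169.254.169.254):

    package main

    import (
        "fmt"
        "net"
        "net/url"
    )

    // validateURL resolves the host and rejects private, loopback and
    // link-local destinations before any fetch is attempted.
    func validateURL(raw string) error {
        u, err := url.Parse(raw)
        if err != nil {
            return err
        }
        ips, err := net.LookupIP(u.Hostname())
        if err != nil {
            return err
        }
        for _, ip := range ips {
            if ip.IsLoopback() || ip.IsPrivate() || ip.IsLinkLocalUnicast() {
                return fmt.Errorf("blocked address %s for %s", ip, raw)
            }
        }
        return nil
    }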
Co-authored-by: kolega.dev <faizan@kolega.ai>