LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-07-05 13:57:28 -04:00

Author	SHA1	Message	Date
Ettore Di Giacinto	c25dfcc9b4	Update model and OPENAI_MODE in gallery-agent.yaml Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-04 18:28:53 +01:00
Ettore Di Giacinto	016738a787	Remove descriptions from model entries in index.yaml Removed model descriptions for several entries in the gallery. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-04 18:26:45 +01:00
LocalAI [bot]	2938fe5cad	chore(model gallery): 🤖 add 1 new models via gallery agent (#8770 ) chore(model gallery): 🤖 add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-04 15:55:08 +01:00
Andres	454d8adc76	feat(qwen-tts): Support using multiple voices (#8757 ) * Add support for multiple voice clones in Qwen TTS Signed-off-by: Andres Smith <andressmithdev@pm.me> * Add voice prompt caching and generation logs to see generation time --------- Signed-off-by: Andres Smith <andressmithdev@pm.me> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-04 09:47:21 +01:00
LocalAI [bot]	6002c940a9	chore: ⬆️ Update ggml-org/llama.cpp to `ecd99d6a9acbc436bad085783bcd5d0b9ae9e9e9` (#8762 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-04 08:08:37 +01:00
Ettore Di Giacinto	8e6fe4531e	chore(ci): update environment variable for external backend Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-03 22:12:37 +01:00
Ettore Di Giacinto	5203fb37a6	fix(ci): remove erroneous abspath call Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-03 19:07:39 +01:00
LocalAI [bot]	eb2a656575	fix: return full embedding dimensions instead of truncating trailing zeros (#8721 ) (#8755 ) fix: return full embedding dimensions instead of truncating trailing zeros - Remove the logic that strips trailing zeros from embeddings - Trailing zeros may be valid values in some embedding models - This fixes the issue where embeddings like jina-v3 returned only 1/4 of their native dimensions (256 instead of 1024) - The truncation was causing vector database dimension mismatch errors - Fixes issue #8721 Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-03-03 17:08:16 +01:00
LocalAI [bot]	6e5a58ca70	feat: Add Free RPC to backend.proto for VRAM cleanup (#8751 ) * fix: Add VRAM cleanup when stopping models - Add Free() method to AIModel interface for proper GPU resource cleanup - Implement Free() in llama backend to release llama.cpp model resources - Add Free() stub implementations in base and SingleThread backends - Modify deleteProcess() to call Free() before stopping the process to ensure VRAM is properly released when models are unloaded Fixes issue where VRAM was not freed when stopping models, which could lead to memory exhaustion when running multiple models sequentially. * feat: Add Free RPC to backend.proto for VRAM cleanup - Add rpc Free(HealthMessage) returns (Result) {} to backend.proto - This RPC is required to properly expose the Free() method through the gRPC interface for VRAM resource cleanup Refs: PR #8739 * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-03 12:39:06 +01:00
Ettore Di Giacinto	1c8db3846d	chore(faster-qwen3-tts): Add anyio to requirements.txt Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-03 09:43:29 +01:00
LocalAI [bot]	5139719d59	chore(model gallery): 🤖 add 1 new models via gallery agent (#8743 ) chore(model gallery): 🤖 add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-03 09:33:11 +01:00
LocalAI [bot]	d846ad3a84	chore: ⬆️ Update ggml-org/llama.cpp to `4d828bd1ab52773ba9570cc008cf209eb4a8b2f5` (#8727 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-02 23:22:28 +01:00
dependabot[bot]	50cf3ff37f	chore(deps): bump github.com/google/go-containerregistry from 0.20.7 to 0.21.1 (#8736 ) chore(deps): bump github.com/google/go-containerregistry Bumps [github.com/google/go-containerregistry](https://github.com/google/go-containerregistry) from 0.20.7 to 0.21.1. - [Release notes](https://github.com/google/go-containerregistry/releases) - [Commits](https://github.com/google/go-containerregistry/compare/v0.20.7...v0.21.1) --- updated-dependencies: - dependency-name: github.com/google/go-containerregistry dependency-version: 0.21.1 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-03-02 21:44:40 +01:00
dependabot[bot]	74dff72551	chore(deps): bump go.opentelemetry.io/otel/metric from 1.40.0 to 1.41.0 (#8735 ) Bumps [go.opentelemetry.io/otel/metric](https://github.com/open-telemetry/opentelemetry-go) from 1.40.0 to 1.41.0. - [Release notes](https://github.com/open-telemetry/opentelemetry-go/releases) - [Changelog](https://github.com/open-telemetry/opentelemetry-go/blob/main/CHANGELOG.md) - [Commits](https://github.com/open-telemetry/opentelemetry-go/compare/v1.40.0...v1.41.0) --- updated-dependencies: - dependency-name: go.opentelemetry.io/otel/metric dependency-version: 1.41.0 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-03-02 21:44:28 +01:00
dependabot[bot]	c4eeab0f7c	chore(deps): bump go.opentelemetry.io/otel from 1.40.0 to 1.41.0 (#8734 ) Bumps [go.opentelemetry.io/otel](https://github.com/open-telemetry/opentelemetry-go) from 1.40.0 to 1.41.0. - [Release notes](https://github.com/open-telemetry/opentelemetry-go/releases) - [Changelog](https://github.com/open-telemetry/opentelemetry-go/blob/main/CHANGELOG.md) - [Commits](https://github.com/open-telemetry/opentelemetry-go/compare/v1.40.0...v1.41.0) --- updated-dependencies: - dependency-name: go.opentelemetry.io/otel dependency-version: 1.41.0 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-03-02 21:44:15 +01:00
dependabot[bot]	432e182af0	chore(deps): bump github.com/modelcontextprotocol/go-sdk from 1.3.0 to 1.4.0 (#8733 ) chore(deps): bump github.com/modelcontextprotocol/go-sdk Bumps [github.com/modelcontextprotocol/go-sdk](https://github.com/modelcontextprotocol/go-sdk) from 1.3.0 to 1.4.0. - [Release notes](https://github.com/modelcontextprotocol/go-sdk/releases) - [Commits](https://github.com/modelcontextprotocol/go-sdk/compare/v1.3.0...v1.4.0) --- updated-dependencies: - dependency-name: github.com/modelcontextprotocol/go-sdk dependency-version: 1.4.0 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-03-02 21:44:03 +01:00
dependabot[bot]	d1fcd19cd3	chore(deps): bump github.com/openai/openai-go/v3 from 3.19.0 to 3.24.0 (#8732 ) Bumps [github.com/openai/openai-go/v3](https://github.com/openai/openai-go) from 3.19.0 to 3.24.0. - [Release notes](https://github.com/openai/openai-go/releases) - [Changelog](https://github.com/openai/openai-go/blob/main/CHANGELOG.md) - [Commits](https://github.com/openai/openai-go/compare/v3.19.0...v3.24.0) --- updated-dependencies: - dependency-name: github.com/openai/openai-go/v3 dependency-version: 3.24.0 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-03-02 21:43:53 +01:00
dependabot[bot]	dc550395cb	chore(deps): bump actions/upload-artifact from 6 to 7 (#8730 ) Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact) from 6 to 7. - [Release notes](https://github.com/actions/upload-artifact/releases) - [Commits](https://github.com/actions/upload-artifact/compare/v6...v7) --- updated-dependencies: - dependency-name: actions/upload-artifact dependency-version: '7' dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-03-02 21:43:39 +01:00
dependabot[bot]	00ecc60372	chore(deps): bump actions/download-artifact from 7 to 8 (#8729 ) Bumps [actions/download-artifact](https://github.com/actions/download-artifact) from 7 to 8. - [Release notes](https://github.com/actions/download-artifact/releases) - [Commits](https://github.com/actions/download-artifact/compare/v7...v8) --- updated-dependencies: - dependency-name: actions/download-artifact dependency-version: '8' dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-03-02 21:43:27 +01:00
LocalAI [bot]	11443dc299	chore(model-gallery): ⬆️ update checksum (#8728 ) ⬆️ Checksum updates in gallery/index.yaml Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-02 21:43:12 +01:00
LocalAI [bot]	6d182281cf	fix: allow reranking models configured with known_usecases (#8681 ) When a model is configured with 'known_usecases: [rerank]' in the YAML config, the reranking endpoint was not being matched because: 1. The GuessUsecases function only checked for backend == 'rerankers' 2. The syncKnownUsecasesFromString() was not being called when loading configs via yaml.Unmarshal in readModelConfigsFromFile This fix: 1. Updates GuessUsecases to also check if Reranking is explicitly set to true in the model config (in addition to checking backend type) 2. Adds syncKnownUsecasesFromString() calls after yaml.Unmarshal in readModelConfigsFromFile to ensure known_usecases are properly parsed Fixes #8658 Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-03-02 19:00:18 +01:00
Ettore Di Giacinto	8f0c3cec39	chore(ci): disable CI actions Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-02 14:48:00 +01:00
LocalAI [bot]	2dd4e7cdc3	fix(qwen-tts): ensure all requirements files end with newline (#8724 ) - Add trailing newline to all requirements*.txt files in qwen-tts backend - This ensures proper file formatting and prevents potential issues with package installation tools that expect newline-terminated files	2026-03-02 13:56:11 +01:00
LocalAI [bot]	eca2c6e01c	fix: Implement responsive line wrapping for model names (#8209 ) (#8720 ) fix: Implement responsive line wrapping for model names on home page - Changed model name display from truncate to break-words - Increased max-width from 100px to 200px to allow more text - This fixes issue #8209 for responsive text wrapping on smaller screens Fixes: #8209 Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-03-02 13:54:58 +01:00
LocalAI [bot]	b61536c0f4	chore: ⬆️ Update ggml-org/llama.cpp to `319146247e643695f94a558e8ae686277dd4f8da` (#8707 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-02 10:08:51 +01:00
LocalAI [bot]	02d287a297	fix(ci): correct transformer backend path typo (#8712 ) * fix(ci): correct transformer backend path typo - Fix typo: 'transformer' -> 'transformers' in .github/workflows/test.yml - The original PR #8710 had a typo where 'transformers' was written as 'transformer' - This caused the build to fail as the directory is actually named 'transformers' - References: https://github.com/mudler/LocalAI/pull/8710 * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-02 10:08:28 +01:00
LocalAI [bot]	8b430c577b	feat: Add debug logging for pocket-tts voice issue #8244 (#8715 ) Adding debug logging to help investigate the pocket-tts custom voice finding issue (Issue #8244). This is a first step to understand how voices are being loaded and where the failure occurs. Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-03-02 09:24:59 +01:00
LocalAI [bot]	0063e5d68f	feat(swagger): update swagger (#8706 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-01 21:33:19 +01:00
Ettore Di Giacinto	c7c4a20a9e	fix: retry when LLM returns empty messages (#8704 ) * debug Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * retry instead of re-computing a response Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-01 21:32:38 +01:00
LocalAI [bot]	94539f3992	chore(model gallery): 🤖 add 1 new models via gallery agent (#8698 ) chore(model gallery): 🤖 add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-01 16:54:01 +01:00
LocalAI [bot]	525278658d	chore(model gallery): 🤖 add 1 new models via gallery agent (#8696 ) chore(model gallery): 🤖 add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-01 16:19:38 +01:00
LocalAI [bot]	919f801e25	chore(model gallery): 🤖 add 1 new models via gallery agent (#8695 ) chore(model gallery): 🤖 add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-01 15:58:02 +01:00
LocalAI [bot]	362eb261c5	chore(model gallery): 🤖 add 1 new models via gallery agent (#8694 ) chore(model gallery): 🤖 add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-01 15:40:43 +01:00
LocalAI [bot]	d407f4ead5	chore(model gallery): 🤖 add 1 new models via gallery agent (#8693 ) chore(model gallery): 🤖 add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-01 15:25:08 +01:00
Ettore Di Giacinto	1fc8ad854f	fix(toolcall): consider also literal \n between tags Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-01 11:20:46 +01:00
Loryan Strant	f49a8edd87	docs: Update Home Assistant links in README.md (#8688 ) Update Home Assistant links in README.md Signed-off-by: Loryan Strant <51473494+loryanstrant@users.noreply.github.com>	2026-03-01 08:28:58 +01:00
Ettore Di Giacinto	510b830d2b	fix: simplify CI steps, fix gallery agent (#8685 ) chore: simplify CI steps, fix gallery agent Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-01 01:00:30 +01:00
LocalAI [bot]	ddb36468ed	chore: ⬆️ Update ggml-org/llama.cpp to `05728db18eea59de81ee3a7699739daaf015206b` (#8683 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-01 00:48:26 +01:00
Ettore Di Giacinto	983db7bedc	feat(ui): add model size estimation (#8684 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-28 23:03:47 +01:00
LocalAI [bot]	b260378694	docs: add TLS reverse proxy configuration guide (#8673 ) * docs: add TLS reverse proxy configuration guide Add documentation explaining how to use LocalAI behind a TLS termination reverse proxy (HAProxy, Apache, Nginx). The documentation covers: - How LocalAI detects HTTPS via X-Forwarded-Proto header - Required headers that must be forwarded - Configuration examples for HAProxy, Apache, and Nginx - Sub-path serving configuration - Testing and troubleshooting guide Fixes: Issue #7176 - Web UI broken behind TLS reverse proxy Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> * docs: remove non-existent --base-url option from sub-path section --------- Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-02-28 23:02:17 +01:00
LocalAI [bot]	b10443ab5a	feat(models): add model storage size display and RAM warning (#8675 ) Add model storage size display and RAM warning in Models tab - Backend (ui_api.go): - Added getDirectorySize() helper function to calculate total size of model files - Added storageSize, ramTotal, ramUsed, ramUsagePercent to /api/models endpoint response - Uses xsysinfo.GetSystemRAMInfo() for RAM information - Frontend (models.html): - Added storageSize, ramTotal, ramUsed, ramUsagePercent to Alpine.js data object - Added formatBytes() helper for human-readable byte formatting - Display storage size in hero header with blue indicator - Show warning banner when storage exceeds RAM (model too large for system) Addresses: https://github.com/mudler/LocalAI/issues/6251 Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-02-28 22:05:01 +01:00
LocalAI [bot]	b647b6caf1	fix: properly sync model selection dropdown in video generation UI (#8680 ) fix(video): initialize model selection dropdown with current model value The Alpine.js link variable was starting empty, causing the dropdown selection to not reflect the currently selected model. This fix initializes the link variable with the current model value from the template (e.g., video/{{.Model}}), following the same pattern used in image.html. Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-02-28 13:11:33 +01:00
LocalAI [bot]	c187b160e7	fix(gallery): clean up partially downloaded backend on installation failure (#8679 ) When a backend download fails (e.g., on Mac OS with port conflicts causing connection issues), the backend directory is left with partial files. This causes subsequent installation attempts to fail with 'run file not found' because the sanity check runs on an empty/partial directory. This fix cleans up the backend directory when the initial download fails before attempting fallback URIs or mirrors. This ensures a clean state for retry attempts. Fixes: #8016 Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-02-28 13:10:53 +01:00
LocalAI [bot]	42e580bed0	fix: whisper breaking on cuda-13 (use absolute path for CUDA directory detection) (#8678 ) fix: use absolute path for CUDA directory detection The capability detection was using a relative path 'usr/local/cuda-13' which doesn't work when LocalAI is run from a different working directory. This caused whisper (and other backends) to fail on CUDA-13 containers because the system incorrectly detected 'nvidia' capability instead of 'nvidia-cuda-13', leading to wrong backend selection (cuda12-whisper instead of cuda13-whisper). Fixes: https://github.com/mudler/LocalAI/issues/8033 Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-02-28 09:10:40 +01:00
LocalAI [bot]	5e13193d84	docs: add CDI driver config for NVIDIA GPU in containers (fix #8108 ) (#8677 ) This addresses issue #8108 where the legacy nvidia driver configuration causes container startup failures with newer NVIDIA Container Toolkit versions. Changes: - Update docker-compose example to show both CDI (recommended) and legacy nvidia driver options - Add troubleshooting section for 'Auto-detected mode as legacy' error - Document the fix for nvidia-container-cli 'invalid expression' errors The root cause is a Docker/NVIDIA Container Toolkit configuration issue, not a LocalAI code bug. The error occurs during the container runtime's prestart hook before LocalAI starts. Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-02-28 08:42:53 +01:00
Ettore Di Giacinto	1c5dc83232	chore(deps): bump llama.cpp to 'ecbcb7ea9d3303097519723b264a8b5f1e977028' (#8672 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-28 00:33:56 +01:00
LocalAI [bot]	73b997686a	chore: ⬆️ Update ggml-org/whisper.cpp to `9453b4b9be9b73adfc35051083f37cefa039acee` (#8671 ) ⬆️ Update ggml-org/whisper.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-02-27 21:28:48 +00:00
Ettore Di Giacinto	00abf1be1f	fix(qwen3.5): add qwen3.5 preset and mimic llama.cpp's PEG (#8668 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-27 12:15:00 +01:00
LocalAI [bot]	959458f0db	fix(gallery): add fallback URI resolution for backend installation (#8663 ) * fix(gallery): add fallback URI resolution for backend installation When a backend installation fails (e.g., due to missing 'latest-' tag), try fallback URIs in order: 1. Replace 'latest-' with 'master-' in the URI 2. If that fails, append '-development' to the backend name This fixes the issue where backend index entries don't match the repository tags. For example, installing 'ace-step' tries to download 'latest-gpu-nvidia-cuda-13-ace-step' but only 'master-gpu-nvidia-cuda-13-ace-step' exists in the quay.io registry. Fixes: #8437 Signed-off-by: localai-bot <139863280+localai-bot@users.noreply.github.com> * chore(gallery): make fallback URI patterns configurable via env vars --------- Signed-off-by: localai-bot <139863280+localai-bot@users.noreply.github.com>	2026-02-27 10:56:33 +01:00
LocalAI [bot]	dfc6efb88d	feat(backends): add faster-qwen3-tts (#8664 ) * feat(backends): add faster-qwen3-tts Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: this backend is CUDA only Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: add requirements-install.txt with setuptools for build isolation The faster-qwen3-tts backend requires setuptools to build packages like sox that have setuptools as a build dependency. This ensures the build completes successfully in CI. Signed-off-by: LocalAI Bot <localai-bot@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: LocalAI Bot <localai-bot@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-27 08:16:51 +01:00
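
The fallback order described in commit 959458f0db (replace 'latest-' with 'master-', then append '-development') could look roughly like the sketch below. This is an illustration only, not the code from that commit: the function name `fallbackURIs`, the registry path `quay.io/example/backends`, and the choice to append '-development' to the original URI (whose suffix is the backend name) are all assumptions made for the example.

```go
package main

import (
	"fmt"
	"strings"
)

// fallbackURIs returns the candidate URIs to try, in order, after the
// primary URI has failed. Hypothetical helper, not LocalAI's implementation.
func fallbackURIs(uri string) []string {
	var candidates []string

	// Step 1: swap the "latest-" tag prefix for "master-".
	if strings.Contains(uri, "latest-") {
		candidates = append(candidates, strings.Replace(uri, "latest-", "master-", 1))
	}

	// Step 2: append "-development" to the backend name. Assumption: the
	// backend name is the URI suffix, so the marker goes at the end of the
	// original URI.
	candidates = append(candidates, uri+"-development")

	return candidates
}

func main() {
	// Hypothetical registry path; only the tag mirrors the commit message.
	primary := "quay.io/example/backends:latest-gpu-nvidia-cuda-13-ace-step"
	for _, candidate := range fallbackURIs(primary) {
		fmt.Println(candidate)
	}
}
```

A caller would try the primary URI first and walk this list only on failure. The commit message also mentions making the patterns configurable via environment variables, which this sketch leaves out.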

5678 Commits