LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-07-01 20:07:18 -04:00

Author	SHA1	Message	Date
LocalAI [bot]	ad57cdfefe	chore: ⬆️ Update leejet/stable-diffusion.cpp to `f16a110f8776398ef23a2a6b7b57522c2471637a` (#9167 ) ⬆️ Update leejet/stable-diffusion.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-30 08:26:45 +02:00
Richard Palethorpe	c2f7d1c18b	feat(ui): Add media history to studio pages (e.g. past images) (#9151 ) Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-03-30 00:49:55 +02:00
ER-EPR	afe79568d6	fix: huggingface repo change the file name so Update index.yaml is needed (#9163 ) * Update index.yaml Signed-off-by: ER-EPR <38782737+ER-EPR@users.noreply.github.com> * Add mmproj files for Qwen3.5 models Signed-off-by: ER-EPR <38782737+ER-EPR@users.noreply.github.com> * Update file paths for Qwen models in index.yaml Signed-off-by: ER-EPR <38782737+ER-EPR@users.noreply.github.com> --------- Signed-off-by: ER-EPR <38782737+ER-EPR@users.noreply.github.com>	2026-03-30 00:48:17 +02:00
Ettore Di Giacinto	59108fbe32	feat: add distributed mode (#9124 ) * feat: add distributed mode (experimental) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix data races, mutexes, transactions Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactorings Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix events and tool stream in agent chat Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * use ginkgo Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring and consolidation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring and consolidation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring and consolidation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring and consolidation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring and consolidation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring and consolidation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring and consolidation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring and consolidation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(cron): compute correctly time boundaries avoiding re-triggering Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * enhancements, refactorings Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * do not flood of healthy checks Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * do not list obvious backends as text backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * tests fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring and consolidation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop redundant healthcheck Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * enhancements, refactorings Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-30 00:47:27 +02:00
LocalAI [bot]	4c870288d9	chore: ⬆️ Update ggml-org/llama.cpp to `59d840209a5195c2f6e2e81b5f8339a0637b59d9` (#9144 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-28 18:18:06 +01:00
Ettore Di Giacinto	8da7212763	fix(ci): checkout submodules Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-28 00:33:31 +01:00
Ettore Di Giacinto	6e76052f9d	ci: set gh-pages Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-27 21:26:55 +00:00
Richard Palethorpe	cf84db36ec	fix(voxcpm): Force using a recent voxcpm version to kick the dependency solver (#9150 ) fix(voxcpm): Allow packages to be fetched from all indexes Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-03-27 15:38:51 +01:00
Richard Palethorpe	d3f629f183	feat: Merge repeated log lines in the terminal (#9141 ) Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-03-26 22:16:13 +01:00
Richard Palethorpe	b1aa707a92	fix(coqui,nemo,voxcpm): Add dependencies to allow CI to progress (#9142 ) Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-03-26 18:03:56 +01:00
LocalAI [bot]	731176ce3a	feat(swagger): update swagger (#9136 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-26 07:58:11 +01:00
LocalAI [bot]	b86fa63f70	chore: ⬆️ Update ggml-org/llama.cpp to `a970515bdb0b1d09519106847660b0d0c84d2472` (#9137 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-26 07:56:41 +01:00
walcz-de	00fcf6936c	fix: implement encoding_format=base64 for embeddings endpoint (#9135 ) The OpenAI Node.js SDK v4+ sends encoding_format=base64 by default. LocalAI previously ignored this parameter and always returned a float JSON array, causing a silent data corruption bug in any Node.js client (AnythingLLM Desktop, LangChain.js, LlamaIndex.TS, …): // What the client does when it expects base64 but receives a float array: Buffer.from(floatArray, 'base64') Node.js treats a non-string first argument as a byte array — each float32 value is truncated to a single byte — and Float32Array then reads those bytes as floats, yielding dims/4 values. Vector databases (Qdrant, pgvector, …) then create collections with the wrong dimension, causing all similarity searches to fail silently. e.g. granite-embedding-107m (384 dims) → 96 stored in Qdrant jina-embeddings-v3 (1024 dims) → 256 stored in Qdrant Changes: - core/schema/prediction.go: add EncodingFormat string field to PredictionOptions so the request parameter is parsed and available throughout the request pipeline - core/schema/openai.go: add EmbeddingBase64 string field to Item; add MarshalJSON so the "embedding" JSON key emits either []float32 or a base64 string depending on which field is populated — all other Item consumers (image, video endpoints) are unaffected - core/http/endpoints/openai/embeddings.go: add floatsToBase64() which packs a float32 slice as little-endian bytes and base64-encodes it; add embeddingItem() helper; both InputToken and InputStrings loops now honour encoding_format=base64 Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-25 17:38:07 +01:00
Richard Palethorpe	26384c5c70	fix(docs): Use notice instead of alert (#9134 ) Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-03-25 13:55:48 +01:00
LocalAI [bot]	7209457f53	chore: ⬆️ Update ace-step/acestep.cpp to `6f35c874ee11e86d511b860019b84976f5b52d3a` (#9128 ) ⬆️ Update ace-step/acestep.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-25 07:52:31 +01:00
LocalAI [bot]	9bc68b2721	chore: ⬆️ Update ggml-org/llama.cpp to `9f102a1407ed5d73b8c954f32edab50f8dfa3f58` (#9127 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-25 07:52:14 +01:00
Richard Palethorpe	7bdd198fd3	fix(downloader): Rewrite full https HF URI with HF_ENDPOINT (#9107 ) Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-03-24 18:32:52 +01:00
dependabot[bot]	b296e3d94b	chore(deps): bump github.com/mudler/skillserver from 0.0.5 to 0.0.6 (#9116 ) Bumps [github.com/mudler/skillserver](https://github.com/mudler/skillserver) from 0.0.5 to 0.0.6. - [Release notes](https://github.com/mudler/skillserver/releases) - [Commits](https://github.com/mudler/skillserver/compare/v0.0.5...v0.0.6) --- updated-dependencies: - dependency-name: github.com/mudler/skillserver dependency-version: 0.0.6 dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-03-24 08:51:02 +01:00
dependabot[bot]	c91855a9b2	chore(deps): bump peter-evans/create-pull-request from 7 to 8 (#9114 ) Bumps [peter-evans/create-pull-request](https://github.com/peter-evans/create-pull-request) from 7 to 8. - [Release notes](https://github.com/peter-evans/create-pull-request/releases) - [Commits](https://github.com/peter-evans/create-pull-request/compare/v7...v8) --- updated-dependencies: - dependency-name: peter-evans/create-pull-request dependency-version: '8' dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-03-24 08:50:50 +01:00
dependabot[bot]	e8e445cd43	chore(deps): bump actions/checkout from 4 to 6 (#9110 ) Bumps [actions/checkout](https://github.com/actions/checkout) from 4 to 6. - [Release notes](https://github.com/actions/checkout/releases) - [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md) - [Commits](https://github.com/actions/checkout/compare/v4...v6) --- updated-dependencies: - dependency-name: actions/checkout dependency-version: '6' dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-03-24 08:50:36 +01:00
dependabot[bot]	735c426072	chore(deps): bump github.com/modelcontextprotocol/go-sdk from 1.4.0 to 1.4.1 (#9118 ) chore(deps): bump github.com/modelcontextprotocol/go-sdk Bumps [github.com/modelcontextprotocol/go-sdk](https://github.com/modelcontextprotocol/go-sdk) from 1.4.0 to 1.4.1. - [Release notes](https://github.com/modelcontextprotocol/go-sdk/releases) - [Commits](https://github.com/modelcontextprotocol/go-sdk/compare/v1.4.0...v1.4.1) --- updated-dependencies: - dependency-name: github.com/modelcontextprotocol/go-sdk dependency-version: 1.4.1 dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-03-24 00:36:23 +01:00
dependabot[bot]	0976b8a17b	chore(deps): bump github.com/google/go-containerregistry from 0.21.2 to 0.21.3 (#9121 ) chore(deps): bump github.com/google/go-containerregistry Bumps [github.com/google/go-containerregistry](https://github.com/google/go-containerregistry) from 0.21.2 to 0.21.3. - [Release notes](https://github.com/google/go-containerregistry/releases) - [Commits](https://github.com/google/go-containerregistry/compare/v0.21.2...v0.21.3) --- updated-dependencies: - dependency-name: github.com/google/go-containerregistry dependency-version: 0.21.3 dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-03-24 00:35:55 +01:00
LocalAI [bot]	2ad8c149e0	chore: ⬆️ Update ggml-org/llama.cpp to `1772701f99dd3fc13f5783b282c2361eda8ca47c` (#9123 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-24 00:35:40 +01:00
LocalAI [bot]	31fcb1425d	chore: ⬆️ Update ggml-org/llama.cpp to `49bfddeca18e62fa3d39114a23e9fcbdf8a22388` (#9102 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-23 01:11:18 +01:00
LocalAI [bot]	470d5e506f	feat(swagger): update swagger (#9103 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-22 21:28:20 +01:00
Ettore Di Giacinto	0ee49cf42e	Fix formatting in LocalAI description Updated the description formatting for LocalAI. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-22 21:28:07 +01:00
Ettore Di Giacinto	cecd8d6aa5	chore(docs): simplify Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-22 20:24:44 +00:00
Ettore Di Giacinto	15935e9d5f	fix(auth): do not allow to register in invite mode (#9101 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-22 20:44:03 +01:00
Ettore Di Giacinto	5d410e5a03	fix(download): do not remove dst dir until we try all fallbacks (#9100 ) This actually caused fallbacks to be compeletely no-op as we were removing the destination dir before calling containerd.Apply Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-22 10:29:57 +01:00
Ettore Di Giacinto	5df77d7e8c	Remove how-tos section link from README Removed outdated link to community curated how-tos section. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-22 10:05:13 +01:00
Ettore Di Giacinto	f891d60d26	fix(llama.cpp): bundle libdl, librt, libpthread in llama-cpp backend (#9099 ) chore(llama.cpp): bundle libdl, librt, libpthread in llama-cpp backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-22 00:58:14 +01:00
Ettore Di Giacinto	be25217955	chore(transformers): bump to >5.0 and generically load models (#9097 ) * chore(transformers): bump to >5.0 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: refactor to use generic model loading Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-22 00:57:54 +01:00
LocalAI [bot]	b74111feed	chore: ⬆️ Update ggml-org/llama.cpp to `990e4d96980d0b016a2b07049cc9031642fb9903` (#9095 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-22 00:57:39 +01:00
LocalAI [bot]	bf92117259	chore: ⬆️ Update ggml-org/whisper.cpp to `76684141a5d059be71cbe23dc2f0ed552213ba2d` (#9094 ) ⬆️ Update ggml-org/whisper.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-22 00:57:28 +01:00
Ettore Di Giacinto	031a36c995	feat: inferencing default, automatic tool parsing fallback and wire min_p (#9092 ) * feat: wire min_p Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: inferencing defaults Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(refactor): re-use iterative parser Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: generate automatically inference defaults from unsloth Instead of trying to re-invent the wheel and maintain here the inference defaults, prefer to consume unsloth ones, and contribute there as necessary. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: apply defaults also to models installed via gallery Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: be consistent and apply fallback to all endpoint Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-22 00:57:15 +01:00
LocalAI [bot]	8036d22ec6	chore: ⬆️ Update ace-step/acestep.cpp to `7326a7bea0c2037982ec924f7364e998df70450c` (#9086 ) ⬆️ Update ace-step/acestep.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-22 00:56:52 +01:00
Ettore Di Giacinto	f7e8d9e791	feat(quantization): add quantization backend (#9096 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-22 00:56:34 +01:00
Ettore Di Giacinto	4b183b7bb6	feat: add quota system (#9090 ) * feat: add quota system Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-21 10:09:49 +01:00
Ettore Di Giacinto	f38e91d80b	feat(ui): add predictor for usage, user-breakdown statistics (#9091 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-21 10:09:36 +01:00
LocalAI [bot]	aa3e82976e	chore: ⬆️ Update ggml-org/llama.cpp to `4cb7e0bd61e7e1101e8ab10db5dee70c5717a386` (#9087 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-21 09:41:11 +01:00
Ettore Di Giacinto	d9c1db2b87	feat: add (experimental) fine-tuning support with TRL (#9088 ) * feat: add fine-tuning endpoint Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(experimental): add fine-tuning endpoint and TRL support This changeset defines new GRPC signatues for Fine tuning backends, and add TRL backend as initial fine-tuning engine. This implementation also supports exporting to GGUF and automatically importing it to LocalAI after fine-tuning. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * commit TRL backend, stop by killing process Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * move fine-tune to generic features Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * add evals, reorder menu Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-21 02:08:02 +01:00
LocalAI [bot]	f7e3aab4fc	feat(swagger): update swagger (#9085 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-20 21:45:03 +01:00
Richard Palethorpe	73bdc3b50d	fix(realtime): Set the alias for opus so the development backend can be selected (#9083 ) Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-03-20 15:08:07 +01:00
Richard Palethorpe	cb63bdb9e4	feat(ui): Add model pipeline editor (#9070 ) This creates a new model config page. Presently just allows configuring pipelines, but can be extending the future to other types of models. However pipelines are quite easy to create a form for and require editing to create. Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-03-20 15:07:34 +01:00
Richard Palethorpe	8cd3f9fc47	feat(ui, openai): Structured errors and link to traces in error toast (#9068 ) First when sending errors over SSE we now clearly identify them as such instead of just sending the error string as a chat completion message. We use this in the UI to identify errors and link to them to the traces. Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-03-20 15:06:07 +01:00
lif	e0ab1a8b43	fix: use exact tag matching for model gallery tag filtering (#9041 ) The Search() method uses strings.Contains() on comma-joined tags, causing substring false positives (e.g., "asr" matching "image-diffusers"). Add FilterByTag() method that checks each tag with strings.EqualFold() for exact, case-insensitive matching. Add 'tag' query parameter to /api/models and /api/backends endpoints. Update the React frontend to send filter selections as 'tag' instead of 'term'. Closes #8775 Signed-off-by: majiayu000 <1835304752@qq.com>	2026-03-20 08:37:45 +01:00
Ettore Di Giacinto	c3174f9543	chore(deps): bump llama-cpp to 'a0bbcdd9b6b83eeeda6f1216088f42c33d464e38' (#9079 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-20 08:12:21 +01:00
LocalAI [bot]	2b12875302	fix: Add tracing settings loading from runtime_settings.json (#9081 ) Tracing settings (EnableTracing and TracingMaxItems) were not being loaded from runtime_settings.json on startup, causing tracing settings configured via WebUI to be lost after service restart. This fix adds proper loading of tracing settings in loadRuntimeSettingsFromFile function in core/application/startup.go. Fixes #9072 Co-authored-by: localai-bot <localai-bot@localai.io>	2026-03-20 00:58:52 +01:00
Ettore Di Giacinto	9cdbd89c1f	chore(agents.md): update with auth/feature gating instructions Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-19 22:52:28 +00:00
LocalAI [bot]	7d81bf0aa3	chore: ⬆️ Update ggml-org/whisper.cpp to `9386f239401074690479731c1e41683fbbeac557` (#9077 ) ⬆️ Update ggml-org/whisper.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-19 23:27:35 +01:00

1 2 3 4 5 ...

5879 Commits