LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-07-02 04:16:56 -04:00

Author	SHA1	Message	Date
Attila Györffy	5a67b5d73c	Fix image upload processing and img2img pipeline in diffusers backend (#8879) * fix: add missing bufio.Flush in processImageFile The processImageFile function writes decoded image data (from base64 or URL download) through a bufio.NewWriter but never calls Flush() before closing the underlying file. Since bufio's default buffer is 4096 bytes, small images produce 0-byte files and large images are truncated, causing PIL to fail with "cannot identify image file". This breaks all image input paths: file, files, and ref_images parameters in /v1/images/generations, making img2img, inpainting, and reference image features non-functional. Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> * fix: merge options into kwargs in diffusers GenerateImage The GenerateImage method builds a local `options` dict containing the source image (PIL), negative_prompt, and num_inference_steps, but never merges it into `kwargs` before calling self.pipe(**kwargs). This causes img2img to fail with "Input is in incorrect format" because the pipeline never receives the image parameter. Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> * test: add unit test for processImageFile base64 decoding Verifies that a base64-encoded PNG survives the write path (encode → decode → bufio.Write → Flush → file on disk) with byte-for-byte fidelity. The test image is small enough to fit entirely in bufio's 4096-byte buffer, which is the exact scenario where the missing Flush() produced a 0-byte file. Also tests that invalid base64 input is handled gracefully. Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> * test: verify GenerateImage merges options into pipeline kwargs Mocks the diffusers pipeline and calls GenerateImage with a source image and negative prompt. Asserts that the pipeline receives the image, negative_prompt, and num_inference_steps via kwargs, the exact parameters that were silently dropped before the fix. Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> * fix: move kwargs.update(options) earlier in GenerateImage Move the options merge right after self.options merge (L742) so that image, negative_prompt, and num_inference_steps are available to all downstream code paths including img2vid and txt2vid. Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> * test: convert processImageFile tests to ginkgo Replace standard testing with ginkgo/gomega to be consistent with the rest of the test suites in the project. Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> --------- Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-11 08:05:50 +01:00
LocalAI [bot]	270eb956c7	chore: ⬆️ Update ggml-org/llama.cpp to `10e5b148b061569aaee8ae0cf72a703129df0eab` (#8946 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-11 08:04:09 +01:00
LocalAI [bot]	b48920ecf6	chore: ⬆️ Update ggml-org/llama.cpp to `23fbfcb1ad6c6f76b230e8895254de785000be46` (#8921 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-10 07:30:43 +01:00
LocalAI [bot]	515cd968ae	chore: ⬆️ Update leejet/stable-diffusion.cpp to `d6dd6d7b555c233bb9bc9f20b4751eb8c9269743` (#8925 ) ⬆️ Update leejet/stable-diffusion.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-10 07:29:54 +01:00
Ettore Di Giacinto	a026277ab9	feat(mlx-distributed): add new MLX-distributed backend (#8801) * feat(mlx-distributed): add new MLX-distributed backend Add new MLX distributed backend with support for both TCP and RDMA for model sharding. This implementation ties in the discovery implementation already in place, and re-uses the same P2P mechanism for the TCP MLX-distributed inferencing. The auto-parallel implementation is inspired by Exo's (Exo has been added to the acknowledgements for the great work!) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * expose a CLI to facilitate backend starting Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: make manual rank0 configurable via model configs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add missing features from mlx backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-09 17:29:32 +01:00
LocalAI [bot]	f06c02d10e	chore: ⬆️ Update ggml-org/llama.cpp to `35bee031e17ed2b2e8e7278b284a6c8cd120d9f8` (#8872 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-08 22:25:04 +01:00
Ettore Di Giacinto	b2f81bfa2e	feat(functions): add peg-based parsing and allow backends to return tool calls directly (#8838 ) * feat(functions): add peg-based parsing Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: support returning toolcalls directly from backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: do run PEG only if backend didn't send deltas Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-08 22:21:57 +01:00
LocalAI [bot]	85e4871d4d	chore: ⬆️ Update leejet/stable-diffusion.cpp to c8fb3d245858d495be1f140efdcfaa0d49de41e5 (#8841 ) * chore: ⬆️ update stable-diffusion.cpp to `c8fb3d245858d495be1f140efdcfaa0d49de41e5` Update stablediffusion-ggml to include fix for SD1 Pix2Pix issue (leejet/stable-diffusion.cpp#1329). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> Signed-off-by: localai-bot <localai-bot@noreply.github.com> * fix: address CI failures in stablediffusion update Signed-off-by: localai-bot <localai-bot@noreply.github.com> * fix: resolve remaining CI failures in stablediffusion update - Move flow_shift to global scope so gen_image() can access the value set during load_model() (was causing compilation error) - Fix sd_type_str array: TQ1_0 should be at index 34, TQ2_0 at index 35 to match upstream SD_TYPE_TQ1_0=34, SD_TYPE_TQ2_0=35 enum values Signed-off-by: localai-bot <localai-bot@noreply.github.com> Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Signed-off-by: localai-bot <localai-bot@noreply.github.com> Co-authored-by: localai-bot <localai-bot@noreply.github.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-08 09:53:08 +01:00
Weathercold	f347495de9	fix(qwen-tts): duplicate instruct argument in voice design mode (#8842 ) Don't pass instruct because it is added to kwargs Fixes the error `qwen_tts.inference.qwen3_tts_model.Qwen3TTSModel.generate_voice_design() got multiple values for keyword argument 'instruct'` Signed-off-by: Weathercold <weathercold.scr@proton.me>	2026-03-08 08:48:22 +01:00
LocalAI [bot]	1296167f84	chore: ⬆️ Update ggml-org/llama.cpp to `c5a778891ba0ddbd4cbb507c823f970595b1adc2` (#8837 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-07 23:28:06 +01:00
LocalAI [bot]	e1df6807dc	chore: ⬆️ Update ggml-org/llama.cpp to `566059a26b0ce8faec4ea053605719d399c64cc5` (#8822 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-06 23:53:23 +01:00
Ettore Di Giacinto	580517f9db	feat: pass-by metadata to predict options (#8795 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-05 22:50:10 +01:00
LocalAI [bot]	0cf7c18177	chore: ⬆️ Update ggml-org/llama.cpp to `a0ed91a442ea6b013bd42ebc3887a81792eaefa1` (#8797 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-05 22:49:45 +01:00
LocalAI [bot]	ac91413eb2	chore: ⬆️ Update ggml-org/whisper.cpp to `30c5194c9691e4e9a98b3dea9f19727397d3f46e` (#8796 ) ⬆️ Update ggml-org/whisper.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-05 22:49:32 +01:00
LocalAI [bot]	f25e450414	chore: ⬆️ Update ggml-org/llama.cpp to `24d2ee052795063afffc9732465ca1b1c65f4a28` (#8777 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-04 23:25:48 +01:00
Andres	454d8adc76	feat(qwen-tts): Support using multiple voices (#8757 ) * Add support for multiple voice clones in Qwen TTS Signed-off-by: Andres Smith <andressmithdev@pm.me> * Add voice prompt caching and generation logs to see generation time --------- Signed-off-by: Andres Smith <andressmithdev@pm.me> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-04 09:47:21 +01:00
LocalAI [bot]	6002c940a9	chore: ⬆️ Update ggml-org/llama.cpp to `ecd99d6a9acbc436bad085783bcd5d0b9ae9e9e9` (#8762 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-04 08:08:37 +01:00
LocalAI [bot]	6e5a58ca70	feat: Add Free RPC to backend.proto for VRAM cleanup (#8751) * fix: Add VRAM cleanup when stopping models - Add Free() method to AIModel interface for proper GPU resource cleanup - Implement Free() in llama backend to release llama.cpp model resources - Add Free() stub implementations in base and SingleThread backends - Modify deleteProcess() to call Free() before stopping the process to ensure VRAM is properly released when models are unloaded Fixes issue where VRAM was not freed when stopping models, which could lead to memory exhaustion when running multiple models sequentially. * feat: Add Free RPC to backend.proto for VRAM cleanup - Add rpc Free(HealthMessage) returns (Result) {} to backend.proto - This RPC is required to properly expose the Free() method through the gRPC interface for VRAM resource cleanup Refs: PR #8739 * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-03 12:39:06 +01:00
Ettore Di Giacinto	1c8db3846d	chore(faster-qwen3-tts): Add anyio to requirements.txt Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-03 09:43:29 +01:00
LocalAI [bot]	d846ad3a84	chore: ⬆️ Update ggml-org/llama.cpp to `4d828bd1ab52773ba9570cc008cf209eb4a8b2f5` (#8727 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-02 23:22:28 +01:00
LocalAI [bot]	2dd4e7cdc3	fix(qwen-tts): ensure all requirements files end with newline (#8724 ) - Add trailing newline to all requirements*.txt files in qwen-tts backend - This ensures proper file formatting and prevents potential issues with package installation tools that expect newline-terminated files	2026-03-02 13:56:11 +01:00
LocalAI [bot]	b61536c0f4	chore: ⬆️ Update ggml-org/llama.cpp to `319146247e643695f94a558e8ae686277dd4f8da` (#8707 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-02 10:08:51 +01:00
LocalAI [bot]	8b430c577b	feat: Add debug logging for pocket-tts voice issue #8244 (#8715 ) Adding debug logging to help investigate the pocket-tts custom voice finding issue (Issue #8244). This is a first step to understand how voices are being loaded and where the failure occurs. Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-03-02 09:24:59 +01:00
LocalAI [bot]	ddb36468ed	chore: ⬆️ Update ggml-org/llama.cpp to `05728db18eea59de81ee3a7699739daaf015206b` (#8683 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-01 00:48:26 +01:00
Ettore Di Giacinto	1c5dc83232	chore(deps): bump llama.cpp to 'ecbcb7ea9d3303097519723b264a8b5f1e977028' (#8672 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-28 00:33:56 +01:00
LocalAI [bot]	73b997686a	chore: ⬆️ Update ggml-org/whisper.cpp to `9453b4b9be9b73adfc35051083f37cefa039acee` (#8671 ) ⬆️ Update ggml-org/whisper.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-02-27 21:28:48 +00:00
LocalAI [bot]	dfc6efb88d	feat(backends): add faster-qwen3-tts (#8664 ) * feat(backends): add faster-qwen3-tts Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: this backend is CUDA only Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: add requirements-install.txt with setuptools for build isolation The faster-qwen3-tts backend requires setuptools to build packages like sox that have setuptools as a build dependency. This ensures the build completes successfully in CI. Signed-off-by: LocalAI Bot <localai-bot@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: LocalAI Bot <localai-bot@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-27 08:16:51 +01:00
LocalAI [bot]	8ad40091a6	chore: ⬆️ Update ggml-org/llama.cpp to `723c71064da0908c19683f8c344715fbf6d986fd` (#8660 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-02-26 21:34:47 +00:00
LocalAI [bot]	fb86f6461d	chore: ⬆️ Update ggml-org/llama.cpp to `3769fe6eb70b0a0fbb30b80917f1caae68c902f7` (#8655 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-02-26 00:05:03 +01:00
Ettore Di Giacinto	b032cf489b	fix(chatterbox): add support for cuda13/aarch64 (#8653 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-25 21:51:44 +01:00
dependabot[bot]	c4783a0a05	chore(deps): bump grpcio from 1.76.0 to 1.78.1 in /backend/python/vllm (#8635 ) Bumps [grpcio](https://github.com/grpc/grpc) from 1.76.0 to 1.78.1. - [Release notes](https://github.com/grpc/grpc/releases) - [Commits](https://github.com/grpc/grpc/compare/v1.76.0...v1.78.1) --- updated-dependencies: - dependency-name: grpcio dependency-version: 1.78.1 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-02-25 08:17:32 +01:00
dependabot[bot]	c44f03b882	chore(deps): bump grpcio from 1.76.0 to 1.78.1 in /backend/python/rerankers (#8636 ) chore(deps): bump grpcio in /backend/python/rerankers Bumps [grpcio](https://github.com/grpc/grpc) from 1.76.0 to 1.78.1. - [Release notes](https://github.com/grpc/grpc/releases) - [Commits](https://github.com/grpc/grpc/compare/v1.76.0...v1.78.1) --- updated-dependencies: - dependency-name: grpcio dependency-version: 1.78.1 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-02-25 08:16:57 +01:00
dependabot[bot]	eeec92af78	chore(deps): bump sentence-transformers from 5.2.2 to 5.2.3 in /backend/python/transformers (#8638 ) chore(deps): bump sentence-transformers in /backend/python/transformers Bumps [sentence-transformers](https://github.com/huggingface/sentence-transformers) from 5.2.2 to 5.2.3. - [Release notes](https://github.com/huggingface/sentence-transformers/releases) - [Commits](https://github.com/huggingface/sentence-transformers/compare/v5.2.2...v5.2.3) --- updated-dependencies: - dependency-name: sentence-transformers dependency-version: 5.2.3 dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-02-25 08:16:41 +01:00
dependabot[bot]	842033b8b5	chore(deps): bump grpcio from 1.76.0 to 1.78.1 in /backend/python/transformers (#8640 ) chore(deps): bump grpcio in /backend/python/transformers Bumps [grpcio](https://github.com/grpc/grpc) from 1.76.0 to 1.78.1. - [Release notes](https://github.com/grpc/grpc/releases) - [Commits](https://github.com/grpc/grpc/compare/v1.76.0...v1.78.1) --- updated-dependencies: - dependency-name: grpcio dependency-version: 1.78.1 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-02-25 08:14:55 +01:00
dependabot[bot]	a2941228a7	chore(deps): bump grpcio from 1.76.0 to 1.78.1 in /backend/python/common/template (#8641 ) chore(deps): bump grpcio in /backend/python/common/template Bumps [grpcio](https://github.com/grpc/grpc) from 1.76.0 to 1.78.1. - [Release notes](https://github.com/grpc/grpc/releases) - [Commits](https://github.com/grpc/grpc/compare/v1.76.0...v1.78.1) --- updated-dependencies: - dependency-name: grpcio dependency-version: 1.78.1 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-02-25 08:14:43 +01:00
dependabot[bot]	791e6b84ee	chore(deps): bump grpcio from 1.76.0 to 1.78.1 in /backend/python/coqui (#8642 ) Bumps [grpcio](https://github.com/grpc/grpc) from 1.76.0 to 1.78.1. - [Release notes](https://github.com/grpc/grpc/releases) - [Commits](https://github.com/grpc/grpc/compare/v1.76.0...v1.78.1) --- updated-dependencies: - dependency-name: grpcio dependency-version: 1.78.1 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-02-25 08:14:30 +01:00
LocalAI [bot]	1331e23b67	chore: ⬆️ Update ggml-org/llama.cpp to `418dea39cea85d3496c8b04a118c3b17f3940ad8` (#8649 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-02-25 00:04:48 +00:00
LocalAI [bot]	9a5b5ee8a9	chore: ⬆️ Update ggml-org/llama.cpp to `b68a83e641b3ebe6465970b34e99f3f0e0a0b21a` (#8628 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-02-23 22:02:40 +00:00
LocalAI [bot]	f40c8dd0ce	chore: ⬆️ Update ggml-org/llama.cpp to `2b6dfe824de8600c061ef91ce5cc5c307f97112c` (#8622 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-02-23 09:30:58 +00:00
LocalAI [bot]	91f2dd5820	chore: ⬆️ Update ggml-org/llama.cpp to `f75c4e8bf52ea480ece07fd3d9a292f1d7f04bc5` (#8619 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-02-22 13:20:08 +01:00
LocalAI [bot]	fcecc12e57	chore: ⬆️ Update ggml-org/llama.cpp to `ba3b9c8844aca35ecb40d31886686326f22d2214` (#8613 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-02-21 09:57:04 +01:00
LocalAI [bot]	bb0924dff1	chore: ⬆️ Update ggml-org/llama.cpp to `b908baf1825b1a89afef87b09e22c32af2ca6548` (#8612 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-02-20 23:47:47 +01:00
LocalAI [bot]	b1c434f0fc	chore: ⬆️ Update ggml-org/llama.cpp to `11c325c6e0666a30590cde390d5746a405e536b9` (#8607 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-02-19 23:32:35 +01:00
LocalAI [bot]	bb42b342de	chore: ⬆️ Update ggml-org/whisper.cpp to `21411d81ea736ed5d9cdea4df360d3c4b60a4adb` (#8606 ) ⬆️ Update ggml-org/whisper.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-02-19 23:32:21 +01:00
LocalAI [bot]	e555057f8b	fix: multi-GPU support for Diffusers (Issue #8575 ) (#8605 ) * chore: init * feat: implement multi-GPU support for Diffusers backend (fixes #8575) --------- Co-authored-by: localai-bot <localai-bot@users.noreply.github.com>	2026-02-19 21:35:58 +01:00
Ettore Di Giacinto	dadc7158fb	fix(diffusers): sd_embed is not always available (#8602 ) Seems sd_embed doesn't play well with MPS and L4T. Making it optional Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-19 10:45:17 +01:00
LocalAI [bot]	68c7077491	chore: ⬆️ Update ggml-org/llama.cpp to `b55dcdef5dcd74dc75c4921090e928d43453c157` (#8599 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-02-18 22:33:25 +01:00
LocalAI [bot]	ed832cf0e0	chore: ⬆️ Update ggml-org/llama.cpp to `2b089c77580d347767f440205103e4da8ec33d89` (#8592 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-02-17 22:35:07 +00:00
Richard Palethorpe	9e692967c3	fix(llama-cpp): Pass parameters when using embedded template (#8590 ) Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-02-17 18:50:05 +01:00
LocalAI [bot]	067a255435	chore: ⬆️ Update ggml-org/llama.cpp to `d612901116ab2066c7923372d4827032ff296bc4` (#8588 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-02-17 00:57:32 +01:00
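
Commit 5a67b5d73c describes a GenerateImage bug in which a local `options` dict was built but never merged into `kwargs` before the pipeline call. The following is a minimal sketch of that failure mode and its fix, not the backend's actual code: `FakePipeline`, `generate_buggy`, `generate_fixed`, and the `guidance_scale` default are hypothetical names introduced only for illustration.

```python
from typing import Any, Dict


class FakePipeline:
    """Hypothetical stand-in for a diffusers pipeline; it only records its arguments."""

    def __init__(self) -> None:
        self.received: Dict[str, Any] = {}

    def __call__(self, prompt: str, **kwargs: Any) -> Dict[str, Any]:
        # Record everything the caller passed so it can be inspected afterwards.
        self.received = {"prompt": prompt, **kwargs}
        return self.received


def build_options(source_image: Any, negative_prompt: str, steps: int) -> Dict[str, Any]:
    # Per-request parameters, named as in the commit message.
    return {
        "image": source_image,
        "negative_prompt": negative_prompt,
        "num_inference_steps": steps,
    }


def generate_buggy(pipe: FakePipeline, prompt: str, source_image: Any,
                   negative_prompt: str, steps: int) -> Dict[str, Any]:
    kwargs: Dict[str, Any] = {"guidance_scale": 7.5}  # assumed default, illustration only
    options = build_options(source_image, negative_prompt, steps)
    # Bug: `options` is built but never merged, so the pipeline never sees the image.
    return pipe(prompt, **kwargs)


def generate_fixed(pipe: FakePipeline, prompt: str, source_image: Any,
                   negative_prompt: str, steps: int) -> Dict[str, Any]:
    kwargs: Dict[str, Any] = {"guidance_scale": 7.5}  # assumed default, illustration only
    options = build_options(source_image, negative_prompt, steps)
    # Fix: merge early so every downstream code path receives the per-request options.
    kwargs.update(options)
    return pipe(prompt, **kwargs)
```

Calling `generate_buggy` leaves `image` out of `pipe.received`, while `generate_fixed` passes `image`, `negative_prompt`, and `num_inference_steps` through, which is the property the commit's mock-based test asserts.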

1030 Commits
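
Commit f347495de9 fixes a "got multiple values for keyword argument 'instruct'" error caused by passing `instruct` explicitly while it was also present in `kwargs`. The sketch below reproduces that class of error in plain Python under stated assumptions: `generate_voice_design` here is a hypothetical stub, not the qwen_tts method, and `call_buggy` / `call_fixed` are illustrative names.

```python
from typing import Any, Dict, Optional


def generate_voice_design(text: str, instruct: Optional[str] = None,
                          **extra: Any) -> Dict[str, Any]:
    """Hypothetical stub standing in for the real voice-design call."""
    return {"text": text, "instruct": instruct, **extra}


def call_buggy(text: str, instruct: str) -> Dict[str, Any]:
    kwargs = {"instruct": instruct}
    # Bug: `instruct` arrives twice, once explicitly and once via **kwargs,
    # so Python raises TypeError before the function body runs.
    return generate_voice_design(text, instruct=instruct, **kwargs)


def call_fixed(text: str, instruct: str) -> Dict[str, Any]:
    kwargs = {"instruct": instruct}
    # Fix: rely on kwargs alone, as the commit message describes.
    return generate_voice_design(text, **kwargs)
```

Python rejects the duplicate at call time regardless of whether the two values agree, so the only remedy is to supply the argument through a single route.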