LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-05-17 04:56:52 -04:00

Author	SHA1	Message	Date
Ettore Di Giacinto	f891d60d26	fix(llama.cpp): bundle libdl, librt, libpthread in llama-cpp backend (#9099 ) chore(llama.cpp): bundle libdl, librt, libpthread in llama-cpp backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-22 00:58:14 +01:00
Ettore Di Giacinto	be25217955	chore(transformers): bump to >5.0 and generically load models (#9097 ) * chore(transformers): bump to >5.0 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: refactor to use generic model loading Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-22 00:57:54 +01:00
LocalAI [bot]	b74111feed	chore: ⬆️ Update ggml-org/llama.cpp to `990e4d96980d0b016a2b07049cc9031642fb9903` (#9095 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-22 00:57:39 +01:00
LocalAI [bot]	bf92117259	chore: ⬆️ Update ggml-org/whisper.cpp to `76684141a5d059be71cbe23dc2f0ed552213ba2d` (#9094 ) ⬆️ Update ggml-org/whisper.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-22 00:57:28 +01:00
Ettore Di Giacinto	031a36c995	feat: inferencing default, automatic tool parsing fallback and wire min_p (#9092 ) * feat: wire min_p Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: inferencing defaults Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(refactor): re-use iterative parser Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: generate automatically inference defaults from unsloth Instead of trying to reinvent the wheel and maintain the inference defaults here, prefer to consume the unsloth ones and contribute there as necessary. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: apply defaults also to models installed via gallery Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: be consistent and apply fallback to all endpoints Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-22 00:57:15 +01:00
LocalAI [bot]	8036d22ec6	chore: ⬆️ Update ace-step/acestep.cpp to `7326a7bea0c2037982ec924f7364e998df70450c` (#9086 ) ⬆️ Update ace-step/acestep.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-22 00:56:52 +01:00
Ettore Di Giacinto	f7e8d9e791	feat(quantization): add quantization backend (#9096 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-22 00:56:34 +01:00
LocalAI [bot]	aa3e82976e	chore: ⬆️ Update ggml-org/llama.cpp to `4cb7e0bd61e7e1101e8ab10db5dee70c5717a386` (#9087 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-21 09:41:11 +01:00
Ettore Di Giacinto	d9c1db2b87	feat: add (experimental) fine-tuning support with TRL (#9088 ) * feat: add fine-tuning endpoint Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(experimental): add fine-tuning endpoint and TRL support This changeset defines new GRPC signatures for fine-tuning backends, and adds a TRL backend as the initial fine-tuning engine. This implementation also supports exporting to GGUF and automatically importing it to LocalAI after fine-tuning. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * commit TRL backend, stop by killing process Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * move fine-tune to generic features Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * add evals, reorder menu Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-21 02:08:02 +01:00
Richard Palethorpe	73bdc3b50d	fix(realtime): Set the alias for opus so the development backend can be selected (#9083 ) Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-03-20 15:08:07 +01:00
Ettore Di Giacinto	c3174f9543	chore(deps): bump llama-cpp to 'a0bbcdd9b6b83eeeda6f1216088f42c33d464e38' (#9079 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-20 08:12:21 +01:00
LocalAI [bot]	7d81bf0aa3	chore: ⬆️ Update ggml-org/whisper.cpp to `9386f239401074690479731c1e41683fbbeac557` (#9077 ) ⬆️ Update ggml-org/whisper.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-19 23:27:35 +01:00
LocalAI [bot]	9a9da062e1	chore: ⬆️ Update ggml-org/llama.cpp to `5744d7ec430e2f875a393770195fda530560773f` (#9063 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-19 07:58:30 +01:00
LocalAI [bot]	dd1a8b174f	chore: ⬆️ Update ggml-org/whisper.cpp to `ef3463bb29ef90d25dfabfd1e75993111c52412d` (#9062 ) ⬆️ Update ggml-org/whisper.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-19 07:58:11 +01:00
LocalAI [bot]	8560a1e571	chore: ⬆️ Update ace-step/acestep.cpp to `ab020a9aefcd364423e0665da12babc6b0c7b507` (#9046 ) ⬆️ Update ace-step/acestep.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-18 08:54:15 +01:00
LocalAI [bot]	29c33e6a6a	chore: ⬆️ Update ggml-org/whisper.cpp to `dc9611662265870df22a7230b7586176a99c1955` (#9045 ) ⬆️ Update ggml-org/whisper.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-18 08:46:35 +01:00
LocalAI [bot]	a58475dbef	chore: ⬆️ Update ggml-org/llama.cpp to `ee4801e5a6ee7ee4063144ab44ab4e127f76fba8` (#9044 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-18 08:46:12 +01:00
LocalAI [bot]	e21ad5cfaa	chore: ⬆️ Update leejet/stable-diffusion.cpp to `545fac4f3fb0117a4e962b1a04cf933a7e635933` (#9036 ) ⬆️ Update leejet/stable-diffusion.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-17 18:07:30 +01:00
LocalAI [bot]	5c5e537b31	chore: ⬆️ Update ace-step/acestep.cpp to `15740f4301b3ec3020875f1fb975a6cfdb2f6767` (#9038 ) ⬆️ Update ace-step/acestep.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-17 10:22:53 +01:00
LocalAI [bot]	118bcee196	chore: ⬆️ Update ggml-org/llama.cpp to `9b342d0a9f2f4892daec065491583ec2be129685` (#9039 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-17 10:22:42 +01:00
LocalAI [bot]	3eabd6d1d0	chore: ⬆️ Update ggml-org/whisper.cpp to `79218f51d02ffe70575ef7fba3496dfc7adda027` (#9037 ) ⬆️ Update ggml-org/whisper.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-17 08:25:31 +01:00
LocalAI [bot]	b2030255ca	chore: ⬆️ Update ggml-org/llama.cpp to `88915cb55c14769738fcab7f1c6eaa6dcc9c2b0c` (#9020 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-16 00:10:11 +01:00
LocalAI [bot]	9f903ec06e	chore: ⬆️ Update leejet/stable-diffusion.cpp to `862a6586cb6fcec037c14f9ed902329ecec7d990` (#9019 ) ⬆️ Update leejet/stable-diffusion.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-16 00:09:59 +01:00
LocalAI [bot]	87525109f1	chore: ⬆️ Update ggml-org/llama.cpp to `3a6f059909ed5dab8587df5df4120315053d57a4` (#9009 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-15 09:46:45 +01:00
Ettore Di Giacinto	c596d8a5d9	fix: Change baseDir assignment to use ModelPath (#9010 ) Fixes: https://github.com/mudler/LocalAI/issues/9005 Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-15 09:45:58 +01:00
LocalAI [bot]	977063c4ba	chore: ⬆️ Update ggml-org/llama.cpp to `e30f1fdf74ea9238ff562901aa974c75aab6619b` (#8997 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-14 01:16:42 +01:00
LocalAI [bot]	0ec3ea4a46	fix(acestep-cpp): resolve relative model paths in options (#8993 ) * fix(acestep-cpp): resolve relative model paths in options The acestep-cpp backend was failing to load models because the model paths in options (text_encoder_model, dit_model, vae_model) were being passed to the C++ code without resolving their relative paths. When a user configures acestep-cpp-turbo-4b, the model paths are specified as relative paths like 'acestep-cpp/acestep-v15-turbo-Q8_0.gguf'. The backend was passing these paths directly to the C++ code without joining them with the model directory. This fix: 1. Gets the base directory from the ModelFile path 2. Resolves all relative paths in options to be absolute paths 3. Adds debug logging to show resolved paths for troubleshooting Fixes #8991 Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * test: fix acestep tests to not join modeldir in options According to code review feedback, the Options array in TestLoadModel and TestSoundGeneration should contain just the model filenames without filepath.Join with modelDir. The model paths are handled internally by the backend. * fix: change bpm parameter type to float32 to match C++ API signature * test: fix TestLoadModel and TestSoundGeneration to use baseDir for model paths - Modified TestLoadModel to compute baseDir from main model path and use it for relative model paths - Modified TestSoundGeneration similarly to use baseDir for model paths - Changed bpm parameter type from int32 to float32 to match C++ API * Apply suggestions from code review Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: localai-bot <localai-bot@users.noreply.github.com> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: localai-bot <localai-bot@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-14 01:16:13 +01:00
Richard Palethorpe	f9a850c02a	feat(realtime): WebRTC support (#8790 ) * feat(realtime): WebRTC support Signed-off-by: Richard Palethorpe <io@richiejp.com> * fix(tracing): Show full LLM opts and deltas Signed-off-by: Richard Palethorpe <io@richiejp.com> --------- Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-03-13 21:37:15 +01:00
LocalAI [bot]	ff21bc6cbb	chore: ⬆️ Update ace-step/acestep.cpp to `5aa065445541094cba934299cd498bbb9fa5c434` (#8984 ) ⬆️ Update ace-step/acestep.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-13 07:59:34 +01:00
LocalAI [bot]	46a8941a2c	chore: ⬆️ Update ggml-org/llama.cpp to `57819b8d4b39d893408e51520dff3d47d1ebb757` (#8983 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-13 07:59:15 +01:00
LocalAI [bot]	c0351b8e6a	Remove HuggingFace backend support (#8971 ) * Remove HuggingFace backend support, restore other backends - Removed backend/go/huggingface directory and all related files - Removed pkg/langchain/huggingface.go - Removed LCHuggingFaceBackend from pkg/model/initializers.go - Removed huggingface backend entries from backend/index.yaml - Updated backend/README.md to remove HuggingFace backend reference - Restored kitten-tts, local-store, silero-vad, piper backends that were incorrectly removed This change removes only HuggingFace backend support from LocalAI as per the P0 priority request in issue #8963, while preserving other backends (kitten-tts, local-store, silero-vad, piper). Signed-off-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> * Remove huggingface backend from test.yml build command The tests-linux CI job was failing because it was trying to build the huggingface backend which no longer exists after the backend removal. This removes huggingface from the build command in test.yml. * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-13 01:09:30 +01:00
Ettore Di Giacinto	a738f8b0e4	feat(backends): add ace-step.cpp (#8965 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-12 18:56:26 +01:00
Richard Palethorpe	b24ca51287	fix(llama-cpp): Set enable_thinking in the correct place (#8973 ) Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-03-12 13:32:29 +01:00
Ettore Di Giacinto	7dc691c171	feat: add fish-speech backend (#8962 ) * feat: add fish-speech backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * drop portaudio Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-12 07:48:23 +01:00
Attila Györffy	5a67b5d73c	Fix image upload processing and img2img pipeline in diffusers backend (#8879 ) * fix: add missing bufio.Flush in processImageFile The processImageFile function writes decoded image data (from base64 or URL download) through a bufio.NewWriter but never calls Flush() before closing the underlying file. Since bufio's default buffer is 4096 bytes, small images produce 0-byte files and large images are truncated, causing PIL to fail with "cannot identify image file". This breaks all image input paths: file, files, and ref_images parameters in /v1/images/generations, making img2img, inpainting, and reference image features non-functional. Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> * fix: merge options into kwargs in diffusers GenerateImage The GenerateImage method builds a local `options` dict containing the source image (PIL), negative_prompt, and num_inference_steps, but never merges it into `kwargs` before calling self.pipe(**kwargs). This causes img2img to fail with "Input is in incorrect format" because the pipeline never receives the image parameter. Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> * test: add unit test for processImageFile base64 decoding Verifies that a base64-encoded PNG survives the write path (encode → decode → bufio.Write → Flush → file on disk) with byte-for-byte fidelity. The test image is small enough to fit entirely in bufio's 4096-byte buffer, which is the exact scenario where the missing Flush() produced a 0-byte file. Also tests that invalid base64 input is handled gracefully. Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> * test: verify GenerateImage merges options into pipeline kwargs Mocks the diffusers pipeline and calls GenerateImage with a source image and negative prompt. Asserts that the pipeline receives the image, negative_prompt, and num_inference_steps via kwargs, the exact parameters that were silently dropped before the fix. Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> * fix: move kwargs.update(options) earlier in GenerateImage Move the options merge right after self.options merge (L742) so that image, negative_prompt, and num_inference_steps are available to all downstream code paths including img2vid and txt2vid. Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> * test: convert processImageFile tests to ginkgo Replace standard testing with ginkgo/gomega to be consistent with the rest of the test suites in the project. Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> --------- Signed-off-by: Attila Györffy <attila+git@attilagyorffy.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-11 08:05:50 +01:00
LocalAI [bot]	270eb956c7	chore: ⬆️ Update ggml-org/llama.cpp to `10e5b148b061569aaee8ae0cf72a703129df0eab` (#8946 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-11 08:04:09 +01:00
LocalAI [bot]	b48920ecf6	chore: ⬆️ Update ggml-org/llama.cpp to `23fbfcb1ad6c6f76b230e8895254de785000be46` (#8921 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-10 07:30:43 +01:00
LocalAI [bot]	515cd968ae	chore: ⬆️ Update leejet/stable-diffusion.cpp to `d6dd6d7b555c233bb9bc9f20b4751eb8c9269743` (#8925 ) ⬆️ Update leejet/stable-diffusion.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-10 07:29:54 +01:00
Ettore Di Giacinto	a026277ab9	feat(mlx-distributed): add new MLX-distributed backend (#8801 ) * feat(mlx-distributed): add new MLX-distributed backend Add new MLX distributed backend with support for both TCP and RDMA for model sharding. This implementation ties in the discovery implementation already in place, and re-uses the same P2P mechanism for the TCP MLX-distributed inferencing. The auto-parallel implementation is inspired by Exo's (Exo has been added to the acknowledgements for the great work!) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * expose a CLI to facilitate backend starting Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: make manual rank0 configurable via model configs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add missing features from mlx backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-09 17:29:32 +01:00
LocalAI [bot]	f06c02d10e	chore: ⬆️ Update ggml-org/llama.cpp to `35bee031e17ed2b2e8e7278b284a6c8cd120d9f8` (#8872 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-08 22:25:04 +01:00
Ettore Di Giacinto	b2f81bfa2e	feat(functions): add peg-based parsing and allow backends to return tool calls directly (#8838 ) * feat(functions): add peg-based parsing Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: support returning toolcalls directly from backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: do run PEG only if backend didn't send deltas Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-08 22:21:57 +01:00
LocalAI [bot]	85e4871d4d	chore: ⬆️ Update leejet/stable-diffusion.cpp to c8fb3d245858d495be1f140efdcfaa0d49de41e5 (#8841 ) * chore: ⬆️ update stable-diffusion.cpp to `c8fb3d245858d495be1f140efdcfaa0d49de41e5` Update stablediffusion-ggml to include fix for SD1 Pix2Pix issue (leejet/stable-diffusion.cpp#1329). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> Signed-off-by: localai-bot <localai-bot@noreply.github.com> * fix: address CI failures in stablediffusion update Signed-off-by: localai-bot <localai-bot@noreply.github.com> * fix: resolve remaining CI failures in stablediffusion update - Move flow_shift to global scope so gen_image() can access the value set during load_model() (was causing compilation error) - Fix sd_type_str array: TQ1_0 should be at index 34, TQ2_0 at index 35 to match upstream SD_TYPE_TQ1_0=34, SD_TYPE_TQ2_0=35 enum values Signed-off-by: localai-bot <localai-bot@noreply.github.com> Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Signed-off-by: localai-bot <localai-bot@noreply.github.com> Co-authored-by: localai-bot <localai-bot@noreply.github.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-08 09:53:08 +01:00
Weathercold	f347495de9	fix(qwen-tts): duplicate instruct argument in voice design mode (#8842 ) Don't pass instruct because it is added to kwargs Fixes the error `qwen_tts.inference.qwen3_tts_model.Qwen3TTSModel.generate_voice_design() got multiple values for keyword argument 'instruct'` Signed-off-by: Weathercold <weathercold.scr@proton.me>	2026-03-08 08:48:22 +01:00
LocalAI [bot]	1296167f84	chore: ⬆️ Update ggml-org/llama.cpp to `c5a778891ba0ddbd4cbb507c823f970595b1adc2` (#8837 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-07 23:28:06 +01:00
LocalAI [bot]	e1df6807dc	chore: ⬆️ Update ggml-org/llama.cpp to `566059a26b0ce8faec4ea053605719d399c64cc5` (#8822 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-06 23:53:23 +01:00
Ettore Di Giacinto	580517f9db	feat: pass-by metadata to predict options (#8795 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-05 22:50:10 +01:00
LocalAI [bot]	0cf7c18177	chore: ⬆️ Update ggml-org/llama.cpp to `a0ed91a442ea6b013bd42ebc3887a81792eaefa1` (#8797 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-05 22:49:45 +01:00
LocalAI [bot]	ac91413eb2	chore: ⬆️ Update ggml-org/whisper.cpp to `30c5194c9691e4e9a98b3dea9f19727397d3f46e` (#8796 ) ⬆️ Update ggml-org/whisper.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-05 22:49:32 +01:00
LocalAI [bot]	f25e450414	chore: ⬆️ Update ggml-org/llama.cpp to `24d2ee052795063afffc9732465ca1b1c65f4a28` (#8777 ) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-03-04 23:25:48 +01:00
Andres	454d8adc76	feat(qwen-tts): Support using multiple voices (#8757 ) * Add support for multiple voice clones in Qwen TTS Signed-off-by: Andres Smith <andressmithdev@pm.me> * Add voice prompt caching and generation logs to see generation time --------- Signed-off-by: Andres Smith <andressmithdev@pm.me> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-03-04 09:47:21 +01:00

1064 Commits