LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-06-28 18:37:43 -04:00

Author	SHA1	Message	Date
Ettore Di Giacinto	449a51ff0b	fix(fish-speech): allow invalid_reference_casting so tokenizers builds on darwin On darwin arm64 the fish-speech editable install (pip install --no-build-isolation -e) compiles the transitive `tokenizers` Python package's Rust extension from source, because there is no prebuilt manylinux wheel for that platform (Linux builds never compile it, so this only breaks on macOS). The pinned tokenizers crate fish-speech's stack resolves to contains a `&T` -> `&mut T` cast that the macOS CI runner's newer Rust toolchain rejects via the now-deny-by-default `invalid_reference_casting` lint: error: casting `&T` to `&mut T` is undefined behavior ... error: could not compile `tokenizers` (lib) due to 1 previous error ERROR: Failed building wheel for tokenizers This failed the fish-speech darwin/metal (mps) backend image build in the v4.5.5 release CI while all Linux variants built fine. Fix: export RUSTFLAGS with `-A invalid_reference_casting` (appended to any existing value, not clobbering) before installRequirements so the unchanged third-party crate compiles as it did under the older toolchain. Version-agnostic and harmless on Linux, where no Rust compile happens. Assisted-by: Claude:claude-opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-06-28 09:27:02 +00:00
LocalAI [bot]	8aba4fdba3	chore(fish-speech): drop the darwin/metal build target (#10561) The fish-speech metal-darwin-arm64 backend build has been failing on every release (v4.5.3, v4.5.4, v4.5.5) and is a standing red on the darwin backend matrix. fish-speech pulls `tokenizers` transitively from its upstream source (`pip install -e fish-speech-src`), and on darwin/arm64 there is no prebuilt wheel for the pinned old `tokenizers` version, so pip builds it from source. Modern rustc rejects that old crate as a hard error: error: casting `&T` to `&mut T` is undefined behavior ... --> tokenizers-lib/src/models/bpe/trainer.rs:517:47 = note: `#[deny(invalid_reference_casting)]` on by default error: could not compile `tokenizers` (lib) due to 1 previous error This is deterministic, not a flake, and there is no clean fix that does not either pin a stale Rust toolchain or downgrade a soundness lint guarding real UB. Until upstream fish-speech moves to a tokenizers version that compiles on current toolchains, drop darwin support so the release backend build stays green. The Linux/CUDA/ROCm/Intel/L4T variants are unaffected. Removes: - the `-metal-darwin-arm64-fish-speech` entry from `includeDarwin` in backend-matrix.yml - the `metal:` capability mappings and the concrete `metal-fish-speech` / `metal-fish-speech-development` gallery entries in backend/index.yaml - the now-unused darwin-only requirements-mps.txt Assisted-by: Claude:claude-opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Co-authored-by: Ettore Di Giacinto <mudler@localai.io>	2026-06-27 23:24:21 +02:00
LocalAI [bot]	0258f8af55	fix(backends): repair release CI build/test breaks (kokoros, fish-speech, llama-cpp-quantization, sglang) (#10547) * fix(kokoros): implement new Backend RPCs to fix the build The backend.proto grew six RPCs (SoundDetection, Depth, TokenClassify, Score and the bidi-streaming Forward) that the kokoros gRPC service never implemented, so the trait impl no longer satisfies `Backend`: error[E0046]: not all trait items implemented, missing: `sound_detection`, `depth`, `token_classify`, `score`, `ForwardStream`, `forward` kokoros is a TTS backend with no use for these, so add `unimplemented` stubs (plus the `ForwardStream` associated type) matching the existing pattern for every other unsupported RPC in this file. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Assisted-by: Claude:claude-opus-4-8 [Claude Code] * fix(fish-speech): add setuptools-rust for the editable source install install.sh installs the fish-speech source tree editable with `--no-build-isolation`, which means the build backends of its transitive dependencies must already be present in the venv. One of them builds a Rust extension and its metadata step fails with: ModuleNotFoundError: No module named 'setuptools_rust' Add setuptools-rust to requirements.txt so installRequirements provisions it before the editable install runs. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Assisted-by: Claude:claude-opus-4-8 [Claude Code] * fix(llama-cpp-quantization): vendor convert_hf_to_gguf.py with conversion/ Upstream llama.cpp split the model-specific logic out of the single convert_hf_to_gguf.py file into a sibling `conversion/` package, so the script now starts with `from conversion import ...`. Downloading just the one file therefore fails at runtime with: ModuleNotFoundError: No module named 'conversion' Clone the repo (reusing the clone already needed to build llama-quantize) and copy both the script and the `conversion/` package into the backend dir. Python puts the script's own directory on sys.path[0], so the package resolves when it sits beside the script. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Assisted-by: Claude:claude-opus-4-8 [Claude Code] * fix(sglang): pin the CPU source build to sglang v0.5.11 The CPU profile builds sgl-kernel from a `git clone` of sglang with no ref, so it always tracks master. Recent master added CPU kernels (e.g. mamba/fla.cpp) that fail to compile in our builder: constexpr variable 'scale' must be initialized by a constant static library kineto_LIBRARY-NOTFOUND not found Pin the clone to v0.5.11, the same release the GPU path already floors on (requirements-cublas12-after.txt). Overridable via SGLANG_VERSION so the pin can be bumped deliberately. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Assisted-by: Claude:claude-opus-4-8 [Claude Code] --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Co-authored-by: Ettore Di Giacinto <mudler@localai.io>	2026-06-27 09:42:22 +02:00
Ettore Di Giacinto	151ad271f2	feat(rocm): bump to 7.x (#9323) feat(rocm): bump to 7.2.1 Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-04-12 08:51:30 +02:00
Ettore Di Giacinto	59108fbe32	feat: add distributed mode (#9124) * feat: add distributed mode (experimental) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix data races, mutexes, transactions Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactorings Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix events and tool stream in agent chat Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * use ginkgo Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring and consolidation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring and consolidation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring and consolidation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring and consolidation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring and consolidation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring and consolidation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring and consolidation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring and consolidation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(cron): compute correctly time boundaries avoiding re-triggering Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * enhancements, refactorings Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * do not flood of healthy checks Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * do not list obvious backends as text backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * tests fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring and consolidation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop redundant healthcheck Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * enhancements, refactorings Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-30 00:47:27 +02:00
Ettore Di Giacinto	7dc691c171	feat: add fish-speech backend (#8962) * feat: add fish-speech backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * drop portaudio Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-03-12 07:48:23 +01:00
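
Commit 449a51ff0b describes its fix as exporting RUSTFLAGS with `-A invalid_reference_casting` appended to any existing value rather than clobbering it. The sketch below illustrates that "append, don't overwrite" idea only. It is not the repository's install script: the function name `append_rustflags` and the choice of Python are assumptions made for illustration, and only the flag and the RUSTFLAGS variable come from the commit message.

```python
import os

# The lint-allow flag named in the commit message.
LINT_FLAG = "-A invalid_reference_casting"


def append_rustflags(env, flag=LINT_FLAG):
    """Return a copy of env with flag appended to RUSTFLAGS.

    Hypothetical helper: any existing RUSTFLAGS value is kept, and the
    flag is not added twice if it is already present.
    """
    updated = dict(env)
    existing = updated.get("RUSTFLAGS", "").strip()
    if flag not in existing:
        # Join with a single space; strip() covers the empty/unset case.
        updated["RUSTFLAGS"] = f"{existing} {flag}".strip()
    return updated


if __name__ == "__main__":
    # Build the environment a child build process could be launched with.
    build_env = append_rustflags(os.environ)
    print(build_env["RUSTFLAGS"])
```

Returning a new mapping instead of mutating `os.environ` keeps the change scoped to whatever child process is started with it, which matches the commit's point that the flag only matters for the step that compiles the Rust extension.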

6 Commits