LocalAI/backend/go at 7cbb743b2549200bee5ef6f228febcc96e6cb28a - LocalAI - Gitea: Git with a cup of tea

mirror/LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-06-25 00:59:28 -04:00

Files

History

Ettore Di Giacinto 7cbb743b25 feat(recon): enable cuDNN conv path on arm64+CUDA13 recon backends

The voice-detect.cpp / face-detect.cpp engines have an opt-in cuDNN
implicit-GEMM conv path behind VOICEDETECT_GGML_CUDNN / FACEDETECT_GGML_CUDNN
(default OFF) that kills im2col on the GPU and reaches torch-cuDNN parity
(SCRFD 2.3x, WeSpeaker/ERes2Net parity), measured on the GB10
(arm64, CUDA 13, sm_121a).

Enable it for the CUDA build, but only where cuDNN actually ships: the
arm64 + CUDA 13 image (GB10/Jetson/L4T). x86 CUDA images carry no cuDNN,
so flipping it on globally for BUILD_TYPE=cublas would be a link failure.
The Makefiles gate on CUDA_MAJOR_VERSION=13 + arch (TARGETARCH from the
matrix/Docker build, uname -m fallback for local builds).

backend/Dockerfile.golang already installs the runtime libcudnn9-cuda-13
in the arm64+CUDA13 apt block; add the matching libcudnn9-dev-cuda-13 so
the build-time link resolves.

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Assisted-by: Claude:claude-opus-4-8 [Claude Code]

2026-06-24 15:54:12 +00:00

..

chore(acestep-cpp): bump pin to ed53caf and adapt wrapper to new API (#9908 )

2026-05-20 21:05:32 +00:00

feat(ced): sound-event classification backend (CED audio tagger) (#10425 )

2026-06-22 01:00:28 +02:00

fix(distributed): self-heal stale 'model not loaded' routing (#10181 )

2026-06-05 09:01:36 +02:00

fix(crispasr): filter garbage words from parakeet word-level timestamps (#10421 )

2026-06-21 17:03:33 +02:00

depth-anything-cpp

feat(gallery): add Depth Anything V2 models + bump native version (#10413 )

2026-06-20 14:56:16 +02:00

feat(recon): enable cuDNN conv path on arm64+CUDA13 recon backends

2026-06-24 15:54:12 +00:00

feat: add distributed mode (#9124 )

2026-03-30 00:47:27 +02:00

feat(middleware): Model routing, PII filtering, Cloud model proxies (#9802 )

2026-05-25 09:28:27 +02:00

feat(localvqe/audio): v1.3 release and add spectrograms to audio transform UI (#10113 )

2026-05-31 23:56:46 +02:00

locate-anything-cpp

chore: ⬆️ Update mudler/locate-anything.cpp to 92c1682da792c1e8a5dec91acc2be4b02c742ded (#10282 )

2026-06-13 09:01:17 +02:00

chore: ⬆️ Update ServeurpersoCom/omnivoice.cpp to 96d30169afd5e6bb3fd6a0e9be0eb505bfe81fcd (#10408 )

2026-06-20 01:36:22 +02:00

feat: add distributed mode (#9124 )

2026-03-30 00:47:27 +02:00

chore: ⬆️ Update mudler/parakeet.cpp to db755a78d39f789bb7d4e3935158a9e8105dbe36 (#10393 )

2026-06-20 01:37:33 +02:00

fix(package.sh): drop redundant -a and -R

2026-02-05 16:39:38 +01:00

chore: ⬆️ Update ServeurpersoCom/qwentts.cpp to 4536dcdce27c3764a93a06d6bf64026b124962f5 (#10431 )

2026-06-22 01:00:10 +02:00

chore: ⬆️ Update mudler/rf-detr.cpp to 65c0ffcc9a9bc9dae38252f63d0417c9845a6cf7 (#10075 )

2026-05-30 00:55:41 +02:00

chore: add golangci-lint with new-from-merge-base baseline (#9603 )

2026-04-28 22:07:44 +02:00

feat(sherpa-onnx): add Kokoro TTS + multilingual Piper voices (#10309 )

2026-06-13 21:27:27 +02:00

fix(package.sh): drop redundant -a and -R

2026-02-05 16:39:38 +01:00

stablediffusion-ggml

chore: ⬆️ Update leejet/stable-diffusion.cpp to b12098f5d09fc83da36e65c784f7bdb16a5a5ebf (#10429 )

2026-06-22 00:57:33 +02:00

feat(supertonic): add Supertonic ONNX TTS backend (CPU) (#10342 )

2026-06-15 16:54:11 +02:00

fix(darwin): fix vibevoice-cpp build linkage + fail-safe go backend packaging (#10276 )

2026-06-12 23:13:50 +02:00

feat(recon): enable cuDNN conv path on arm64+CUDA13 recon backends

2026-06-24 15:54:12 +00:00

feat(whisper): honor client cancellation via ggml abort_callback (#9710 )

2026-05-08 01:44:47 +02:00

chore: ⬆️ Update ggml-org/whisper.cpp to 5ed76e9a079962f1c85cfce44edd325c27ef1f97 (#10396 )

2026-06-20 01:37:06 +02:00