From 628b8a8e0126d359ad6b43fdb5f08c0292e20c9b Mon Sep 17 00:00:00 2001 From: Ettore Di Giacinto Date: Wed, 24 Jun 2026 13:03:48 +0000 Subject: [PATCH] chore(recon): bump pins to GPU persistent-graph + multi-model-safe cache (voice 45d2e6b, face 0a4799a) GPU wins (CUDA/ggml backend, no CPU-path change): persistent per-shape graph+context cache in Backend::compute() eliminates the per-call cudaGraph re-instantiation churn -> wav2vec2 emotion+age-gender now AT GPU parity with torch-cuDNN on GB10 (0.97-0.98x), CAM++ -5.7ms; bit-identical parity. Cache hardened multi-model-safe (invalidate-on-free keyed by the ModelLoader weights buffer) so LocalAI multi-model hosting cannot stale-hit. Conv models still trail cuDNN (im2col-materialization-bound) - cuDNN implicit-GEMM lever next. Signed-off-by: Ettore Di Giacinto Assisted-by: Claude:claude-opus-4-8 [Claude Code] --- backend/go/face-detect/Makefile | 4 ++-- backend/go/voice-detect/Makefile | 4 ++-- 2 files changed, 4 insertions(+), 4 deletions(-) diff --git a/backend/go/face-detect/Makefile b/backend/go/face-detect/Makefile index 1c1ee4e81..1d1da8824 100644 --- a/backend/go/face-detect/Makefile +++ b/backend/go/face-detect/Makefile @@ -1,6 +1,6 @@ # face-detect backend Makefile. # -# Upstream pin lives below as FACEDETECT_VERSION?=47fdab6... (.github/bump_deps.sh +# Upstream pin lives below as FACEDETECT_VERSION?=0a4799a... (.github/bump_deps.sh # can find and update it - matches the voice-detect / parakeet.cpp / whisper.cpp # convention). # @@ -14,7 +14,7 @@ # The default target below does the proper clone-at-pin + cmake build so CI does # not need a side-checkout. -FACEDETECT_VERSION?=47fdab66af84bec8949e683f9f8ab80d7d37253f +FACEDETECT_VERSION?=0a4799a8a1febf666487e708160dba0c363d2af2 FACEDETECT_REPO?=https://github.com/mudler/face-detect.cpp GOCMD?=go diff --git a/backend/go/voice-detect/Makefile b/backend/go/voice-detect/Makefile index b70db8f84..0eb1629fd 100644 --- a/backend/go/voice-detect/Makefile +++ b/backend/go/voice-detect/Makefile @@ -1,6 +1,6 @@ # voice-detect backend Makefile. # -# Upstream pin lives below as VOICEDETECT_VERSION?=99b1804... (.github/bump_deps.sh +# Upstream pin lives below as VOICEDETECT_VERSION?=45d2e6b... (.github/bump_deps.sh # can find and update it - matches the parakeet.cpp / whisper.cpp / ds4 convention). # # Local dev shortcut: if you already have an out-of-tree voice-detect.cpp build, @@ -13,7 +13,7 @@ # The default target below does the proper clone-at-pin + cmake build so CI does # not need a side-checkout. -VOICEDETECT_VERSION?=99b18045295b389c1085c510cc7554b043bb8404 +VOICEDETECT_VERSION?=45d2e6b074bd4ca124e6ea2b39f9f549578e294f VOICEDETECT_REPO?=https://github.com/mudler/voice-detect.cpp GOCMD?=go