From 48e22da165b3d399c96207f8f69e2693afdb053f Mon Sep 17 00:00:00 2001
From: Ettore Di Giacinto
Date: Tue, 23 Jun 2026 16:25:39 +0000
Subject: [PATCH] chore(recon): bump pins to MLAS-class direct-conv engines (voice 7ecfd07, face be22d67)

Hand-tuned nChw16c AVX-512 register-tiled direct-conv microkernel
(~263 GFLOP/s, within 6-7% of MLAS per-op efficiency),
runtime-CPUID-dispatched + AVX2 fallback, fused bias/relu.

voice 7ecfd07: default 3x3-s1 kernel for WeSpeaker (+37%/+32%) +
ERes2Net, CAM++ pinned to Winograd.

face be22d67: shape-gated to the ArcFace recognizer body (+25-27% @8t);
SCRFD detector stays on Winograd (no regression).

Parity cosine=1.0 / detect <=1px on AVX-512 + AVX2 paths. Portable
single binaries.

Signed-off-by: Ettore Di Giacinto
Assisted-by: Claude:claude-opus-4-8 [Claude Code]
---
 backend/go/face-detect/Makefile  | 4 ++--
 backend/go/voice-detect/Makefile | 4 ++--
 2 files changed, 4 insertions(+), 4 deletions(-)

diff --git a/backend/go/face-detect/Makefile b/backend/go/face-detect/Makefile
index 00d20bf01..1ac3fc4b7 100644
--- a/backend/go/face-detect/Makefile
+++ b/backend/go/face-detect/Makefile
@@ -1,6 +1,6 @@
 # face-detect backend Makefile.
 #
-# Upstream pin lives below as FACEDETECT_VERSION?=2d2d5f0... (.github/bump_deps.sh
+# Upstream pin lives below as FACEDETECT_VERSION?=be22d67... (.github/bump_deps.sh
 # can find and update it - matches the voice-detect / parakeet.cpp / whisper.cpp
 # convention).
 #
@@ -14,7 +14,7 @@
 # The default target below does the proper clone-at-pin + cmake build so CI does
 # not need a side-checkout.
 
-FACEDETECT_VERSION?=2d2d5f05d4db6718ce8f0a7fc400796ddf510f40
+FACEDETECT_VERSION?=be22d67145a8bcd879f45ad33fbea03131c5922b
 FACEDETECT_REPO?=https://github.com/mudler/face-detect.cpp
 
 GOCMD?=go
diff --git a/backend/go/voice-detect/Makefile b/backend/go/voice-detect/Makefile
index 2da764ced..7344929ab 100644
--- a/backend/go/voice-detect/Makefile
+++ b/backend/go/voice-detect/Makefile
@@ -1,6 +1,6 @@
 # voice-detect backend Makefile.
 #
-# Upstream pin lives below as VOICEDETECT_VERSION?=f7b9f89... (.github/bump_deps.sh
+# Upstream pin lives below as VOICEDETECT_VERSION?=7ecfd07... (.github/bump_deps.sh
 # can find and update it - matches the parakeet.cpp / whisper.cpp / ds4 convention).
 #
 # Local dev shortcut: if you already have an out-of-tree voice-detect.cpp build,
@@ -13,7 +13,7 @@
 # The default target below does the proper clone-at-pin + cmake build so CI does
 # not need a side-checkout.
 
-VOICEDETECT_VERSION?=f7b9f89dc004f096e32e367e24b154765fe7584c
+VOICEDETECT_VERSION?=7ecfd07a50a831d2e15ee3c2d72a669703f84f47
 VOICEDETECT_REPO?=https://github.com/mudler/voice-detect.cpp
 
 GOCMD?=go