Ettore Di Giacinto
48e22da165
chore(recon): bump pins to MLAS-class direct-conv engines (voice 7ecfd07, face be22d67)
Hand-tuned nChw16c AVX-512 register-tiled direct-conv microkernel (~263 GFLOP/s,
within 6-7% of MLAS per-op efficiency), runtime-CPUID-dispatched + AVX2 fallback,
fused bias/relu. voice 7ecfd07: default 3x3-s1 kernel for WeSpeaker (+37%/+32%)
+ ERes2Net, CAM++ pinned to Winograd. face be22d67: shape-gated to the ArcFace
recognizer body (+25-27% @8t); SCRFD detector stays on Winograd (no regression).
Parity cosine=1.0 / detect <=1px on AVX-512 + AVX2 paths. Portable single binaries.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Assisted-by: Claude:claude-opus-4-8 [Claude Code]
2026-06-23 16:25:39 +00:00
..
2026-05-20 21:05:32 +00:00
2026-06-22 01:00:28 +02:00
2026-06-05 09:01:36 +02:00
2026-06-21 17:03:33 +02:00
2026-06-20 14:56:16 +02:00
2026-06-23 16:25:39 +00:00
2026-03-30 00:47:27 +02:00
2026-05-25 09:28:27 +02:00
2026-05-31 23:56:46 +02:00
2026-06-13 09:01:17 +02:00
2026-06-20 01:36:22 +02:00
2026-03-30 00:47:27 +02:00
2026-06-20 01:37:33 +02:00
2026-02-05 16:39:38 +01:00
2026-06-22 01:00:10 +02:00
2026-05-30 00:55:41 +02:00
2026-04-28 22:07:44 +02:00
2026-06-13 21:27:27 +02:00
2026-02-05 16:39:38 +01:00
2026-06-22 00:57:33 +02:00
2026-06-15 16:54:11 +02:00
2026-06-12 23:13:50 +02:00
2026-06-23 16:25:39 +00:00
2026-05-08 01:44:47 +02:00
2026-06-20 01:37:06 +02:00