Ettore Di Giacinto
62c407ed55
docs(paged): lever1 gather-fusion bench landed - checkpoint + attribution (patch 0028)

Anchors the same-session A/B validation of patch 0028 (residual conv-state tap
k_get_rows fusion) on this worktree branch, with sign-off attribution. The
regenerated 0028 patch and the bench-updated LEVER1_GATHER_RESULTS.md first landed
via a concurrent origin/master merge (c1f1d1e8e) that swept up the staged files;
this commit records their provenance and the bench summary in the checkpoint.

Gate (bit-exact, greedy, --temp 0 --seed 1 -n 48):
  dense q36-27b-nvfp4:     5951a5b4d624ce891e22ab5fca9bc439
  MoE   q36-35b-a3b-nvfp4: 07db32c2bcb78d17a43ed18bc22705cd
  (both == baseline; base == lever1)

decode_agg npl128:
  dense: 369.95 -> 377.83 t/s (+2.13%, 96.6% of vLLM)
  MoE:   763.47 -> 777.95 t/s (+1.90%, 86.3% of vLLM)

nsys MoE decode:
  k_get_rows_float: 17334 -> 15414 inst (-1920), 358.37 -> 133.52 ms, step -3.13 ms

Assisted-by: Claude:opus-4.8 [Claude Code]
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2026-06-26 21:41:45 +00:00
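
The deltas quoted in the commit message can be rechecked from its raw before/after figures. The snippet below is only a minimal sketch of that arithmetic: the helper `pct_gain` and the variable names are illustrative and do not come from the repository or its bench scripts.

```python
# Recompute the deltas quoted in the commit message from its raw figures.
# All names here are illustrative; nothing below is part of the repository.

def pct_gain(before: float, after: float) -> float:
    """Relative change from `before` to `after`, in percent."""
    return (after - before) / before * 100.0

# decode_agg npl128 throughput, tokens per second (from the commit message)
dense_gain = pct_gain(369.95, 377.83)
moe_gain = pct_gain(763.47, 777.95)

# nsys MoE decode: k_get_rows_float instance count and total kernel time
instances_saved = 17334 - 15414
kernel_ms_saved = 358.37 - 133.52

print(f"dense decode_agg: {dense_gain:+.2f}%")
print(f"MoE decode_agg:   {moe_gain:+.2f}%")
print(f"k_get_rows_float instances removed: {instances_saved}")
print(f"k_get_rows_float time saved: {kernel_ms_saved:.2f} ms")
```

The two percentages round to the +2.13% and +1.90% stated in the message, and the instance difference matches the quoted -1920.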