Ettore Di Giacinto
7c45447c9e
docs(paged): FUTURE_LEVERS - parked decode-parity exploration trail
...
Ranked pick-up points after the 95%-bit-exact plateau: hybrid-precision SSM state
(per-head f32/bf16 split - the bf16 error is concentrated in long-memory heads, so
a split could capture most of the +25-31% while passing the f32 KL gate), dense
CUDA-graph instability, the rms_norm->fp4 fold (flat-risk), datacenter Blackwell
sm_100 (no LPDDR5x floor), adaptive prefill budget, MoE-specific recurrence tuning.
Assisted-by: Claude:opus-4.8 [Claude Code]
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2026-06-26 00:53:09 +00:00
..
2026-06-22 09:22:36 +00:00
2026-06-22 09:22:36 +00:00
2026-06-22 09:22:36 +00:00
2026-06-22 09:22:36 +00:00
2026-06-22 09:22:36 +00:00
2026-06-22 10:14:27 +00:00
2026-06-22 10:47:10 +00:00
2026-06-22 15:03:16 +00:00
2026-06-22 18:04:09 +00:00
2026-06-22 20:37:12 +00:00
2026-06-22 22:38:28 +00:00
2026-06-23 09:13:08 +00:00
2026-06-23 09:55:32 +00:00
2026-06-23 13:49:15 +00:00
2026-06-23 19:04:55 +00:00
2026-06-24 07:48:20 +00:00
2026-06-24 17:58:00 +00:00
2026-06-24 22:45:49 +00:00
2026-06-24 23:47:51 +00:00
2026-06-25 10:41:38 +00:00
2026-06-25 16:56:35 +00:00
2026-06-25 18:34:17 +00:00
2026-06-25 21:49:15 +00:00
2026-06-24 21:45:42 +00:00
2026-06-22 09:22:36 +00:00
2026-06-25 16:46:59 +00:00
2026-06-26 00:49:49 +00:00
2026-06-26 00:49:49 +00:00
2026-06-25 16:55:25 +00:00
2026-06-25 15:24:49 +00:00
2026-06-23 22:48:31 +00:00
2026-06-25 16:56:35 +00:00
2026-06-25 15:03:18 +00:00
2026-06-22 15:44:24 +00:00
2026-06-25 09:06:50 +00:00
2026-06-24 14:31:35 +00:00
2026-06-26 00:53:09 +00:00
2026-06-24 11:21:44 +00:00
2026-06-25 15:27:04 +00:00
2026-06-25 10:41:38 +00:00
2026-06-23 19:04:55 +00:00
2026-06-23 13:17:03 +00:00
2026-06-25 21:49:15 +00:00
2026-06-23 13:49:15 +00:00
2026-06-25 21:49:15 +00:00
2026-06-25 18:34:17 +00:00
2026-06-24 10:56:13 +00:00
2026-06-22 12:59:09 +00:00
2026-06-22 11:50:01 +00:00
2026-06-22 14:16:52 +00:00
2026-06-22 13:48:01 +00:00
2026-06-23 21:39:22 +00:00
2026-06-25 22:42:08 +00:00
2026-06-23 12:22:15 +00:00
2026-06-24 23:47:51 +00:00
2026-06-24 17:58:00 +00:00
2026-06-24 07:44:07 +00:00