Ettore Di Giacinto
4d171e62bb
docs(paged): reject MTP serving lever
Add the repeatable MTP serving A/B runner and record Phase 15 results showing current llama-server MTP regresses GB10 serving throughput despite passing inference gates.
Assisted-by: Codex:gpt-5
2026-07-01 02:29:28 +00:00