LocalAI/backend/cpp/llama-cpp-localai-paged/docs
Ettore Di Giacinto 902bcc7717 docs(paged): validate TTFT prefill-first A/B
Record Phase56 MoE and lower-concurrency validation for the TTFT prefill-first policy, including DGX gates and the opt-in-only decision.

Assisted-by: Codex:gpt-5
2026-07-01 10:05:23 +00:00