LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-07-03 12:57:02 -04:00

Files

Ettore Di Giacinto ff3f0620de chore(paged): add current serving snapshot harness

Add a reusable current-stack paged-vs-vLLM serving snapshot harness that targets the clean DGX mirror, enforces idle/lock preflight, runs pre/post inference gates, and records ratio summaries.

Assisted-by: Codex:gpt-5

2026-07-01 03:19:36 +00:00

plans

chore(paged): add current serving snapshot harness

2026-07-01 03:19:36 +00:00

specs

docs(paged): gate MTP rollback safety

2026-07-01 02:15:11 +00:00