mirror of
https://github.com/exo-explore/exo.git
synced 2026-03-06 15:17:36 -05:00
Repeatedly sends chat completions to Llama-3.2-1B-Instruct-4bit on a single node and detects when a request stalls for >5s. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>