LocalAI/core at 9f41e69bc3b1faa89f3202223fc0b599fa175fee - LocalAI - Gitea: Git with a cup of tea

mirror/LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-06-05 07:16:10 -04:00

Files

History

Ettore Di Giacinto 9f41e69bc3 fix(distributed): self-heal stale 'model not loaded' routing

In distributed mode the registry can list a model as loaded on a node
while the worker has evicted it (autonomous LRU eviction, an out-of-band
unload, etc.) yet the backend process survives. The router's cached-node
check only verifies the process is alive (probeHealth), so it routes there
and inference fails with "<backend>: model not loaded" — and stays broken
until the controller restarts and rebuilds its registry.

InFlightTrackingClient now reconciles this: when a tracked inference call
returns a model-not-loaded error, it drops the stale replica row
(RemoveNodeModel) so the next request reloads the model on a healthy node
instead of routing back to the evicted one. The original error is returned
unchanged; only the registry is corrected.

Assisted-by: Claude:claude-opus-4-8 go vet
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

2026-06-04 23:00:50 +00:00

..

feat(distributed): Add NATS JWT authentication and TLS/mTLS options (#10159 )

2026-06-03 19:43:56 +02:00

feat(tts): support per-request instructions and params (#10172 )

2026-06-04 11:45:02 +02:00

feat(tts): support per-request instructions and params (#10172 )

2026-06-04 11:45:02 +02:00

security(http): refuse redirects on outbound clients via hardened pkg/httpclient (#10087 )

2026-05-30 12:04:10 +02:00

fix(config): add face/speaker recognition constants and register insightface + speaker-recognition (#10110 )

2026-06-04 21:48:01 +02:00

dependencies_manager

feat(ui): move to React for frontend (#8772 )

2026-03-05 21:47:12 +01:00

feat(middleware): Model routing, PII filtering, Cloud model proxies (#9802 )

2026-05-25 09:28:27 +02:00

feat(parakeet-cpp): add NVIDIA NeMo Parakeet ASR backend (parakeet.cpp) (#10084 )

2026-05-30 14:46:10 +02:00

feat(tts): support per-request instructions and params (#10172 )

2026-06-04 11:45:02 +02:00

feat: add distributed mode (#9124 )

2026-03-30 00:47:27 +02:00

feat(tts): support per-request instructions and params (#10172 )

2026-06-04 11:45:02 +02:00

fix(distributed): self-heal stale 'model not loaded' routing

2026-06-04 23:00:50 +00:00

feat(gallery): verify backend OCI images with keyless cosign (#9823 )

2026-05-18 08:02:20 +02:00

fix(openresponses): populate Content and accept bare {role,content} items (#10039 ) (#10040 )

2026-05-28 07:21:48 +00:00

feat(middleware): Model routing, PII filtering, Cloud model proxies (#9802 )

2026-05-25 09:28:27 +02:00