LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-06-28 10:27:30 -04:00

Files

LocalAI [bot] f3d829e2ef feat(distributed): add LOCALAI_DISTRIBUTED_SHARED_MODELS to skip staging on shared volumes (#10556 ) (#10566 )

In distributed mode, even when the frontend and workers share the same
models directory via a shared volume mount, starting a model on a worker
re-staged (re-downloaded) it: stageModelFiles always uploads model files
into a tracking-key-namespaced subdir on the worker, and the staging probe
only checks that staged location, so a file already present on the shared
volume at the canonical path was never reused.

Add a config switch LOCALAI_DISTRIBUTED_SHARED_MODELS (default false). When
enabled, the operator asserts that all nodes mount the SAME models directory
at the SAME path, so staging is unnecessary: the frontend's absolute model
paths are already valid on the worker. In that mode stageModelFiles returns
the cloned opts unchanged without uploading, leaving the path fields pointing
at their canonical absolute paths so the worker loads them directly from the
shared volume.

The value is plumbed from DistributedConfig through SmartRouterOptions into
the SmartRouter. Docs and docker-compose.distributed.yaml updated.


Assisted-by: Claude:claude-opus-4-8 [Claude Code]

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Co-authored-by: Ettore Di Giacinto <mudler@localai.io>

2026-06-28 01:23:07 +02:00

advisorylock

fix(auth): make advisory locks dialect-aware and harden SQLite DSN (#10509 )

2026-06-25 17:18:55 +02:00

agentpool

feat(distributed): SyncedMap component + migrate finetune/quant/agent-tasks to cross-replica state (#10542 )

2026-06-27 23:23:51 +02:00

agents

fix(agents): emit chat event timestamps in milliseconds (#9867 ) (#10243 )

2026-06-12 23:18:44 +02:00

cloudproxy

feat(pii): NER tier engine — privacy-filter.cpp backend + NER-centric PII filter (#10360 )