LocalAI/core/services at d2dbb81af47610d270bd943a441f5a5a2c76a53d

mirror/LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-06-27 18:06:58 -04:00

Ettore Di Giacinto d2dbb81af4 feat(distributed): add LOCALAI_DISTRIBUTED_SHARED_MODELS to skip staging on shared volumes (#10556)

In distributed mode, even when the frontend and workers share the same
models directory via a shared volume mount, starting a model on a worker
re-staged (re-downloaded) it: stageModelFiles always uploads model files
into a tracking-key-namespaced subdir on the worker, and the staging probe
only checks that staged location, so a file already present on the shared
volume at the canonical path was never reused.

Add a config switch LOCALAI_DISTRIBUTED_SHARED_MODELS (default false). When
enabled, the operator asserts that all nodes mount the SAME models directory
at the SAME path, so staging is unnecessary: the frontend's absolute model
paths are already valid on the worker. In that mode stageModelFiles returns
the cloned opts unchanged without uploading, leaving the path fields pointing
at their canonical absolute paths so the worker loads them directly from the
shared volume.

The value is plumbed from DistributedConfig through SmartRouterOptions into
the SmartRouter. Docs and docker-compose.distributed.yaml updated.
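The behaviour described above can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: only the names stageModelFiles, DistributedConfig, SmartRouterOptions and SmartRouter come from the message; the SharedModels field, the ModelOptions type, the NewSmartRouter constructor, the uploads slice and the "staged/<tracking key>/" layout are assumptions made for the example.

```go
package main

import (
	"fmt"
	"path"
)

// ModelOptions is a hypothetical stand-in for the load options sent to a worker.
type ModelOptions struct {
	ModelFile string // absolute path of the model as seen by the frontend
}

// DistributedConfig and SmartRouterOptions are named in the commit message;
// the SharedModels field name is an assumption.
type DistributedConfig struct {
	SharedModels bool // would be set from LOCALAI_DISTRIBUTED_SHARED_MODELS, default false
}

type SmartRouterOptions struct {
	SharedModels bool
}

// SmartRouter here only carries the flag and records simulated uploads.
type SmartRouter struct {
	sharedModels bool
	uploads      []string // hypothetical: staged destinations, for illustration only
}

// NewSmartRouter is a hypothetical constructor showing how the flag is plumbed.
func NewSmartRouter(o SmartRouterOptions) *SmartRouter {
	return &SmartRouter{sharedModels: o.SharedModels}
}

// stageModelFiles returns a copy of opts. With shared models enabled the copy is
// returned unchanged and nothing is uploaded; otherwise the path is rewritten to
// a tracking-key-namespaced location (the layout below is assumed).
func (r *SmartRouter) stageModelFiles(opts ModelOptions, trackingKey string) ModelOptions {
	cloned := opts // value copy, so the caller's opts are never mutated
	if r.sharedModels {
		return cloned
	}
	staged := path.Join("staged", trackingKey, path.Base(opts.ModelFile))
	r.uploads = append(r.uploads, staged) // stands in for the real upload
	cloned.ModelFile = staged
	return cloned
}

func main() {
	opts := ModelOptions{ModelFile: "/models/llama.gguf"}

	cfg := DistributedConfig{SharedModels: true}
	shared := NewSmartRouter(SmartRouterOptions{SharedModels: cfg.SharedModels})
	fmt.Println(shared.stageModelFiles(opts, "node-a").ModelFile) // /models/llama.gguf

	staging := NewSmartRouter(SmartRouterOptions{})
	fmt.Println(staging.stageModelFiles(opts, "node-a").ModelFile) // staged/node-a/llama.gguf
}
```

With the flag set, the worker receives the frontend's canonical path, which is only valid if every node really mounts the same directory at the same path, as the message requires.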

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Assisted-by: Claude:claude-opus-4-8 [Claude Code]

2026-06-27 22:02:04 +00:00

fix(auth): make advisory locks dialect-aware and harden SQLite DSN (#10509)

2026-06-25 17:18:55 +02:00

feat(distributed): SyncedMap component + migrate finetune/quant/agent-tasks to cross-replica state (#10542)

2026-06-27 23:23:51 +02:00

fix(agents): emit chat event timestamps in milliseconds (#9867) (#10243)

2026-06-12 23:18:44 +02:00

feat(pii): NER tier engine: privacy-filter.cpp backend + NER-centric PII filter (#10360)

2026-06-18 11:45:22 +01:00

feat: add distributed mode (#9124)

2026-03-30 00:47:27 +02:00

feat(distributed): SyncedMap component + migrate finetune/quant/agent-tasks to cross-replica state (#10542)

2026-06-27 23:23:51 +02:00

facerecognition

feat(face-recognition): add insightface/onnx backend for 1:1 verify, 1:N identify, embedding, detection, analysis (#9480)

2026-04-22 21:55:41 +02:00

feat(distributed): SyncedMap component + migrate finetune/quant/agent-tasks to cross-replica state (#10542)

2026-06-27 23:23:51 +02:00

fix(distributed): broadcast admin model-config changes across replicas (#10540)

2026-06-27 01:36:57 +02:00

fix(auth): make advisory locks dialect-aware and harden SQLite DSN (#10509)

2026-06-25 17:18:55 +02:00

feat: add distributed mode (#9124)

2026-03-30 00:47:27 +02:00

feat(distributed): SyncedMap component + migrate finetune/quant/agent-tasks to cross-replica state (#10542)

2026-06-27 23:23:51 +02:00

fix(distributed): broadcast admin model-config changes across replicas (#10540)

2026-06-27 01:36:57 +02:00

feat(middleware): Model routing, PII filtering, Cloud model proxies (#9802)

2026-05-25 09:28:27 +02:00

feat(distributed): add LOCALAI_DISTRIBUTED_SHARED_MODELS to skip staging on shared volumes (#10556)

2026-06-27 22:02:04 +00:00

feat(distributed): SyncedMap component + migrate finetune/quant/agent-tasks to cross-replica state (#10542)

2026-06-27 23:23:51 +02:00

fix(pii): post-merge review fixes + live NER e2e for the privacy-filter tier (#10401)

2026-06-22 18:26:19 +02:00

refactor(agents): bump skillserver, drop redundant Name from list_skills output (#9916)

2026-05-21 14:45:53 +02:00

feat: track files being staged (#9275)

2026-04-08 14:33:58 +02:00

feat(distributed): SyncedMap component + migrate finetune/quant/agent-tasks to cross-replica state (#10542)

2026-06-27 23:23:51 +02:00

feat(distributed): SyncedMap component + migrate finetune/quant/agent-tasks to cross-replica state (#10542)

2026-06-27 23:23:51 +02:00

voicerecognition

feat: voice recognition (#9500)

2026-04-23 12:07:14 +02:00

feat(config): hardware-tuned defaults: Blackwell batch + VRAM-scaled concurrency (#10411)

2026-06-20 14:45:59 +02:00