LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-07-01 11:56:57 -04:00

Files

Adira 4ec39bb776 fix(watchdog): don't log optional Free() as an error when backend returns Unimplemented (#10602 ) (#10607 )

* fix(watchdog): don't log optional Free() as an error when backend returns Unimplemented (#10602)

When the watchdog evicts a model, deleteProcess calls the backend's gRPC
Free() to release VRAM before stopping the process. Free is optional:
backends that don't override it -- the generated UnimplementedBackendServer
stub, many Python/external backends, or a federation proxy in distributed
mode -- return gRPC Unimplemented. That is expected, not a failure: VRAM is
reclaimed when the local process is stopped, or by the remote unloader for
remote backends. Logging it as "WARN Error freeing GPU resources" made a
benign, optional RPC look like a fault (the alarming line in #10602, seen
in distributed mode where the model is remote and Free hits a stub).

Treat gRPC Unimplemented from Free() as a no-op logged at Debug; genuine
failures still Warn. Free() is still attempted for every backend, so any
backend that does implement it is unaffected.

Add a reusable grpcerrors.IsUnimplemented helper following the package's
existing code-based detection idiom (prefer the typed status code, fall
back to the message across non-gRPC boundaries), with table tests.

Assisted-by: Claude:claude-opus-4-8 [Claude Code]

Signed-off-by: Adira Denis Muhando <dennisadira@gmail.com>

* fix(watchdog): log a non-Unimplemented Free() failure at error level

Per review: now that the expected gRPC Unimplemented case is split out and
logged at Debug, any remaining Free() error is a genuine failure to release
VRAM, so surface it at error level instead of warn.

Assisted-by: Claude:claude-opus-4-8 [Claude Code]

Signed-off-by: Adira Denis Muhando <dennisadira@gmail.com>

---------

Signed-off-by: Adira Denis Muhando <dennisadira@gmail.com>

2026-06-30 22:14:01 +02:00

backend_log_store_test.go

feat: react chat redesign (#9616 )

2026-04-29 22:33:26 +02:00

backend_log_store.go

feat: react chat redesign (#9616 )

2026-04-29 22:33:26 +02:00

connection_errors.go

fix(nodes): better detection if nodes goes down or model is not available (#9274 )

2026-04-08 12:11:02 +02:00

connection_evicting_client.go

feat(backend): add depth-anything (Depth Anything 3) C++/ggml backend + gallery (#10352 )