LocalAI/core at 8cae99229c0672289edfb87ec731fffb6d815eff

mirror/LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-06-24 08:38:51 -04:00

Ettore Di Giacinto 800f749c7b fix: drop gguf VRAM estimation (now redundant) (#8325)

fix: drop gguf VRAM estimation

Cleanup. This is now handled directly in llama.cpp, so there is no need to estimate from Go.

VRAM estimation in general is tricky, but llama.cpp (41ea26144e/src/llama.cpp (L168)) has recently added automatic "fitting" of models to VRAM. Since we already enable it (397f7f0862/backend/cpp/llama-cpp/grpc-server.cpp (L393)), we can drop the backend-specific GGUF VRAM estimation from our code instead of trying to guess.

Fixes: https://github.com/mudler/LocalAI/issues/8302
See: https://github.com/mudler/LocalAI/issues/8302#issuecomment-3830773472

2026-02-01 17:33:28 +01:00

feat: disable force eviction (#7725)

2025-12-25 14:26:18 +01:00

feat(api): Add transcribe response format request parameter & adjust STT backends (#8318)

2026-02-01 17:33:17 +01:00

feat(api): Add transcribe response format request parameter & adjust STT backends (#8318)

2026-02-01 17:33:17 +01:00

feat(store): add Golang client (#1977)

2024-04-16 15:54:14 +02:00

fix: drop gguf VRAM estimation (now redundant) (#8325)

2026-02-01 17:33:28 +01:00

dependencies_manager

fix: be consistent in downloading files, check for scanner errors (#3108)

2024-08-02 20:06:25 +02:00

chore(refactor): move logging to common package based on slog (#7668)

2025-12-21 19:33:13 +01:00

feat: Filter backend gallery by system capabilities (#7950)

2026-01-10 23:34:01 +01:00

feat(api): Add transcribe response format request parameter & adjust STT backends (#8318)

2026-02-01 17:33:17 +01:00

chore(refactor): move logging to common package based on slog (#7668)

2025-12-21 19:33:13 +01:00

feat(api): Add transcribe response format request parameter & adjust STT backends (#8318)

2026-02-01 17:33:17 +01:00

chore(refactor): move logging to common package based on slog (#7668)

2025-12-21 19:33:13 +01:00

feat(api): Add transcribe response format request parameter & adjust STT backends (#8318)

2026-02-01 17:33:17 +01:00

chore(refactor): move logging to common package based on slog (#7668)

2025-12-21 19:33:13 +01:00