feat(flash_attention): set auto for flash_attention in llama.cpp (#6168)

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2026-07-12 09:20:23 -04:00 · 2025-08-31 17:59:09 +02:00
parent dbdf2908ad
commit 739573e41b
5 changed files with 22 additions and 8 deletions
--- a/backend/cpp/llama-cpp/Makefile
+++ b/backend/cpp/llama-cpp/Makefile
@@ -1,5 +1,5 @@

-LLAMA_VERSION?=3d16b29c3bb1ec816ac0e782f20d169097063919
+LLAMA_VERSION?=4d74393bcc956ccd7df68a6a06d1a0575cfa712c
 LLAMA_REPO?=https://github.com/ggerganov/llama.cpp

 CMAKE_ARGS?=