mirror of https://github.com/mudler/LocalAI.git
synced 2026-05-24 16:51:44 -04:00
feat(flash_attention): set auto for flash_attention in llama.cpp (#6168)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
committed by GitHub
parent dbdf2908ad
commit 739573e41b
@@ -1,5 +1,5 @@
 
-LLAMA_VERSION?=3d16b29c3bb1ec816ac0e782f20d169097063919
+LLAMA_VERSION?=4d74393bcc956ccd7df68a6a06d1a0575cfa712c
 LLAMA_REPO?=https://github.com/ggerganov/llama.cpp
 
 CMAKE_ARGS?=