LocalAI/backend/cpp/llama-cpp/Makefile at 2f648dc6a06b3bc7d157bdfd6c6f6da745afaa80

mirror of https://github.com/mudler/LocalAI.git synced 2026-06-23 16:19:07 -04:00

Files

Ettore Di Giacinto ba3fa5a633 build(paged): stacking patch-series scaffolding for llama.cpp paged attention

Numbered patches under backend/cpp/llama-cpp/patches/ applied in order against
the pinned LLAMA_VERSION (build hook in the llama.cpp: target). Each phase is one
small, independently-buildable patch so the work rebases cleanly across llama.cpp
bumps (anti-drift). README defines the series (0001 vendor manager -> 0006 prefix
caching) + the regen workflow.

Assisted-by: Claude:opus-4.8 [Claude Code]
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

2026-06-19 22:53:20 +00:00

7.3 KiB

Raw Blame History

View Raw

7.3 KiB Raw Blame History

7.3 KiB

Raw Blame History