LocalAI/.github at a5a5b2ad801763cee7303d90f6137f56130b70d4 - LocalAI - Gitea: Git with a cup of tea

mirror/LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-06-27 09:57:14 -04:00

Files

History

Ettore Di Giacinto a5a5b2ad80 feat(paged): bump llama.cpp pin 9d5d882d -> c299a92c (bit-exact verified)

Advance the paged-attention backend's owned llama.cpp pin by 23 upstream
commits. The shipped source-only patch series (0001-0030, 28 patches) applies
strict-clean (git apply, exit 0) on a fresh c299a92c checkout with no re-export
needed, and the bit-exact gate is GREEN on every path on GB10 (CUDA sm_121):

- md5 greedy decode (-ngl 99 -fa on -n 48 --temp 0 --seed 1): dense
  non-paged/paged 5951a5b4, MoE non-paged 07db32c2, MoE paged 8cb0ce23; all
  match the established baselines.
- test-backend-ops CUDA0: SSM_CONV 45/45, SSM_CONV_UPDATE 16/16,
  SSM_CONV_UPDATE_IDS 16/16, GATED_DELTA_NET 84/84, MUL_MAT 1146/1146,
  MUL_MAT_ID 806/806; all OK.

The 23-commit upstream jump did not change our decode output. The .patch files
are kept byte-identical (they already apply strict-clean at the new pin); only
the pin, the PIN_SYNC evidence doc, and the canary/gallery doc references change.

Assisted-by: Claude:opus-4.8 [Claude Code]
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

2026-06-27 08:57:33 +00:00

..

ci: phase 1-3 of GHA free tier migration (path filter, multi-arch split prep, /mnt disk relief) (#9726 )

2026-05-08 23:43:41 +02:00

fix: roll out bluemonday Sanitize more widely (#3794 )

2024-10-12 09:45:47 +02:00

Harden gallery-agent Hugging Face fetches against transient rate limiting (#10187 )

2026-06-05 23:43:06 +02:00

docs/examples: enhancements (#1572 )

2024-01-18 19:41:08 +01:00

feat(paged): bump llama.cpp pin 9d5d882d -> c299a92c (bit-exact verified)

2026-06-27 08:57:33 +00:00

feat(paged): bump llama.cpp pin 9d5d882d -> c299a92c (bit-exact verified)

2026-06-27 08:57:33 +00:00

backend-matrix.yml

feat(paged): Metal/darwin build availability for llama-cpp-localai-paged

2026-06-27 07:42:08 +00:00

bump_deps.sh

feat: do not bundle llama-cpp anymore (#5790 )

2025-07-18 13:24:12 +02:00

bump_docs.sh

fix: github bump_docs.sh regex to drop emoji and other text (#2180 )

2024-04-29 03:55:29 +00:00

bump_vllm_metal.sh

feat(vllm): macOS/Metal support via vllm-metal (MLX) (#10489 )

2026-06-25 15:46:19 +02:00

bump_vllm_wheel.sh

feat(vllm): expose AsyncEngineArgs via generic engine_args YAML map (#9563 )

2026-04-29 00:49:28 +02:00

check_and_update.py

fix(ci): fixup checksum scanning pipeline (#3631 )

2024-09-23 10:56:10 +02:00

checksum_checker.sh

fix(ci): fixup correct path for check_and_update.py (#2777 )

2024-07-11 23:05:43 +02:00

dependabot.yml

feat: Add backend gallery (#5607 )

2025-06-15 14:56:52 +02:00

FUNDING.yml

Create FUNDING.yml (#725 )

2023-07-09 13:39:00 +02:00

labeler.yml

chore(ci): update labels

2025-02-13 09:58:19 +01:00

PULL_REQUEST_TEMPLATE.md

feat(vllm): Allow to set quantization (#1094 )

2023-09-22 15:52:38 +02:00

release.yml

feat(p2p): Federation and AI swarms (#2723 )

2024-07-08 22:04:06 +02:00

stale.yml

feat: add PR template and stale configuration (#316 )

2023-05-20 09:10:20 +02:00