LocalAI/.agents at ea72a56e2c8f423e40c839d18833eed669c18e03 - LocalAI - Gitea: Git with a cup of tea

mirror/LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-06-28 18:37:43 -04:00

Files

History

Ettore Di Giacinto 266fcc79ad docs(agents): fix A/B-bench gotcha - env-toggle != stock for compiled-in wins

The DGX re-run showed toggling LLAMA_KV_PAGED on/off on the patched binary does
NOT reproduce stock: the dominant SSM decode fusions are compiled in, not
runtime-gated, so the toggle measures only the (here ~neutral) paged-KV part.
True stock needs a separately-built unpatched binary at the same pin. Correct the
methodology skill's per-lever discipline + apples-to-apples rule accordingly.

Assisted-by: Claude:opus-4.8 [Claude Code]
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

2026-06-27 22:09:05 +00:00

..

adding-backends.md

docs(backends): make OS coverage explicit + require darwin support for new backends (#10516 )

2026-06-25 23:26:39 +02:00

adding-gallery-models.md

chore: add embeddingemma

2026-04-08 17:40:55 +00:00

ai-coding-assistants.md

docs(agents): adopt kernel's AI coding assistants policy

2026-04-19 22:50:54 +00:00

api-endpoints-and-auth.md

feat(realtime): Add Liquid Audio s2s model and assistant mode on talk page (#9801 )

2026-05-13 21:57:27 +02:00

backend-signing.md

ci(backend-signing): set COSIGN_EXPERIMENTAL=1 for oci-1-1 referrers mode

2026-05-24 08:21:05 +00:00

building-and-testing.md

test(react-ui): add page render-smoke specs, reset the coverage gate (#10122 )

2026-06-01 14:24:36 +02:00

ci-caching.md

docs(ci-caching): list all paths that retrigger base-images.yml

2026-05-09 22:31:37 +00:00

coding-style.md

security(http): refuse redirects on outbound clients via hardened pkg/httpclient (#10087 )

2026-05-30 12:04:10 +02:00

debugging-backends.md

feat: add (experimental) fine-tuning support with TRL (#9088 )

2026-03-21 02:08:02 +01:00

ds4-backend.md

feat(ds4): SSD streaming + quality engine options, 128GB DeepSeek gallery models (#10374 )

2026-06-17 10:30:06 +02:00

llama-cpp-backend.md

feat(llama-cpp): bump to MTP-merge SHA and automatically set MTP defaults (#9852 )

2026-05-16 22:42:48 +02:00

llama-cpp-localai-paged-backend.md

docs(paged): drop moot PIN_SYNC_c299a92c record, repoint to README sec 7

2026-06-27 21:34:10 +00:00

localai-assistant-mcp.md

feat: localai assistant chat modality (#9602 )

2026-04-28 19:29:27 +02:00

sglang-backend.md

feat(sglang): wire engine_args, add cuda13 build, ship MTP gallery demos (#9686 )

2026-05-07 17:27:29 +02:00

testing-mcp-apps.md

feat(ui): MCP Apps, mcp streaming and client-side support (#8947 )

2026-03-11 07:30:49 +01:00

vllm-backend.md

docs(agents): capture vllm backend lessons + runtime lib packaging (#9333 )

2026-04-13 11:09:57 +02:00

vllm-parity-methodology.md

docs(agents): fix A/B-bench gotcha - env-toggle != stock for compiled-in wins

2026-06-27 22:09:05 +00:00