LocalAI/.agents at db14006fcd291403cfe4e69bd464bf3f3e361fc4

mirror/LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-06-27 18:06:58 -04:00

Ettore Di Giacinto db14006fcd docs(agents): add paged-backend maintenance + vLLM-parity methodology skills

Two .agents guides (indexed in AGENTS.md):
- llama-cpp-localai-paged-backend.md: what the CUDA-only paged backend is, the
  patchset scope, the bit-exact gate, the manual pin-sync + weekly canary, the
  CUDA-only / stock-stays-pure invariants, and the Metal/SYCL/Vulkan follow-up scope.
- vllm-parity-methodology.md: the decode-parity playbook (bit-exact gating,
  profile-don't-assume, both-engine ground-truth, per-lever A/B, recording rejected
  levers, multi-agent GPU orchestration).

Assisted-by: Claude:opus-4.8 [Claude Code]
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

2026-06-27 12:58:01 +00:00

adding-backends.md

docs(backends): make OS coverage explicit + require darwin support for new backends (#10516)

2026-06-25 23:26:39 +02:00

adding-gallery-models.md

chore: add embeddingemma

2026-04-08 17:40:55 +00:00

ai-coding-assistants.md

docs(agents): adopt kernel's AI coding assistants policy

2026-04-19 22:50:54 +00:00

api-endpoints-and-auth.md

feat(realtime): Add Liquid Audio s2s model and assistant mode on talk page (#9801)

2026-05-13 21:57:27 +02:00

backend-signing.md

ci(backend-signing): set COSIGN_EXPERIMENTAL=1 for oci-1-1 referrers mode

2026-05-24 08:21:05 +00:00

building-and-testing.md

test(react-ui): add page render-smoke specs, reset the coverage gate (#10122)

2026-06-01 14:24:36 +02:00

ci-caching.md

docs(ci-caching): list all paths that retrigger base-images.yml

2026-05-09 22:31:37 +00:00

coding-style.md

security(http): refuse redirects on outbound clients via hardened pkg/httpclient (#10087)

2026-05-30 12:04:10 +02:00

debugging-backends.md

feat: add (experimental) fine-tuning support with TRL (#9088)

2026-03-21 02:08:02 +01:00

ds4-backend.md

feat(ds4): SSD streaming + quality engine options, 128GB DeepSeek gallery models (#10374)

2026-06-17 10:30:06 +02:00

llama-cpp-backend.md

feat(llama-cpp): bump to MTP-merge SHA and automatically set MTP defaults (#9852)

2026-05-16 22:42:48 +02:00

llama-cpp-localai-paged-backend.md

docs(agents): add paged-backend maintenance + vLLM-parity methodology skills

2026-06-27 12:58:01 +00:00

localai-assistant-mcp.md

feat: localai assistant chat modality (#9602)

2026-04-28 19:29:27 +02:00

sglang-backend.md

feat(sglang): wire engine_args, add cuda13 build, ship MTP gallery demos (#9686)

2026-05-07 17:27:29 +02:00

testing-mcp-apps.md

feat(ui): MCP Apps, mcp streaming and client-side support (#8947)

2026-03-11 07:30:49 +01:00

vllm-backend.md

docs(agents): capture vllm backend lessons + runtime lib packaging (#9333)

2026-04-13 11:09:57 +02:00

vllm-parity-methodology.md

docs(agents): add paged-backend maintenance + vLLM-parity methodology skills

2026-06-27 12:58:01 +00:00