LocalAI/.github/workflows at feat/vllm-parity - LocalAI - Gitea: Git with a cup of tea

mirror/LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-04-17 05:18:53 -04:00

Files

History

Ettore Di Giacinto cd56a05c3e ci(vllm): disable tests-vllm-grpc job (heterogeneous runners)

Both ubuntu-latest and bigger-runner have inconsistent CPU baselines:
some instances support the AVX-512 VNNI/BF16 instructions the prebuilt
vllm 0.14.1+cpu wheel was compiled with, others SIGILL on import of
vllm.model_executor.models.registry. The libnuma packaging fix doesn't
help when the wheel itself can't be loaded.

FROM_SOURCE=true compiles vllm against the actual host CPU and works
everywhere, but takes 30-50 minutes per run — too slow for a smoke
test on every PR.

Comment out the job for now. The test itself is intact and passes
locally; run it via 'make test-extra-backend-vllm' on a host with the
required SIMD baseline. Re-enable when:
  - we have a self-hosted runner label with guaranteed AVX-512 VNNI/BF16, or
  - vllm publishes a CPU wheel with a wider baseline, or
  - we set up a docker layer cache that makes FROM_SOURCE acceptable

The detect-changes vllm output, the test harness changes (tests/
e2e-backends + tools cap), the make target (test-extra-backend-vllm),
the package.sh and the Dockerfile/install.sh plumbing all stay in
place.

2026-04-13 07:46:57 +00:00

..

chore(ci): disable CI actions

2026-03-02 14:48:00 +01:00

backend_build_darwin.yml

chore(deps): bump docker/metadata-action from 5 to 6 (#8917 )

2026-03-09 22:27:02 +01:00

backend_build.yml

chore(deps): bump docker/login-action from 3 to 4 (#8918 )

2026-03-09 22:30:11 +01:00

backend_pr.yml

Change runner from macOS-14 to macos-latest

2025-12-13 10:11:27 +01:00

backend.yml

ci(backend): build cpu-vllm container image

2026-04-12 14:48:28 +00:00

build-test.yaml

chore(deps): bump actions/upload-artifact from 6 to 7 (#8730 )

2026-03-02 21:43:39 +01:00

bump_deps.yaml

feat(backends): add ik-llama-cpp (#9326 )

2026-04-12 13:51:28 +02:00

bump_docs.yaml

fix(api)!: Stop model prior to deletion (#8422 )

2026-02-06 09:22:10 +01:00

bump-inference-defaults.yml

chore(deps): bump peter-evans/create-pull-request from 7 to 8 (#9114 )

2026-03-24 08:50:50 +01:00

checksum_checker.yaml

fix(api)!: Stop model prior to deletion (#8422 )

2026-02-06 09:22:10 +01:00

deploy-explorer.yaml

fix(api)!: Stop model prior to deletion (#8422 )

2026-02-06 09:22:10 +01:00

gallery-agent.yaml

chore(ci): fix gallery agent

2026-04-02 18:02:18 +00:00

generate_grpc_cache.yaml

chore(deps): bump docker/build-push-action from 6 to 7 (#8919 )

2026-03-09 22:29:51 +01:00

generate_intel_image.yaml

chore(deps): bump docker/login-action from 3 to 4 (#8918 )

2026-03-09 22:30:11 +01:00

gh-pages.yml

chore(deps): bump actions/upload-pages-artifact from 3 to 4 (#9179 )

2026-03-30 23:16:23 +02:00

image_build.yml

chore: drop AIO images (#9004 )

2026-03-14 17:49:36 +01:00

image-pr.yml

feat(rocm): bump to 7.x (#9323 )

2026-04-12 08:51:30 +02:00

image.yml

feat(rocm): bump to 7.x (#9323 )

2026-04-12 08:51:30 +02:00

notify-releases.yaml

fix(api)!: Stop model prior to deletion (#8422 )

2026-02-06 09:22:10 +01:00

release.yaml

chore(deps): bump goreleaser/goreleaser-action from 6 to 7 (#8634 )

2026-02-23 23:27:49 +01:00

secscan.yaml

Revert "chore(deps): bump securego/gosec from 2.22.9 to 2.22.11" (#7789 )

2025-12-30 09:58:13 +01:00

stalebot.yml

chore(deps): bump actions/stale from 10.1.1 to 10.2.0 (#8633 )

2026-02-23 23:27:20 +01:00

test-extra.yml

ci(vllm): disable tests-vllm-grpc job (heterogeneous runners)

2026-04-13 07:46:57 +00:00

test.yml

feat: add distributed mode (#9124 )

2026-03-30 00:47:27 +02:00

tests-e2e.yml

feat(realtime): WebRTC support (#8790 )

2026-03-13 21:37:15 +01:00

tests-ui-e2e.yml

chore(deps): bump actions/upload-artifact from 4 to 7 (#9030 )

2026-03-17 11:42:49 +01:00

update_swagger.yaml

fix(api)!: Stop model prior to deletion (#8422 )

2026-02-06 09:22:10 +01:00

yaml-check.yml

chore(backend gallery): add description for remaining backends (#5679 )

2025-06-17 22:21:44 +02:00