mirror/LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-06-07 16:27:09 -04:00

Files

History

Ettore Di Giacinto d74cd56b14 feat(vllm): bundle libnuma/libgomp via package.sh

The vllm CPU wheel ships a _C extension that dlopens libnuma.so.1 at
import time; torch's CPU kernels in turn use libgomp.so.1 (OpenMP).
Without these on the host, vllm._C silently fails to register its
torch ops and EngineCore crashes with:

  AttributeError: '_OpNamespace' '_C_utils' object has no attribute
    'init_cpu_threads_env'

Rather than asking every user to install libnuma1/libgomp1 on their
host (or every LocalAI base image to ship them), bundle them into
the backend image itself — same pattern fish-speech and the GPU libs
already use. libbackend.sh adds ${EDIR}/lib to LD_LIBRARY_PATH at
run time so the bundled copies are picked up automatically.

- backend/python/vllm/package.sh (new): copies libnuma.so.1 and
  libgomp.so.1 from the builder's multilib paths into ${BACKEND}/lib,
  preserving soname symlinks. Runs during Dockerfile.python's
  'Run backend-specific packaging' step (which already invokes
  package.sh if present).
- backend/Dockerfile.python: install libnuma1 + libgomp1 in the
  builder stage so package.sh has something to copy (the Ubuntu
  base image otherwise only has libgomp in the gcc dep chain).
- test-extra.yml: drop the workaround that installed these libs on
  the runner host — with the backend image self-contained, the
  runner no longer needs them, and the test now exercises the
  packaging path end-to-end the way a production host would.

2026-04-12 20:20:21 +00:00

..

backend.py

fix(vllm): tool parser constructor compat + e2e tool calling test

2026-04-12 14:48:28 +00:00

install.sh

ci(vllm): use bigger-runner instead of source build

2026-04-12 16:02:49 +00:00

Makefile

feat(mlx): add mlx backend (#6049 )

2025-08-22 08:42:29 +02:00

package.sh

feat(vllm): bundle libnuma/libgomp via package.sh

2026-04-12 20:20:21 +00:00

README.md

refactor: move backends into the backends directory (#1279 )

2023-11-13 22:40:16 +01:00

requirements-after.txt

feat(vllm): CPU support + shared utils + vllm-omni feature parity

2026-04-12 14:48:28 +00:00

requirements-cpu-after.txt

fix(vllm): CPU build compatibility with vllm 0.14.1

2026-04-12 14:48:28 +00:00

requirements-cpu.txt

fix(vllm): CPU build compatibility with vllm 0.14.1

2026-04-12 14:48:28 +00:00

requirements-cublas12-after.txt

feat(vllm): CPU support + shared utils + vllm-omni feature parity

2026-04-12 14:48:28 +00:00

requirements-cublas12.txt

Revert "chore(deps): bump torch from 2.7.0 to 2.7.1+xpu in /backend/python/vllm in the pip group across 1 directory" (#8367 )

2026-02-03 08:34:54 +01:00

requirements-hipblas-after.txt

feat(vllm): CPU support + shared utils + vllm-omni feature parity

2026-04-12 14:48:28 +00:00

requirements-hipblas.txt

feat(rocm): bump to 7.x (#9323 )

2026-04-12 08:51:30 +02:00

requirements-install.txt

feat: migrate python backends from conda to uv (#2215 )

2024-05-10 15:08:08 +02:00

requirements-intel-after.txt

feat(vllm): CPU support + shared utils + vllm-omni feature parity

2026-04-12 14:48:28 +00:00

requirements-intel.txt

feat(qwen-tts): add Qwen-tts backend (#8163 )

2026-01-23 15:18:41 +01:00

requirements.txt

chore(deps): bump grpcio from 1.78.1 to 1.80.0 in /backend/python/vllm (#9177 )

2026-03-31 10:10:17 +02:00

run.sh

feat: Add backend gallery (#5607 )

2025-06-15 14:56:52 +02:00

test.py

feat(vllm): wire native tool/reasoning parsers + chat deltas + logprobs

2026-04-12 14:48:28 +00:00

test.sh

feat: Add backend gallery (#5607 )

2025-06-15 14:56:52 +02:00

README.md

Creating a separate environment for the vllm project

make vllm