mirror/LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-08-02 03:20:12 -04:00

Files

History

Ettore Di Giacinto c2f73a987e fix(vllm): CPU build compatibility with vllm 0.14.1

Validated end-to-end on CPU with Qwen2.5-0.5B-Instruct (LoadModel, Predict,
TokenizeString, Free all working).

- requirements-cpu-after.txt: pin vllm to 0.14.1+cpu (pre-built wheel from
  GitHub releases) for x86_64 and aarch64. vllm 0.14.1 is the newest CPU
  wheel whose torch dependency resolves against published PyTorch builds
  (torch==2.9.1+cpu). Later vllm CPU wheels currently require
  torch==2.10.0+cpu which is only available on the PyTorch test channel
  with incompatible torchvision.
- requirements-cpu.txt: bump torch to 2.9.1+cpu, add torchvision/torchaudio
  so uv resolves them consistently from the PyTorch CPU index.
- install.sh: add --index-strategy=unsafe-best-match for CPU builds so uv
  can mix the PyTorch index and PyPI for transitive deps (matches the
  existing intel profile behaviour).
- backend.py LoadModel: vllm >= 0.14 removed AsyncLLMEngine.get_model_config
  so the old code path errored out with AttributeError on model load.
  Switch to the new get_tokenizer()/tokenizer accessor with a fallback
  to building the tokenizer directly from request.Model.

2026-04-12 14:48:28 +00:00

..

backend.py

fix(vllm): CPU build compatibility with vllm 0.14.1

2026-04-12 14:48:28 +00:00

install.sh

fix(vllm): CPU build compatibility with vllm 0.14.1

2026-04-12 14:48:28 +00:00

Makefile

feat(mlx): add mlx backend (#6049 )

2025-08-22 08:42:29 +02:00

README.md

refactor: move backends into the backends directory (#1279 )

2023-11-13 22:40:16 +01:00

requirements-after.txt

feat(vllm): CPU support + shared utils + vllm-omni feature parity

2026-04-12 14:48:28 +00:00

requirements-cpu-after.txt

fix(vllm): CPU build compatibility with vllm 0.14.1

2026-04-12 14:48:28 +00:00

requirements-cpu.txt

fix(vllm): CPU build compatibility with vllm 0.14.1

2026-04-12 14:48:28 +00:00

requirements-cublas12-after.txt

feat(vllm): CPU support + shared utils + vllm-omni feature parity

2026-04-12 14:48:28 +00:00

requirements-cublas12.txt

Revert "chore(deps): bump torch from 2.7.0 to 2.7.1+xpu in /backend/python/vllm in the pip group across 1 directory" (#8367 )

2026-02-03 08:34:54 +01:00

requirements-hipblas-after.txt

feat(vllm): CPU support + shared utils + vllm-omni feature parity

2026-04-12 14:48:28 +00:00

requirements-hipblas.txt

feat(rocm): bump to 7.x (#9323 )

2026-04-12 08:51:30 +02:00

requirements-install.txt

feat: migrate python backends from conda to uv (#2215 )

2024-05-10 15:08:08 +02:00

requirements-intel-after.txt

feat(vllm): CPU support + shared utils + vllm-omni feature parity

2026-04-12 14:48:28 +00:00

requirements-intel.txt

feat(qwen-tts): add Qwen-tts backend (#8163 )

2026-01-23 15:18:41 +01:00

requirements.txt

chore(deps): bump grpcio from 1.78.1 to 1.80.0 in /backend/python/vllm (#9177 )

2026-03-31 10:10:17 +02:00

run.sh

feat: Add backend gallery (#5607 )

2025-06-15 14:56:52 +02:00

test_cpu_inference.py

feat(vllm): CPU support + shared utils + vllm-omni feature parity

2026-04-12 14:48:28 +00:00

test.py

feat(vllm): wire native tool/reasoning parsers + chat deltas + logprobs

2026-04-12 14:48:28 +00:00

test.sh

feat: Add backend gallery (#5607 )

2025-06-15 14:56:52 +02:00

README.md

Creating a separate environment for the vllm project

make vllm