LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-05-17 21:21:23 -04:00

Author	SHA1	Message	Date
Richard Palethorpe	73aacad2f9	fix(vllm): drop flash-attn wheel to avoid torch 2.10 ABI mismatch (#9557 ) The pinned flash-attn 2.8.3+cu12torch2.7 wheel breaks at import time once vllm 0.19.1 upgrades torch to its hard-pinned 2.10.0: ImportError: .../flash_attn_2_cuda...so: undefined symbol: _ZN3c104cuda29c10_cuda_check_implementationEiPKcS2_ib That C10 CUDA symbol is libtorch-version-specific. Dao-AILab has not yet published flash-attn wheels for torch 2.10 -- the latest release (2.8.3) tops out at torch 2.8 -- so any wheel pinned here is silently ABI-broken the moment vllm completes its install. vllm 0.19.1 lists flashinfer-python==0.6.6 as a hard dep, which already covers the attention path. The only other use of flash-attn in vllm is the rotary apply_rotary import in vllm/model_executor/layers/rotary_embedding/common.py, which is guarded by find_spec("flash_attn") and falls back cleanly when absent. Also unpin torch in requirements-cublas12.txt: the 2.7.0 pin only existed to give the flash-attn wheel a matching torch to link against. With flash-attn gone, vllm's own torch==2.10.0 dep is the binding constraint regardless of what we put here. Assisted-by: Claude:claude-opus-4-7 [Claude Code] Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-04-25 15:38:13 +00:00
Ettore Di Giacinto	d6409bd2eb	Revert "chore(deps): bump torch from 2.7.0 to 2.7.1+xpu in /backend/python/vllm in the pip group across 1 directory" (#8367 ) Revert "chore(deps): bump torch from 2.7.0 to 2.7.1+xpu in /backend/python/vl…" This reverts commit `4c0e70086d`.	2026-02-03 08:34:54 +01:00
dependabot[bot]	4c0e70086d	chore(deps): bump torch from 2.7.0 to 2.7.1+xpu in /backend/python/vllm in the pip group across 1 directory (#8360 ) chore(deps): bump torch Bumps the pip group with 1 update in the /backend/python/vllm directory: torch. Updates `torch` from 2.7.0 to 2.7.1+xpu --- updated-dependencies: - dependency-name: torch dependency-version: 2.7.1+xpu dependency-type: direct:production dependency-group: pip ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-02-03 03:07:02 +00:00
Ettore Di Giacinto	8b889955b4	chore(deps): bump pytorch to 2.7 in vllm (#5576 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-06-04 08:56:45 +02:00
Ettore Di Giacinto	5ffad3b004	chore(deps): remove pin on transformers (#5501 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-05-27 09:24:27 +02:00
Ettore Di Giacinto	6a382a1afe	fix(transformers): try to pin to working release (#5426 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-05-22 12:50:51 +02:00
Ettore Di Giacinto	3e77a17b26	fix(dependencies): pin pytorch version (#3872 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-10-18 09:11:59 +02:00
Ettore Di Giacinto	2553de0187	feat(vllm): add support for image-to-text and video-to-text (#3729 ) * feat(vllm): add support for image-to-text Related to https://github.com/mudler/LocalAI/issues/3670 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(vllm): add support for video-to-text Closes: https://github.com/mudler/LocalAI/issues/2318 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(vllm): support CPU installations Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(vllm): add bnb Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: add docs reference Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Apply suggestions from code review Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2024-10-04 23:42:05 +02:00
Ettore Di Giacinto	2c8623dbb4	fix(python): move vllm to after deps, drop diffusers main deps Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-08-07 23:34:37 +02:00
Ettore Di Giacinto	61b5602111	fix(python): move accelerate and GPU-specific libs to build-type (#3194 ) Some of the dependencies in `requirements.txt`, even if generic, pulls down the line CUDA libraries. This changes moves mostly all GPU-specific libs to the build-type, and tries a safer approach. In `requirements.txt` now are listed only "first-level" dependencies, for instance, grpc, but libs-dependencies are moved down to the respective build-type `requirements.txt` to avoid any mixin. This should fix #2737 and #1592. Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-08-07 17:02:32 +02:00
cryptk	ed322bf59f	fix: ensure correct version of torch is always installed based on BUILD_TYPE(#2890 ) * fix: ensure correct version of torch is always installed based on BUILD_TYPE Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * Move causal-conv1d installation to build_types Signed-off-by: mudler <mudler@localai.io> * Move mamba-ssd install to build-type requirements.txt Signed-off-by: mudler <mudler@localai.io> --------- Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> Signed-off-by: mudler <mudler@localai.io> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: mudler <mudler@localai.io>	2024-08-05 16:38:33 +00:00

11 Commits