Ettore Di Giacinto
c27da0a0f6
fix(diffusers): fix float detection ( #6313 )
...
There was apparently an oversight, this fixes the float/int detection
Fixes: https://github.com/mudler/LocalAI/issues/6312
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-09-19 19:09:04 +02:00
Ettore Di Giacinto
ac043ed9ba
chore(model gallery): add aquif-3.5-a4b-think ( #6311 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-19 11:16:50 +02:00
Ettore Di Giacinto
2e0d66a1c8
chore(model gallery): add impish_qwen_14b-1m ( #6310 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-19 10:57:33 +02:00
Ettore Di Giacinto
41a0f361eb
chore(model gallery): add mistralai_magistral-small-2509 ( #6309 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-19 10:48:13 +02:00
LocalAI [bot]
d3c5c02837
docs: ⬆️ update docs version mudler/LocalAI ( #6307 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-18 23:48:02 +02:00
LocalAI [bot]
ae3d8fb0c4
chore: ⬆️ Update ggml-org/llama.cpp to 3edd87cd055a45d885fa914d879d36d33ecfc3e1 ( #6308 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-18 21:09:14 +00:00
LocalAI [bot]
902e47f0b0
chore: ⬆️ Update ggml-org/llama.cpp to 0320ac5264279d74f8ee91bafa6c90e9ab9bbb91 ( #6306 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-18 09:27:18 +02:00
Ettore Di Giacinto
50bb78fd24
Add permissions for issues and actions
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-09-18 09:26:10 +02:00
LocalAI [bot]
542f07ab2d
docs: ⬆️ update docs version mudler/LocalAI ( #6305 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-17 21:06:50 +00:00
Ettore Di Giacinto
77c5acb9db
Revert "feat(nvidia-gpu): bump images to cuda 12.8" ( #6303 )
...
Revert "feat(nvidia-gpu): bump images to cuda 12.8 (#6239 )"
This reverts commit d9e25af7b5 .
2025-09-17 19:31:43 +02:00
Ettore Di Giacinto
44bbf4d778
chore(model gallery): add websailor-7b ( #6300 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-17 09:49:58 +02:00
Ettore Di Giacinto
633c12f93d
chore(model gallery): add websailor-32b ( #6299 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-17 09:48:16 +02:00
Ettore Di Giacinto
6f24135f1d
chore(model gallery): add webwatcher-32b ( #6298 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-17 09:42:54 +02:00
Ettore Di Giacinto
b72aa7b4fa
chore(model gallery): add webwatcher-7b ( #6297 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-17 09:36:25 +02:00
Ettore Di Giacinto
e94e725479
chore(model gallery): add alibaba-nlp_tongyi-deepresearch-30b-a3b ( #6295 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-17 09:22:19 +02:00
LocalAI [bot]
e4ac7b14a3
chore: ⬆️ Update ggml-org/llama.cpp to 8ff206097c2bf3ca1c7aa95f9d6db779fc7bdd68 ( #6292 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-16 21:09:47 +00:00
Ettore Di Giacinto
ddb39c73f2
chore(model gallery): add holo1.5-3b ( #6291 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-16 18:13:11 +02:00
Ettore Di Giacinto
264b09fb1e
chore(model gallery): add holo1.5-7b ( #6290 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-16 18:10:27 +02:00
Ettore Di Giacinto
36dd45df51
chore(model gallery): add holo1.5-72b ( #6289 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-16 18:07:50 +02:00
Ettore Di Giacinto
e5599f87b8
chore(model gallery): add k2-think-i1 ( #6288 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-16 18:05:01 +02:00
LocalAI [bot]
e89b5cc0e3
chore: ⬆️ Update ggml-org/llama.cpp to b907255f4bd169b0dc7dca9553b4c54af5170865 ( #6287 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-16 08:10:37 +02:00
Richard Palethorpe
10bf1084cc
chore: ⬆️ Update leejet/stable-diffusion.cpp to 0ebe6fe118f125665939b27c89f34ed38716bff8 ( #6271 )
...
* ⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
* fix(stablediffusion-ggml): Move parameters and start refactor of passing params
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(stablediffusion-ggml): Add default sampler option
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Signed-off-by: Richard Palethorpe <io@richiejp.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-15 21:42:46 +02:00
Ettore Di Giacinto
b08ae559b3
chore(model gallery): add qwen3-stargate-sg1-uncensored-abliterated-8b-i1 ( #6270 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-15 11:03:26 +02:00
Ettore Di Giacinto
aa7cb7e18c
chore(model gallery): add aquif-ai_aquif-3.5-8b-think ( #6269 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-15 10:42:42 +02:00
Ettore Di Giacinto
eadd3d4e46
chore(model gallery): add baidu_ernie-4.5-21b-a3b-thinking ( #6267 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-15 10:27:02 +02:00
LocalAI [bot]
2a18206033
chore: ⬆️ Update ggml-org/llama.cpp to 6c019cb04e86e2dacfe62ce7666c64e9717dde1f ( #6265 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-14 21:19:41 +00:00
LocalAI [bot]
39798d734e
chore: ⬆️ Update ggml-org/llama.cpp to 0fa154e3502e940df914f03b41475a2b80b985b0 ( #6263 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-14 19:59:58 +00:00
Gianluca Boiano
d0e99562af
chore(aio): upgrade minicpm-v model to latest 4.5 ( #6262 )
...
chore(aio): upgrade vision model to MiniCPM-V 4.5
Signed-off-by: Gianluca Boiano <morf3089@gmail.com >
2025-09-14 15:04:58 +02:00
Ettore Di Giacinto
6410c99bf2
fix(llama-cpp): correctly calculate embeddings ( #6259 )
...
* chore(tests): check embeddings differs in llama.cpp
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(llama.cpp): use the correct field for embedding
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(llama.cpp): use embedding type none
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(tests): add test-cases in aio-e2e suite
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-13 23:11:54 +02:00
LocalAI [bot]
55766d269b
chore: ⬆️ Update ggml-org/llama.cpp to aa0c461efe3603639af1a1defed2438d9c16ca0f ( #6261 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-13 21:11:18 +00:00
Ettore Di Giacinto
ffa0ad1eac
Fix formatting issues in README.md links
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-09-13 09:16:17 +02:00
LocalAI [bot]
623789a29e
chore: ⬆️ Update ggml-org/llama.cpp to 40be51152d4dc2d47444a4ed378285139859895b ( #6260 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-12 21:10:39 +00:00
Richard Palethorpe
2b9a3d32c9
chore: ⬆️ Update leejet/stable-diffusion.cpp to fce6afcc6a3250a8e17923608922d2a99b339b47 ( #6256 )
...
* ⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
* fix(stablediffusion-ggml): Add SMOOTHSTEP scheduler and assert sampler and scheduler counts
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Signed-off-by: Richard Palethorpe <io@richiejp.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-12 12:28:20 +02:00
LocalAI [bot]
f8b71dc5d0
chore: ⬆️ Update ggml-org/llama.cpp to 0e6ff0046f4a2983b2c77950aa75960fe4b4f0e2 ( #6235 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-11 21:21:49 +00:00
KingJ
1d3331b5cb
fix(rocm): Rename tag suffix for hipblas whisper build to match backend config ( #6247 )
...
Rename tag suffix for hipblas whisper to match backend config
hipblas images generally have the suffix `-gpu-rocm-hipblas-X`. One exception to this currently is the hipblas build of Whisper which has the suffix `gpu-hipblas-whisper.
However, as `backend/index.yaml` references the image tag for Whisper using the more consistent form (i.e. `latest-gpu-rocm-hipblas-whisper`), it is not possible to add the backend as raised in #6114 .
Therefore, rename the suffix for hipblas whisper images to use the more consistent form, aligning with other hipblas builds as well as the expected image name in `backend/index.yaml`.
Signed-off-by: Kingsley Jarrett <kj@kingj.net >
2025-09-11 21:19:09 +02:00
Mário Freitas
2c0b9c6349
fix(chat): use proper finish_reason for tool/function calling ( #6243 )
...
Signed-off-by: Mário Freitas <imkira@gmail.com >
2025-09-11 21:13:23 +02:00
qxo
3c6c976755
feat: support HF_ENDPOINT env for the HuggingFace endpoint ( #6220 )
...
ie: `HF_ENDPOINT=https://hf-mirror.com `
2025-09-11 21:04:57 +02:00
Sertaç Özercan
ebbcba342a
fix: runtime capability detection for backends ( #6149 )
...
* runtime capability detection for backends
Signed-off-by: Sertac Ozercan <sozercan@gmail.com >
* test
Signed-off-by: Sertac Ozercan <sozercan@gmail.com >
* skip nvidia on darwin
Signed-off-by: Sertac Ozercan <sozercan@gmail.com >
* address review comments
Signed-off-by: Sertac Ozercan <sozercan@gmail.com >
* fix apple test
Signed-off-by: Sertac Ozercan <sozercan@gmail.com >
* remove unused func
Signed-off-by: Sertac Ozercan <sozercan@gmail.com >
---------
Signed-off-by: Sertac Ozercan <sozercan@gmail.com >
2025-09-11 10:46:19 +02:00
LocalAI [bot]
0de75519dc
chore: ⬆️ Update leejet/stable-diffusion.cpp to b0179181069254389ccad604e44f17a2c25b4094 ( #6246 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-10 23:43:12 +02:00
Richard Palethorpe
37f5e4f5c1
feat(whisper): Add diarization (tinydiarize) ( #6184 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-09-10 19:09:28 +02:00
Ettore Di Giacinto
ffa934b959
feat(chatterbox): add MPS, and CPU, pin version ( #6242 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-09 17:58:07 +02:00
Mauro Morales
59311d8b1e
Point to LocalAI-examples repo for llava ( #6241 )
...
Signed-off-by: Mauro Morales <contact@mauromorales.com >
2025-09-09 16:40:55 +02:00
Ettore Di Giacinto
d9e25af7b5
feat(nvidia-gpu): bump images to cuda 12.8 ( #6239 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-09 13:02:17 +02:00
dependabot[bot]
e4f8b63b40
chore(deps): bump actions/labeler from 5 to 6 ( #6229 )
...
Bumps [actions/labeler](https://github.com/actions/labeler ) from 5 to 6.
- [Release notes](https://github.com/actions/labeler/releases )
- [Commits](https://github.com/actions/labeler/compare/v5...v6 )
---
updated-dependencies:
- dependency-name: actions/labeler
dependency-version: '6'
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-09 08:57:13 +02:00
dependabot[bot]
1364ae9be6
chore(deps): bump github.com/swaggo/swag from 1.16.3 to 1.16.6 ( #6222 )
...
Bumps [github.com/swaggo/swag](https://github.com/swaggo/swag ) from 1.16.3 to 1.16.6.
- [Release notes](https://github.com/swaggo/swag/releases )
- [Changelog](https://github.com/swaggo/swag/blob/master/.goreleaser.yml )
- [Commits](https://github.com/swaggo/swag/compare/v1.16.3...v1.16.6 )
---
updated-dependencies:
- dependency-name: github.com/swaggo/swag
dependency-version: 1.16.6
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-09 08:56:59 +02:00
dependabot[bot]
cfd6a9150d
chore(deps): bump oras.land/oras-go/v2 from 2.5.0 to 2.6.0 ( #6225 )
...
Bumps [oras.land/oras-go/v2](https://github.com/oras-project/oras-go ) from 2.5.0 to 2.6.0.
- [Release notes](https://github.com/oras-project/oras-go/releases )
- [Commits](https://github.com/oras-project/oras-go/compare/v2.5.0...v2.6.0 )
---
updated-dependencies:
- dependency-name: oras.land/oras-go/v2
dependency-version: 2.6.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-08 23:43:28 +00:00
dependabot[bot]
cd352d0c5f
chore(deps): bump go.opentelemetry.io/otel/exporters/prometheus from 0.50.0 to 0.60.0 ( #6226 )
...
chore(deps): bump go.opentelemetry.io/otel/exporters/prometheus
Bumps [go.opentelemetry.io/otel/exporters/prometheus](https://github.com/open-telemetry/opentelemetry-go ) from 0.50.0 to 0.60.0.
- [Release notes](https://github.com/open-telemetry/opentelemetry-go/releases )
- [Changelog](https://github.com/open-telemetry/opentelemetry-go/blob/main/CHANGELOG.md )
- [Commits](https://github.com/open-telemetry/opentelemetry-go/compare/example/prometheus/v0.50.0...exporters/prometheus/v0.60.0 )
---
updated-dependencies:
- dependency-name: go.opentelemetry.io/otel/exporters/prometheus
dependency-version: 0.60.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-09 00:21:03 +02:00
LocalAI [bot]
8d47309695
chore: ⬆️ Update ggml-org/whisper.cpp to edea8a9c3cf0eb7676dcdb604991eb2f95c3d984 ( #6237 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-09 00:20:31 +02:00
LocalAI [bot]
5f6fc02a55
chore: ⬆️ Update leejet/stable-diffusion.cpp to abb115cd021fc2beed826604ed1a479b6a77671c ( #6236 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-09 00:20:03 +02:00
Ettore Di Giacinto
0b528458d8
chore(docs): add MacOS dmg download button ( #6233 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-09 00:19:37 +02:00
Ettore Di Giacinto
caab380c5d
feat(launcher): show welcome page ( #6234 )
...
feat(launcher): add welcome window
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-09 00:14:58 +02:00
dependabot[bot]
8a3a362504
chore(deps): bump actions/stale from 9.1.0 to 10.0.0 ( #6227 )
...
Bumps [actions/stale](https://github.com/actions/stale ) from 9.1.0 to 10.0.0.
- [Release notes](https://github.com/actions/stale/releases )
- [Changelog](https://github.com/actions/stale/blob/main/CHANGELOG.md )
- [Commits](5bef64f19d...3a9db7e6a4 )
---
updated-dependencies:
- dependency-name: actions/stale
dependency-version: 10.0.0
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-08 22:50:59 +02:00
dependabot[bot]
07238eb743
chore(deps): bump github.com/opencontainers/image-spec from 1.1.0 to 1.1.1 ( #6223 )
...
chore(deps): bump github.com/opencontainers/image-spec
Bumps [github.com/opencontainers/image-spec](https://github.com/opencontainers/image-spec ) from 1.1.0 to 1.1.1.
- [Release notes](https://github.com/opencontainers/image-spec/releases )
- [Changelog](https://github.com/opencontainers/image-spec/blob/main/RELEASES.md )
- [Commits](https://github.com/opencontainers/image-spec/compare/v1.1.0...v1.1.1 )
---
updated-dependencies:
- dependency-name: github.com/opencontainers/image-spec
dependency-version: 1.1.1
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-08 20:06:46 +00:00
Ettore Di Giacinto
e905e90dd7
Add MLX-audio entry to compatibility table
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-09-08 09:54:01 +02:00
LocalAI [bot]
08432d49e5
chore: ⬆️ Update ggml-org/llama.cpp to 3976dfbe00f02a62c0deca32c46138e4f0ca81d8 ( #6214 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-08 08:33:33 +02:00
LocalAI [bot]
e51e2aacb9
chore: ⬆️ Update leejet/stable-diffusion.cpp to c648001030d4c2cc7c851fdaf509ee36d642dc99 ( #6215 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-07 21:04:02 +00:00
Richard Palethorpe
9c3d85fc28
chore: ⬆️ Update leejet/stable-diffusion.cpp to d7f430cd693f2e12ecbaa0ce881746cf305c3b1f ( #6213 )
...
* ⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
* fix(stablediffusion-ggml): Use new sample_params_t
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Signed-off-by: Richard Palethorpe <io@richiejp.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-07 16:34:45 +02:00
LocalAI [bot]
007ca647a7
chore(model-gallery): ⬆️ update checksum ( #6211 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-07 00:41:40 +02:00
LocalAI [bot]
59af928379
chore: ⬆️ Update ggml-org/llama.cpp to c4df49a42d396bdf7344501813e7de53bc9e7bb3 ( #6209 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-06 21:05:07 +00:00
LocalAI [bot]
dbc2bb561b
chore: ⬆️ Update ggml-org/llama.cpp to 408ff524b40baf4f51a81d42a9828200dd4fcb6b ( #6207 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-06 09:09:57 +02:00
LocalAI [bot]
c72c85dcac
chore: ⬆️ Update ggml-org/whisper.cpp to bb0e1fc60f26a707cabf724edcf7cfcab2a269b6 ( #6203 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-06 09:08:34 +02:00
Gianluca Boiano
ef984901e6
chore(model gallery): add MiniCPM-V-4.5-8b-q4_K_M ( #6205 )
...
Signed-off-by: Gianluca Boiano <morf3089@gmail.com >
2025-09-05 22:12:31 +02:00
Aliz Fara
9911ec84a3
Fix Typos in Docs ( #6204 )
...
Signed-off-by: alizfara112 <alizfaraafa@gmail.com >
2025-09-05 22:11:21 +02:00
LocalAI [bot]
1956681d4c
chore: ⬆️ Update ggml-org/llama.cpp to fb15d649ed14ab447eeab911e0c9d21e35fb243e ( #6202 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-05 08:42:50 +02:00
LocalAI [bot]
326f6e5ccb
docs: ⬆️ update docs version mudler/LocalAI ( #6201 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-04 21:03:02 +00:00
Ettore Di Giacinto
302958efd6
fix(p2p): automatically install llama-cpp for p2p workers ( #6199 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-09-04 21:57:39 +02:00
Ettore Di Giacinto
3dc86b247d
fix: make sure to turn down all processes on exit ( #6200 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-04 21:37:28 +02:00
Ettore Di Giacinto
5ec724af06
chore(model gallery): fix whisper model gallery links
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-04 13:32:09 +02:00
Ettore Di Giacinto
1f1e156bf0
chore(model gallery): add nousresearch_hermes-4-14b ( #6197 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-04 09:48:53 +02:00
LocalAI [bot]
df625e366a
chore: ⬆️ Update leejet/stable-diffusion.cpp to 2eb3845df5675a71565d5a9e13b7bad0881fafcd ( #6192 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-04 07:35:58 +02:00
LocalAI [bot]
9e6685ac9c
chore: ⬆️ Update ggml-org/llama.cpp to 0fce7a1248b74148c1eb0d368b7e18e8bcb96809 ( #6193 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-04 07:35:28 +02:00
Ettore Di Giacinto
90c818aa71
Update DMG file path in release workflow
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-09-04 07:34:27 +02:00
Ettore Di Giacinto
034b9b691b
chore(ci): fixup release pipeline
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-03 22:22:47 +02:00
dependabot[bot]
ba52822e5c
chore(deps): bump github.com/docker/docker from 28.0.0+incompatible to 28.3.3+incompatible ( #6181 )
...
chore(deps): bump github.com/docker/docker
Bumps [github.com/docker/docker](https://github.com/docker/docker ) from 28.0.0+incompatible to 28.3.3+incompatible.
- [Release notes](https://github.com/docker/docker/releases )
- [Commits](https://github.com/docker/docker/compare/v28.0.0...v28.3.3 )
---
updated-dependencies:
- dependency-name: github.com/docker/docker
dependency-version: 28.3.3+incompatible
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-03 13:42:54 +00:00
dependabot[bot]
eb30f6c090
chore(deps): bump github.com/tmc/langchaingo from 0.1.12 to 0.1.13 ( #6190 )
...
Bumps [github.com/tmc/langchaingo](https://github.com/tmc/langchaingo ) from 0.1.12 to 0.1.13.
- [Release notes](https://github.com/tmc/langchaingo/releases )
- [Commits](https://github.com/tmc/langchaingo/compare/v0.1.12...v0.1.13 )
---
updated-dependencies:
- dependency-name: github.com/tmc/langchaingo
dependency-version: 0.1.13
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-03 14:22:51 +02:00
Ettore Di Giacinto
caba098959
chore(model gallery): add invisietch_l3.3-ignition-v0.1-70b ( #6189 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-03 09:49:31 +02:00
Ettore Di Giacinto
3c75ea1e0e
chore(model gallery): add aurore-reveil_koto-small-7b-it ( #6188 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-03 09:47:57 +02:00
Ettore Di Giacinto
c5f911812f
chore(model gallery): add nousresearch_hermes-4-70b ( #6187 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-03 09:38:32 +02:00
LocalAI [bot]
d82922786a
chore: ⬆️ Update ggml-org/llama.cpp to 3de008208b9b8a33f49f979097a99b4d59e6e521 ( #6185 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-02 21:07:53 +00:00
dependabot[bot]
d9e9bb4c0e
chore(deps): bump github.com/charmbracelet/glamour from 0.7.0 to 0.10.0 ( #6183 )
...
Bumps [github.com/charmbracelet/glamour](https://github.com/charmbracelet/glamour ) from 0.7.0 to 0.10.0.
- [Release notes](https://github.com/charmbracelet/glamour/releases )
- [Changelog](https://github.com/charmbracelet/glamour/blob/master/.goreleaser.yml )
- [Commits](https://github.com/charmbracelet/glamour/compare/v0.7.0...v0.10.0 )
---
updated-dependencies:
- dependency-name: github.com/charmbracelet/glamour
dependency-version: 0.10.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-02 18:32:41 +00:00
dependabot[bot]
657027bec6
chore(deps): bump github.com/chasefleming/elem-go from 0.26.0 to 0.31.0 ( #6178 )
...
Bumps [github.com/chasefleming/elem-go](https://github.com/chasefleming/elem-go ) from 0.26.0 to 0.31.0.
- [Release notes](https://github.com/chasefleming/elem-go/releases )
- [Commits](https://github.com/chasefleming/elem-go/compare/v0.26.0...v0.31.0 )
---
updated-dependencies:
- dependency-name: github.com/chasefleming/elem-go
dependency-version: 0.31.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-02 17:25:56 +00:00
dependabot[bot]
2f5635308d
chore(deps): bump github.com/onsi/gomega from 1.36.2 to 1.38.2 ( #6179 )
...
Bumps [github.com/onsi/gomega](https://github.com/onsi/gomega ) from 1.36.2 to 1.38.2.
- [Release notes](https://github.com/onsi/gomega/releases )
- [Changelog](https://github.com/onsi/gomega/blob/master/CHANGELOG.md )
- [Commits](https://github.com/onsi/gomega/compare/v1.36.2...v1.38.2 )
---
updated-dependencies:
- dependency-name: github.com/onsi/gomega
dependency-version: 1.38.2
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-02 18:03:35 +02:00
dependabot[bot]
63b5338dbd
chore(deps): bump github.com/microcosm-cc/bluemonday from 1.0.26 to 1.0.27 ( #6177 )
...
chore(deps): bump github.com/microcosm-cc/bluemonday
Bumps [github.com/microcosm-cc/bluemonday](https://github.com/microcosm-cc/bluemonday ) from 1.0.26 to 1.0.27.
- [Release notes](https://github.com/microcosm-cc/bluemonday/releases )
- [Commits](https://github.com/microcosm-cc/bluemonday/compare/v1.0.26...v1.0.27 )
---
updated-dependencies:
- dependency-name: github.com/microcosm-cc/bluemonday
dependency-version: 1.0.27
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-02 12:16:24 +00:00
dependabot[bot]
3150174962
chore(deps): bump github.com/jaypipes/ghw from 0.12.0 to 0.19.1 ( #6176 )
...
Bumps [github.com/jaypipes/ghw](https://github.com/jaypipes/ghw ) from 0.12.0 to 0.19.1.
- [Release notes](https://github.com/jaypipes/ghw/releases )
- [Commits](https://github.com/jaypipes/ghw/compare/v0.12.0...v0.19.1 )
---
updated-dependencies:
- dependency-name: github.com/jaypipes/ghw
dependency-version: 0.19.1
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-02 13:04:35 +02:00
LocalAI [bot]
4330fdce33
chore: ⬆️ Update ggml-org/llama.cpp to d4d8dbe383e8b9600cbe8b42016e3a4529b51219 ( #6172 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-02 09:12:03 +02:00
Richard Palethorpe
fef8583144
fix(ci): Set default Darwin backend lang to python ( #6175 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-09-02 09:11:42 +02:00
LocalAI [bot]
d4d6a56a4f
chore: ⬆️ Update leejet/stable-diffusion.cpp to 4c6475f9176bf99271ccf5a2817b30a490b83db0 ( #6171 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-01 23:25:12 +02:00
Ettore Di Giacinto
2900a601a0
chore(backends): add stablediffusion-ggml and whisper for metal ( #6173 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-01 22:30:35 +02:00
Ettore Di Giacinto
43e0437db6
Revise GPU usage recommendations in documentation
...
Updated recommendations for GPU usage on Xorg.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-09-01 22:20:41 +02:00
Richard Palethorpe
976c159fdb
chore(ci): Build some Go based backends on Darwin ( #6164 )
...
* chore(ci): Build Go based backends on Darwin
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* chore(stablediffusion-ggml): Fixes for building on Darwin
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* chore(whisper): Build on Darwin
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-09-01 22:18:30 +02:00
LocalAI [bot]
969922ffec
chore: ⬆️ Update ggml-org/llama.cpp to e92d53b29e393fc4c0f9f1f7c3fe651be8d36faa ( #6169 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-01 08:06:54 +00:00
Ettore Di Giacinto
739573e41b
feat(flash_attention): set auto for flash_attention in llama.cpp ( #6168 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-31 17:59:09 +02:00
LocalAI [bot]
dbdf2908ad
chore: ⬆️ Update ggml-org/llama.cpp to 3d16b29c3bb1ec816ac0e782f20d169097063919 ( #6165 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-29 21:14:03 +00:00
dependabot[bot]
317f8641dc
chore(deps): bump the go_modules group with 4 updates ( #6161 )
...
Bumps the go_modules group with 4 updates: [github.com/containerd/containerd](https://github.com/containerd/containerd ), [github.com/gofiber/fiber/v2](https://github.com/gofiber/fiber ), [github.com/docker/docker](https://github.com/docker/docker ) and [github.com/ulikunitz/xz](https://github.com/ulikunitz/xz ).
Updates `github.com/containerd/containerd` from 1.7.19 to 1.7.27
- [Release notes](https://github.com/containerd/containerd/releases )
- [Changelog](https://github.com/containerd/containerd/blob/main/RELEASES.md )
- [Commits](https://github.com/containerd/containerd/compare/v1.7.19...v1.7.27 )
Updates `github.com/gofiber/fiber/v2` from 2.52.5 to 2.52.9
- [Release notes](https://github.com/gofiber/fiber/releases )
- [Commits](https://github.com/gofiber/fiber/compare/v2.52.5...v2.52.9 )
Updates `github.com/docker/docker` from 27.1.1+incompatible to 28.0.0+incompatible
- [Release notes](https://github.com/docker/docker/releases )
- [Commits](https://github.com/docker/docker/compare/v27.1.1...v28.0.0 )
Updates `github.com/ulikunitz/xz` from 0.5.9 to 0.5.14
- [Commits](https://github.com/ulikunitz/xz/compare/v0.5.9...v0.5.14 )
---
updated-dependencies:
- dependency-name: github.com/containerd/containerd
dependency-version: 1.7.27
dependency-type: direct:production
dependency-group: go_modules
- dependency-name: github.com/gofiber/fiber/v2
dependency-version: 2.52.9
dependency-type: direct:production
dependency-group: go_modules
- dependency-name: github.com/docker/docker
dependency-version: 28.0.0+incompatible
dependency-type: direct:production
dependency-group: go_modules
- dependency-name: github.com/ulikunitz/xz
dependency-version: 0.5.14
dependency-type: indirect
dependency-group: go_modules
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-29 08:12:12 +02:00
LocalAI [bot]
54ff70e451
feat(swagger): update swagger ( #6162 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-29 08:11:34 +02:00
LocalAI [bot]
723f01c87e
chore: ⬆️ Update ggml-org/llama.cpp to c97dc093912ad014f6d22743ede0d4d7fd82365a ( #6163 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-28 21:16:18 +00:00
Ettore Di Giacinto
79a41a5e07
fix: register backends to model-loader during installation ( #6159 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-28 19:11:02 +02:00
Matt Cowger
d0b6aa3f7d
feat(gallery): Add 'Get Config' button for models ( #6154 )
...
* feat(gallery): Add 'Get Config' button for models
This commit introduces a 'Get Config' button to the model gallery UI. This allows users to download and save the configuration file for a model without installing the model's weights.
Key changes:
- Added a getConfigButton element and integrated it into the gallery card.
- Created a new API endpoint /browse/config/model/:id to handle fetching and saving the model configuration.
- Refactored the InstallModel function to allow saving only the configuration file without downloading model weights.
- Added a ToYAML method on ModelConfig for serialization.
- Fixed button spacing in the gallery UI.
Signed-off-by: Matt Cowger <matt.cowger@sigmacomputing.com >
* Update for reviewer comments
Signed-off-by: Matt Cowger <matt.cowger@sigmacomputing.com >
---------
Signed-off-by: Matt Cowger <matt.cowger@sigmacomputing.com >
2025-08-28 18:32:49 +02:00
Ettore Di Giacinto
ad99399c6e
chore: stream errors while streaming SSE ( #6160 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-28 18:27:23 +02:00
Richard Palethorpe
e6ebfd3ba1
feat(whisper-cpp): Convert to Purego and add VAD ( #6087 )
...
* fix(ci): Avoid matching wrong backend with the same prefix
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* chore(whisper): Use Purego and enable VAD
This replaces the Whisper CGO bindings with our own Purego based module
to make compilation easier.
In addition this allows VAD models to be loaded by Whisper. There is not
much benefit now except that the same backend can be used for VAD and
transcription. Depending on upstream we may also be able to use GPU for
VAD in the future, but presently it is disabled.
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-08-28 17:25:18 +02:00
Ettore Di Giacinto
ead00a28b9
Add 'optimum-quanto' to requirements
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-08-28 13:32:03 +02:00
Ettore Di Giacinto
9621edb4c5
feat(diffusers): add support for wan2.2 ( #6153 )
...
* feat(diffusers): add support for wan2.2
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(ci): use ttl.sh for PRs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add ftfy deps
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Revert "chore(ci): use ttl.sh for PRs"
This reverts commit c9fc3ecf28 .
* Simplify
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore: do not pin torch/torchvision on cuda12
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-28 10:26:42 +02:00
Ettore Di Giacinto
7ce92f0646
fix: select portable environment if detected ( #6158 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-28 10:00:54 +02:00
LocalAI [bot]
6a4ab3c1e0
chore: ⬆️ Update ggml-org/llama.cpp to fbef0fad7a7c765939f6c9e322fa05cd52cf0c15 ( #6155 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-27 21:09:34 +00:00
Ettore Di Giacinto
83b85494c1
Update README with new resource links
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-08-27 16:24:23 +02:00
Matt Cowger
df6a80b38d
feat: Add a model refresh button to manually refresh on-disk yaml ( #6150 )
...
Add a model refresh button
2025-08-27 09:44:40 +02:00
LocalAI [bot]
21faa4114b
chore: ⬆️ Update ggml-org/llama.cpp to 8b696861364360770e9f61a3422d32941a477824 ( #6151 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-26 22:07:38 +00:00
Ettore Di Giacinto
e35ad56602
chore(docs): add backends README
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-26 19:39:05 +02:00
Ettore Di Giacinto
3be8b2d8e1
chore(refactor): cli -> cmd, update docs ( #6148 )
...
* chore(refactor): cli -> cmd
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Update README
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-26 19:07:10 +02:00
Ettore Di Giacinto
900745bb4d
chore(model gallery): add opengvlab_internvl3_5-2b ( #6147 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-26 18:09:49 +02:00
Ettore Di Giacinto
15a7fc7e9a
chore(model gallery): add opengvlab_internvl3_5-4b ( #6146 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-26 17:55:53 +02:00
Ettore Di Giacinto
03dddec538
chore(model gallery): add opengvlab_internvl3_5-8b ( #6145 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-26 17:49:31 +02:00
Ettore Di Giacinto
3d34386712
chore(model gallery): add opengvlab_internvl3_5-14b ( #6144 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-26 17:17:49 +02:00
Ettore Di Giacinto
1b3f66018b
chore(model gallery): add opengvlab_internvl3_5-30b-a3b ( #6143 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-26 16:39:46 +02:00
Ettore Di Giacinto
4381e892b8
Revert "CI tests"
...
This reverts commit 913e132466 .
2025-08-26 15:26:23 +02:00
Ettore Di Giacinto
3c3f477854
feat(mlx-audio): Add mlx-audio backend ( #6138 )
...
* feat(mlx-audio): Add mlx-audio backend
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* improve loading
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* CI tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: set C_INCLUDE_PATH to point to python install
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-26 15:27:06 +02:00
Ettore Di Giacinto
f8a8cf3e95
feat(launcher): add LocalAI launcher app ( #6127 )
...
* Add launcher (WIP)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Update gomod
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Cleanup, focus on systray
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Separate launcher from main
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add a way to identify the binary version
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Implement save config, and start on boot
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Small fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Save installed version as metadata
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Stop LocalAI on quit
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fix goreleaser
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Check first if binary is there
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* do not show version if we don't have it
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Try to build on CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* use fyne package
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add to release
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fyne.Do
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* show WEBUI button only if LocalAI is started
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Default to localhost
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Show rel notes
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Update logo
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Small improvements and fix tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Try to fix e2e tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-26 14:22:04 +02:00
LocalAI [bot]
0fc88b3cdf
chore: ⬆️ Update ggml-org/llama.cpp to c4e9239064a564de7b94ee2b401ae907235a8fca ( #6139 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-26 12:18:58 +02:00
Ettore Di Giacinto
4993df81c3
fix(metal-llama.cpp): add all libutf8_validity
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-26 10:19:55 +02:00
Ettore Di Giacinto
599bc88c6c
fix(hipblas-llama.cpp): create symlink to libomp ( #6140 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-26 10:05:17 +02:00
LocalAI [bot]
1a0d06f3db
chore: ⬆️ Update ggml-org/llama.cpp to 043fb27d3808766d8ea8195bbd12359727264402 ( #6137 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-25 08:57:09 +02:00
LocalAI [bot]
5e1a8b3621
chore: ⬆️ Update ggml-org/whisper.cpp to 7745fcf32846006128f16de429cfe1677c963b30 ( #6136 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-24 21:12:13 +00:00
Ettore Di Giacinto
960e51e527
chore(diffusers): support both src and reference_images in diffusers ( #6135 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-24 22:03:08 +02:00
Ettore Di Giacinto
195aa22e77
chore(docs): update list of supported backends ( #6134 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-24 20:09:19 +02:00
Ettore Di Giacinto
be132fe816
Revise latest project news in README
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-08-24 11:50:20 +02:00
Ettore Di Giacinto
ff5d2dc8be
Revert "fix(rfdetr): use cpu torch for cpu builds" ( #6131 )
...
Revert "fix(rfdetr): use cpu torch for cpu builds (#6129 )"
This reverts commit fec8a36b36 .
2025-08-24 11:41:08 +02:00
Ettore Di Giacinto
c1cfa08226
chore(Dockerfile): drop python from images ( #6130 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-24 11:40:32 +02:00
Ettore Di Giacinto
fec8a36b36
fix(rfdetr): use cpu torch for cpu builds ( #6129 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-24 10:17:25 +02:00
Ettore Di Giacinto
5d4f5d2355
feat(backends): add CPU variant for diffusers backend ( #6128 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-24 10:17:10 +02:00
LocalAI [bot]
057248008f
chore: ⬆️ Update ggml-org/llama.cpp to 710dfc465a68f7443b87d9f792cffba00ed739fe ( #6126 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-24 08:41:39 +02:00
Ettore Di Giacinto
9f2c9cd691
feat(llama.cpp): Add gfx1201 support ( #6125 )
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-08-23 23:06:01 +02:00
Ettore Di Giacinto
6971f71a6c
Add mlx-vlm ( #6119 )
...
* Add mlx-vlm
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add to CI workflows
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add requirements-mps.txt
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Simplify
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-23 23:05:30 +02:00
Ettore Di Giacinto
1ba66d00f5
feat: bundle python inside backends ( #6123 )
...
* feat(backends): bundle python
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* test ci
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* vllm on self-hosted
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add clang
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Try to fix it for Mac
* Relocate links only when is portable
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Make sure to call macosPortableEnv
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Use self-hosted for vllm
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-23 22:36:39 +02:00
Ettore Di Giacinto
259383cf5e
chore(deps): bump llama.cpp to '45363632cbd593537d541e81b600242e0b3d47fc' ( #6122 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-23 08:39:10 +02:00
Ettore Di Giacinto
209c0694f5
Update backend.yml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-08-22 23:36:24 +02:00
Ettore Di Giacinto
0fd395d6ec
feat(diffusers): add MPS version ( #6121 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-22 23:14:54 +02:00
Ettore Di Giacinto
d04bd47116
chore(Makefile): small fixup for darwin MLX builds
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-22 08:52:29 +02:00
Ettore Di Giacinto
1d830ce7dd
feat(mlx): add mlx backend ( #6049 )
...
* chore: allow to install with pip
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* WIP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Make the backend to build and actually work
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* List models from system only
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add script to build darwin python backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Run protogen in libbackend
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Detect if mps is available across python backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* CI: try to build backend
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Debug CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Index mlx-vlm
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Remove mlx-vlm
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Drop CI test
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-22 08:42:29 +02:00
LocalAI [bot]
6dccfb09f8
chore: ⬆️ Update ggml-org/llama.cpp to cd36b5e5c7fed2a3ac671dd542d579ca40b48b54 ( #6118 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-22 07:57:27 +02:00
LocalAI [bot]
e4d9cf8349
chore: ⬆️ Update ggml-org/llama.cpp to 7a6e91ad26160dd6dfb33d29ac441617422f28e7 ( #6116 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-20 21:05:39 +00:00
Ettore Di Giacinto
c899e90277
Update image-generation.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-08-20 10:37:11 +02:00
Ettore Di Giacinto
8193d18c7c
feat(img2img): Add support to Qwen Image Edit ( #6113 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-20 10:18:55 +02:00
LocalAI [bot]
2e4dc6456f
chore: ⬆️ Update ggml-org/llama.cpp to fb22dd07a639e81c7415e30b146f545f1a2f2caf ( #6112 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-20 09:01:36 +02:00
LocalAI [bot]
4594430a3e
feat(swagger): update swagger ( #6111 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-19 22:56:00 +02:00
Ettore Di Giacinto
9c7f92c81f
feat(p2p): automatically sync installed models between instances ( #6108 )
...
* feat(p2p): sync models between federated nodes
This change makes sure that between federated nodes all the models are
synced with each other.
Note: this works exclusively with models belonging to a gallery. It does
not sync files between the nodes, but rather it synces the node setup.
E.g. All the nodes needs to have configured the same galleries and
install models without any local editing.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Make nodes stable
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups on syncing
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* ui: improve p2p view
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-19 19:37:46 +02:00
Ettore Di Giacinto
060037bcd4
Revert "chore(deps): bump transformers from 4.48.3 to 4.55.2 in /backend/python/coqui" ( #6105 )
...
Revert "chore(deps): bump transformers from 4.48.3 to 4.55.2 in /backend/pyth…"
This reverts commit 27ce570844 .
2025-08-19 15:00:33 +02:00
Ettore Di Giacinto
d9da4676b4
Revert "chore(deps): bump torch from 2.3.1+cxx11.abi to 2.8.0 in /backend/python/coqui" ( #6104 )
...
Revert "chore(deps): bump torch from 2.3.1+cxx11.abi to 2.8.0 in /backend/pyt…"
This reverts commit 42c7859ab1 .
2025-08-19 15:00:11 +02:00
Ettore Di Giacinto
5ef4c2e471
feat(diffusers): add torchvision to support qwen-image-edit ( #6103 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-19 12:05:48 +02:00
dependabot[bot]
27ce570844
chore(deps): bump transformers from 4.48.3 to 4.55.2 in /backend/python/coqui ( #6096 )
...
chore(deps): bump transformers in /backend/python/coqui
Bumps [transformers](https://github.com/huggingface/transformers ) from 4.48.3 to 4.55.2.
- [Release notes](https://github.com/huggingface/transformers/releases )
- [Commits](https://github.com/huggingface/transformers/compare/v4.48.3...v4.55.2 )
---
updated-dependencies:
- dependency-name: transformers
dependency-version: 4.55.2
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-19 09:44:01 +00:00
dependabot[bot]
42c7859ab1
chore(deps): bump torch from 2.3.1+cxx11.abi to 2.8.0 in /backend/python/coqui ( #6099 )
...
chore(deps): bump torch in /backend/python/coqui
Bumps [torch](https://github.com/pytorch/pytorch ) from 2.3.1+cxx11.abi to 2.8.0.
- [Release notes](https://github.com/pytorch/pytorch/releases )
- [Changelog](https://github.com/pytorch/pytorch/blob/main/RELEASE.md )
- [Commits](https://github.com/pytorch/pytorch/commits/v2.8.0 )
---
updated-dependencies:
- dependency-name: torch
dependency-version: 2.8.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-19 08:42:52 +00:00
Ettore Di Giacinto
e7e83d0fa6
Revert "chore(deps): bump intel-extension-for-pytorch from 2.3.110+xpu to 2.8.10+xpu in /backend/python/coqui" ( #6102 )
...
Revert "chore(deps): bump intel-extension-for-pytorch from 2.3.110+xpu to 2.8…"
This reverts commit c6dc1d86f1 .
2025-08-19 09:29:56 +02:00
dependabot[bot]
c6dc1d86f1
chore(deps): bump intel-extension-for-pytorch from 2.3.110+xpu to 2.8.10+xpu in /backend/python/coqui ( #6095 )
...
chore(deps): bump intel-extension-for-pytorch in /backend/python/coqui
Bumps intel-extension-for-pytorch from 2.3.110+xpu to 2.8.10+xpu.
---
updated-dependencies:
- dependency-name: intel-extension-for-pytorch
dependency-version: 2.8.10+xpu
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-19 07:09:47 +00:00
dependabot[bot]
6fd2e1964d
chore(deps): bump grpcio from 1.71.0 to 1.74.0 in /backend/python/coqui ( #6097 )
...
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.71.0 to 1.74.0.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.71.0...v1.74.0 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.74.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-19 08:11:58 +02:00
dependabot[bot]
49ae41b716
chore(deps): bump securego/gosec from 2.22.7 to 2.22.8 ( #6098 )
...
Bumps [securego/gosec](https://github.com/securego/gosec ) from 2.22.7 to 2.22.8.
- [Release notes](https://github.com/securego/gosec/releases )
- [Changelog](https://github.com/securego/gosec/blob/master/.goreleaser.yml )
- [Commits](https://github.com/securego/gosec/compare/v2.22.7...v2.22.8 )
---
updated-dependencies:
- dependency-name: securego/gosec
dependency-version: 2.22.8
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-19 08:11:26 +02:00
dependabot[bot]
b3f0ed62fd
chore(deps): bump actions/checkout from 4 to 5 ( #6101 )
...
Bumps [actions/checkout](https://github.com/actions/checkout ) from 4 to 5.
- [Release notes](https://github.com/actions/checkout/releases )
- [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md )
- [Commits](https://github.com/actions/checkout/compare/v4...v5 )
---
updated-dependencies:
- dependency-name: actions/checkout
dependency-version: '5'
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-19 08:10:54 +02:00
LocalAI [bot]
4b9afc418b
chore: ⬆️ Update ggml-org/whisper.cpp to fc45bb86251f774ef817e89878bb4c2636c8a58f ( #6089 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-19 08:10:25 +02:00
LocalAI [bot]
e44ff8514b
chore: ⬆️ Update ggml-org/llama.cpp to 6d7f1117e3e3285d0c5c11b5ebb0439e27920082 ( #6088 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-19 08:09:49 +02:00
dependabot[bot]
2b6be10b6b
chore(deps): bump protobuf from 6.31.0 to 6.32.0 in /backend/python/transformers ( #6100 )
...
chore(deps): bump protobuf in /backend/python/transformers
Bumps [protobuf](https://github.com/protocolbuffers/protobuf ) from 6.31.0 to 6.32.0.
- [Release notes](https://github.com/protocolbuffers/protobuf/releases )
- [Changelog](https://github.com/protocolbuffers/protobuf/blob/main/protobuf_release.bzl )
- [Commits](https://github.com/protocolbuffers/protobuf/commits )
---
updated-dependencies:
- dependency-name: protobuf
dependency-version: 6.32.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-19 05:09:17 +00:00
Ettore Di Giacinto
1361d844a1
chore(model gallery): add lfm2-1.2b ( #6092 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-18 22:24:50 +02:00
Ettore Di Giacinto
fcc521cae5
chore(model gallery): add lfm2-vl-1.6b ( #6091 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-18 22:21:35 +02:00
Ettore Di Giacinto
8cad7138be
chore(model gallery): add lfm2-vl-450m ( #6090 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-18 22:17:03 +02:00
Richard Palethorpe
ebd1db2f09
chore(ci): Build modified backends on PR ( #6086 )
...
* chore(stablediffusion-ggml): rm redundant comment
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* chore(ci): Build modified backends on PR
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-08-18 17:56:34 +02:00
LocalAI [bot]
7920d75805
chore: ⬆️ Update ggml-org/llama.cpp to 21c17b5befc5f6be5992bc87fc1ba99d388561df ( #6084 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-18 08:26:58 +00:00
Ettore Di Giacinto
1d0e24a865
chore(model gallery): add impish_longtail_12b ( #6082 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-17 09:41:01 +02:00
LocalAI [bot]
9eed5ef872
chore: ⬆️ Update ggml-org/llama.cpp to 1fe00296f587dfca0957e006d146f5875b61e43d ( #6079 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-16 21:10:03 +00:00
Fabio Erculiani
39ab80442a
Tune the "dark gray" font color in the webui to make it more readable ( #6078 )
...
Tune the "dark gray" font color in the LocalAI webui to make it more readable.
2025-08-16 21:16:25 +02:00
Ettore Di Giacinto
1b101df2c0
chore: InTrustedRoot -> VerifyPath
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-16 16:30:13 +02:00
Richard Palethorpe
784bd5db33
chore(build): Use Purego with stablediffusion backend ( #6067 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-08-16 12:21:29 +02:00
Ettore Di Giacinto
b8b1ca782c
chore(model gallery): add wingless_imp_8b-i1 ( #6077 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-16 08:45:38 +02:00
Ettore Di Giacinto
1149fb66d3
chore(model gallery): add thedrummer_gemma-3-r1-4b-v1 ( #6076 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-16 08:39:10 +02:00
LocalAI [bot]
243e86176e
chore: ⬆️ Update ggml-org/llama.cpp to 5e6229a8409ac786e62cb133d09f1679a9aec13e ( #6070 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-16 08:38:57 +02:00
Ettore Di Giacinto
8da38a0d10
chore(model gallery): add thedrummer_gemma-3-r1-12b-v1 ( #6075 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-16 08:35:49 +02:00
Ettore Di Giacinto
60786fc876
chore(model gallery): add thedrummer_gemma-3-r1-27b-v1 ( #6074 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-16 08:31:42 +02:00
LocalAI [bot]
9486b88a25
chore: ⬆️ Update ggml-org/whisper.cpp to 040510a132f0a9b51d4692b57a6abfd8c9660696 ( #6069 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-16 08:30:54 +02:00
Ettore Di Giacinto
bef4c10629
feat(ui): General improvements ( #6072 )
...
* wip
* Simplify stop
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Improve UI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Show installed backends at the index
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Imporve UI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-16 07:44:50 +02:00
LocalAI [bot]
80f15851c5
chore(model-gallery): ⬆️ update checksum ( #6071 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-15 22:29:17 +02:00
Ettore Di Giacinto
22067e3384
chore(rocm): bump rocm image, add gfx1200 support ( #6065 )
...
Fixes: https://github.com/mudler/LocalAI/issues/6044
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-15 16:36:54 +02:00
Ettore Di Giacinto
4fbd639463
chore(ci): fixup builds for darwin and hipblas
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-15 15:58:02 +02:00
Ettore Di Giacinto
70f7d0c25f
Revert "chore(build): Convert stablediffusion-ggml backend to Purego ( #5989 )" ( #6064 )
...
This reverts commit 94cb20ae7f .
2025-08-15 15:18:40 +02:00
Ettore Di Giacinto
576e821298
chore(deps): bump llama.cpp to 'df36bce667bf14f8e538645547754386f9516326 ( #6062 )
...
chore(deps): bump llama.cpp to 'df36bce667bf14f8e538645547754386f9516326'
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-15 13:28:15 +02:00
Ettore Di Giacinto
7293f26fcf
chore(ci): fix darwin image publish
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-15 08:51:44 +02:00
Ettore Di Giacinto
79973a28ad
chore(model gallery): add gemma-3-270m-it-qat ( #6063 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-15 08:41:38 +02:00
Ettore Di Giacinto
8ab51509cc
Update Makefile
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-08-15 08:33:25 +02:00
Ettore Di Giacinto
b3384e5428
Update Makefile
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-08-15 08:08:24 +02:00
Ettore Di Giacinto
7050c9f69d
feat(webui): add import/edit model page ( #6050 )
...
* feat(webui): add import/edit model page
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Convert to a YAML editor
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Pass by the baseurl
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Simplify
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Improve visibility of the yaml editor
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add test file
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Make reset work
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Emit error only if we can't delete the model yaml file
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-14 23:48:09 +02:00
Ettore Di Giacinto
089efe05fd
feat(backends): add system backend, refactor ( #6059 )
...
- Add a system backend path
- Refactor and consolidate system information in system state
- Use system state in all the components to figure out the system paths
to used whenever needed
- Refactor BackendConfig -> ModelConfig. This was otherway misleading as
now we do have a backend configuration which is not the model config.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-14 19:38:26 +02:00
Ettore Di Giacinto
253b7537dc
fix(llama-cpp/darwin): make sure to bundle libutf8 libs ( #6060 )
...
fix(darwin): make sure to bundle libutf8_validity
Plus some refactoring, use makefile
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-14 17:56:35 +02:00
Ettore Di Giacinto
19c92c70c5
fix(backend-detection): default to CPU if there is less than 4GB of GPU available ( #6057 )
...
fix(gpu-detection): default to CPU if there is less than 4GB of GPU available
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-14 16:57:33 +02:00
Ettore Di Giacinto
b52bfaf1b3
fix: do not show invalid backends ( #6058 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-14 13:01:56 +02:00
Ettore Di Giacinto
bf60ca5bf0
Update Makefile
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-08-14 11:53:43 +02:00
LocalAI [bot]
2b44467bd1
chore: ⬆️ Update ggml-org/llama.cpp to 29c8fbe4e05fd23c44950d0958299e25fbeabc5c ( #6054 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-14 09:19:15 +02:00
LocalAI [bot]
8c1f4a131e
chore: ⬆️ Update ggml-org/whisper.cpp to 16c2924cb2c4b5c9f79220aa7708eb5b346b029b ( #6055 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-13 21:08:22 +00:00
Ettore Di Giacinto
10a3f0bd92
fix: chmod grpc processes only if needed ( #6051 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-13 12:08:53 +02:00
LocalAI [bot]
72f4d541d0
chore: ⬆️ Update ggml-org/llama.cpp to f4586ee5986d6f965becb37876d6f3666478a961 ( #6048 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-08-13 08:33:48 +02:00
LocalAI [bot]
9f812fdb84
chore: ⬆️ Update ggml-org/whisper.cpp to 5527454cdb3e15d7e2b8a6e2afcb58cb61651fd2 ( #6047 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-12 21:12:07 +00:00
LocalAI [bot]
b70ee45fff
docs: ⬆️ update docs version mudler/LocalAI ( #6046 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-12 22:05:50 +02:00
dependabot[bot]
9d9c853541
chore(deps): bump grpcio from 1.71.0 to 1.74.0 in /backend/python/transformers ( #6013 )
...
chore(deps): bump grpcio in /backend/python/transformers
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.71.0 to 1.74.0.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.71.0...v1.74.0 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.74.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-12 22:05:16 +02:00
Ettore Di Giacinto
18fcd8557c
fix(llama.cpp): support gfx1200 ( #6045 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-12 22:04:30 +02:00
dependabot[bot]
d8e27c38d7
chore(deps): bump oneccl-bind-pt from 2.3.100+xpu to 2.8.0+xpu in /backend/python/common/template ( #6016 )
...
chore(deps): bump oneccl-bind-pt in /backend/python/common/template
Bumps oneccl-bind-pt from 2.3.100+xpu to 2.8.0+xpu.
---
updated-dependencies:
- dependency-name: oneccl-bind-pt
dependency-version: 2.8.0+xpu
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-12 18:57:20 +00:00
dependabot[bot]
3b0dc87932
chore(deps): bump torch from 2.3.1+cxx11.abi to 2.8.0 in /backend/python/common/template ( #6025 )
...
chore(deps): bump torch in /backend/python/common/template
Bumps [torch](https://github.com/pytorch/pytorch ) from 2.3.1+cxx11.abi to 2.8.0.
- [Release notes](https://github.com/pytorch/pytorch/releases )
- [Changelog](https://github.com/pytorch/pytorch/blob/main/RELEASE.md )
- [Commits](https://github.com/pytorch/pytorch/commits/v2.8.0 )
---
updated-dependencies:
- dependency-name: torch
dependency-version: 2.8.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-12 17:58:33 +00:00
dependabot[bot]
2374485222
chore(deps): bump actions/download-artifact from 4 to 5 ( #6015 )
...
Bumps [actions/download-artifact](https://github.com/actions/download-artifact ) from 4 to 5.
- [Release notes](https://github.com/actions/download-artifact/releases )
- [Commits](https://github.com/actions/download-artifact/compare/v4...v5 )
---
updated-dependencies:
- dependency-name: actions/download-artifact
dependency-version: '5'
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-12 18:54:57 +02:00
dependabot[bot]
0ca1765c17
chore(deps): bump actions/checkout from 4 to 5 ( #6014 )
...
Bumps [actions/checkout](https://github.com/actions/checkout ) from 4 to 5.
- [Release notes](https://github.com/actions/checkout/releases )
- [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md )
- [Commits](https://github.com/actions/checkout/compare/v4...v5 )
---
updated-dependencies:
- dependency-name: actions/checkout
dependency-version: '5'
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-12 18:54:39 +02:00
dependabot[bot]
90b5ed9a1e
chore(deps): bump intel-extension-for-pytorch from 2.3.110+xpu to 2.8.10+xpu in /backend/python/common/template ( #6034 )
...
chore(deps): bump intel-extension-for-pytorch
Bumps intel-extension-for-pytorch from 2.3.110+xpu to 2.8.10+xpu.
---
updated-dependencies:
- dependency-name: intel-extension-for-pytorch
dependency-version: 2.8.10+xpu
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-12 18:44:33 +02:00
dependabot[bot]
d438b769da
chore(deps): bump intel-extension-for-pytorch from 2.3.110+xpu to 2.8.10+xpu in /backend/python/bark ( #6043 )
...
chore(deps): bump intel-extension-for-pytorch in /backend/python/bark
Bumps intel-extension-for-pytorch from 2.3.110+xpu to 2.8.10+xpu.
---
updated-dependencies:
- dependency-name: intel-extension-for-pytorch
dependency-version: 2.8.10+xpu
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-12 18:44:05 +02:00
dependabot[bot]
2e4bd1e33d
chore(deps): bump oneccl-bind-pt from 2.3.100+xpu to 2.8.0+xpu in /backend/python/rerankers ( #6021 )
...
chore(deps): bump oneccl-bind-pt in /backend/python/rerankers
Bumps oneccl-bind-pt from 2.3.100+xpu to 2.8.0+xpu.
---
updated-dependencies:
- dependency-name: oneccl-bind-pt
dependency-version: 2.8.0+xpu
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-12 16:04:54 +00:00
dependabot[bot]
ff73800970
chore(deps): bump grpcio from 1.71.0 to 1.74.0 in /backend/python/exllama2 ( #6019 )
...
chore(deps): bump grpcio in /backend/python/exllama2
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.71.0 to 1.74.0.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.71.0...v1.74.0 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.74.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-12 16:42:46 +02:00
Richard Palethorpe
94cb20ae7f
chore(build): Convert stablediffusion-ggml backend to Purego ( #5989 )
...
* Try converting SD to purego
* chore(build): Use Purego with stablediffusion backend
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-08-12 16:42:15 +02:00
dependabot[bot]
47c20f9adb
chore(deps): bump grpcio from 1.71.0 to 1.74.0 in /backend/python/rerankers ( #6022 )
...
chore(deps): bump grpcio in /backend/python/rerankers
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.71.0 to 1.74.0.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.71.0...v1.74.0 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.74.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-12 15:24:48 +02:00
dependabot[bot]
a7fe153630
chore(deps): bump grpcio from 1.71.0 to 1.74.0 in /backend/python/bark ( #6033 )
...
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.71.0 to 1.74.0.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.71.0...v1.74.0 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.74.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-12 15:16:00 +02:00
dependabot[bot]
27519d2233
chore(deps): bump grpcio from 1.71.0 to 1.74.0 in /backend/python/common/template ( #6035 )
...
chore(deps): bump grpcio in /backend/python/common/template
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.71.0 to 1.74.0.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.71.0...v1.74.0 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.74.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-12 15:15:28 +02:00
dependabot[bot]
8cab0f880b
chore(deps): bump sentence-transformers from 5.0.0 to 5.1.0 in /backend/python/transformers ( #6028 )
...
chore(deps): bump sentence-transformers in /backend/python/transformers
Bumps [sentence-transformers](https://github.com/UKPLab/sentence-transformers ) from 5.0.0 to 5.1.0.
- [Release notes](https://github.com/UKPLab/sentence-transformers/releases )
- [Commits](https://github.com/UKPLab/sentence-transformers/compare/v5.0.0...v5.1.0 )
---
updated-dependencies:
- dependency-name: sentence-transformers
dependency-version: 5.1.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-12 15:15:07 +02:00
dependabot[bot]
8c48b250c4
chore(deps): bump grpcio from 1.71.0 to 1.74.0 in /backend/python/diffusers ( #6037 )
...
chore(deps): bump grpcio in /backend/python/diffusers
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.71.0 to 1.74.0.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.71.0...v1.74.0 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.74.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-12 15:14:35 +02:00
dependabot[bot]
ba802c2ee4
chore(deps): bump grpcio from 1.71.0 to 1.74.0 in /backend/python/vllm ( #6036 )
...
Bumps [grpcio](https://github.com/grpc/grpc ) from 1.71.0 to 1.74.0.
- [Release notes](https://github.com/grpc/grpc/releases )
- [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md )
- [Commits](https://github.com/grpc/grpc/compare/v1.71.0...v1.74.0 )
---
updated-dependencies:
- dependency-name: grpcio
dependency-version: 1.74.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-12 15:14:15 +02:00
Ettore Di Giacinto
429bb7a88c
chore(model gallery): add baichuan-inc_baichuan-m2-32 ( #6042 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-12 09:43:13 +02:00
LocalAI [bot]
b2e8b6d1aa
chore: ⬆️ Update ggml-org/llama.cpp to be48528b068111304e4a0bb82c028558b5705f05 ( #6012 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-11 21:06:10 +00:00
LocalAI [bot]
fba5b557a1
chore: ⬆️ Update ggml-org/whisper.cpp to b02242d0adb5c6c4896d59ac86d9ec9fe0d0fe33 ( #6009 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-11 12:54:41 +02:00
LocalAI [bot]
6db19c5cb9
chore: ⬆️ Update ggml-org/llama.cpp to 79c1160b073b8148a404f3dd2584be1606dccc66 ( #6006 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-11 12:54:21 +02:00
Ettore Di Giacinto
5428678209
chore(ci): more cleanup
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-11 10:10:38 +02:00
LocalAI [bot]
06129139eb
chore(model-gallery): ⬆️ update checksum ( #6010 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-11 07:54:01 +02:00
Ettore Di Giacinto
05757e2738
feat(backends install): allow to specify name and alias during manual installation ( #5971 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-10 10:05:53 +02:00
Ettore Di Giacinto
240b790f29
chore(model gallery): add impish_nemo_12b ( #6007 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-10 10:05:20 +02:00
Ettore Di Giacinto
5f221f5946
fix(l4t-diffusers): add sentencepiece ( #6005 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-09 09:08:35 +02:00
LocalAI [bot]
def7cdc0bf
chore: ⬆️ Update ggml-org/llama.cpp to cd6983d56d2cce94ecb86bb114ae8379a609073c ( #6003 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-09 08:41:58 +02:00
Ettore Di Giacinto
ea9bf3dba2
Update backend.yml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-08-08 23:00:47 +02:00
Ettore Di Giacinto
b8eca530b6
feat(diffusers): add builds for nvidia-l4t ( #6004 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-08 22:48:38 +02:00
Ettore Di Giacinto
47034ddacd
chore(deps): bump edgevpn ( #6001 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-08 16:23:18 +02:00
Ettore Di Giacinto
9a41331855
chore(model gallery): add outetts ( #6000 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-08 12:55:58 +02:00
Ettore Di Giacinto
facc0181df
chore(model gallery): add chatterbox ( #5999 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-08 12:53:13 +02:00
Ettore Di Giacinto
4733adb983
chore: add Dia to the model gallery, fix backend ( #5998 )
...
* fix: correctly call OuteTTS and DiaTTS
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(model gallery): add dia
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-08 12:40:16 +02:00
Ettore Di Giacinto
326fda3223
chore(model gallery): add tarek07_nomad-llama-70b ( #5997 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-08 12:06:20 +02:00
Ettore Di Giacinto
abf61e5b42
chore(model gallery): add openai-gpt-oss-20b-abliterated-uncensored-neo-imatrix ( #5996 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-08 11:14:46 +02:00
Ettore Di Giacinto
2ae45e7635
chore(model gallery): add huihui-ai_huihui-gpt-oss-20b-bf16-abliterated ( #5995 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-08 11:01:52 +02:00
lnnt
7d41551e10
docs: update links in advanced-usage and models documentation ( #5994 )
...
* docs: update links in advanced-usage and models documentation
* docs: update links in advanced-usage and models documentation
2025-08-08 10:23:42 +02:00
LocalAI [bot]
6fbd720515
chore: ⬆️ Update ggml-org/whisper.cpp to 4245c77b654cd384ad9f53a4a302be716b3e5861 ( #5993 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-08 08:07:17 +02:00
LocalAI [bot]
4e40a8d1ed
chore: ⬆️ Update ggml-org/llama.cpp to a0552c8beef74e843bb085c8ef0c63f9ed7a2b27 ( #5992 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-07 21:13:14 +00:00
Ettore Di Giacinto
003b9292fe
feat(transformers): add support to Dia ( #5991 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-07 21:51:52 +02:00
Ettore Di Giacinto
09457b9221
chore(model gallery): add qwen_qwen3-4b-thinking-2507 ( #5988 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-07 09:28:37 +02:00
Ettore Di Giacinto
41aa7e107f
chore(model gallery): add qwen_qwen3-4b-instruct-2507 ( #5987 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-07 09:20:15 +02:00
Ettore Di Giacinto
bda875f962
chore(ci): run bark CI job to self-hosted
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-08-07 08:40:15 +02:00
LocalAI [bot]
224063f0f7
feat(swagger): update swagger ( #5983 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-07 00:32:11 +02:00
Ettore Di Giacinto
89978c8b57
fix(harmony): improve template by adding reasoning effort and system_prompt ( #5985 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-07 00:31:37 +02:00
Ettore Di Giacinto
987b5dcac1
chore(model gallery): add openai_gpt-oss-20b-neo ( #5986 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-07 00:31:17 +02:00
Ettore Di Giacinto
ec1276e5a9
fix(llama.cpp): do not default to linear rope ( #5982 )
...
This seems to somehow sneaked in during the initial pass to gRPC server,
instead of setting linear rope when required, we did default to it if
not specified.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-06 23:20:28 +02:00
LocalAI [bot]
61ba98d43d
chore: ⬆️ Update ggml-org/llama.cpp to e725a1a982ca870404a9c4935df52466327bbd02 ( #5984 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-06 21:17:20 +00:00
Ettore Di Giacinto
b9a25b16e6
feat: add reasoning effort and metadata to template ( #5981 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-06 21:56:05 +02:00
Ettore Di Giacinto
6a8149e1fd
fix: build kokoro-hipblas on self-hosted
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-06 15:50:54 +02:00
Ettore Di Giacinto
9c2840ac38
feat(kokoro): complete kokoro integration ( #5978 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-06 15:23:29 +02:00
Ettore Di Giacinto
20a70e1244
feat(backends): add KittenTTS ( #5977 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-06 12:38:45 +02:00
Ettore Di Giacinto
3295a298f4
feat(webui): allow to specify image size ( #5976 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-06 12:38:02 +02:00
Ettore Di Giacinto
da6f37f000
Update qwen-image.yaml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-08-06 10:40:46 +02:00
Ettore Di Giacinto
c092633cd7
feat(models): add support to qwen-image ( #5975 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-06 10:36:53 +02:00
Ettore Di Giacinto
7e2a522229
Update harmony.yaml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-08-05 23:58:48 +02:00
LocalAI [bot]
03e8592450
chore: ⬆️ Update ggml-org/llama.cpp to fd1234cb468935ea087d6929b2487926c3afff4b ( #5972 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-05 23:14:43 +02:00
Ettore Di Giacinto
f207bd1427
Update backend.yml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-08-05 23:14:11 +02:00
Ettore Di Giacinto
a5c0fe31c3
chore(models): add gpt-oss-120b ( #5974 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-05 23:13:24 +02:00
Ettore Di Giacinto
c68907ac65
chore(models): add gpt-oss-20b ( #5973 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-05 23:08:34 +02:00
Ettore Di Giacinto
9087ddc4de
chore(deps): bump torch and sentence-transformers ( #5969 )
...
* chore(deps): bump torch and sentence-transformers
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(ci): add backend build tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore: move jobs to self-hosted
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-05 19:45:20 +02:00
Ettore Di Giacinto
33bebd5114
chore(deps): bump torch and diffusers ( #5970 )
...
* chore(ci): add backend build tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(deps): bump torch and diffusers
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(ci): run diffusers/hipblas on self-hosted
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(ci): do not publish darwin if building from PRs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-05 14:47:01 +02:00
LocalAI [bot]
2913676157
chore: ⬆️ Update ggml-org/llama.cpp to 41613437ffee0dbccad684fc744788bc504ec213 ( #5968 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-04 23:16:30 +02:00
LocalAI [bot]
e83652489c
docs: ⬆️ update docs version mudler/LocalAI ( #5967 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-04 21:00:23 +00:00
Richard Palethorpe
d6274eaf4a
chore(build): Rename sycl to intel ( #5964 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-08-04 11:01:28 +02:00
LocalAI [bot]
4d90971424
chore: ⬆️ Update ggml-org/llama.cpp to d31192b4ee1441bbbecd3cbf9e02633368bdc4f5 ( #5965 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-03 21:03:20 +00:00
Ettore Di Giacinto
90f5639639
feat(backends): allow backends to not have a metadata file ( #5963 )
...
In this case we generate one on the fly and we infer the metadata we
can.
Obviously this have the side effect of not being able to register
potential aliases.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-03 16:47:02 +02:00
Ettore Di Giacinto
a35a701052
feat(backends): install from local path ( #5962 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-03 14:24:50 +02:00
Ettore Di Giacinto
3d8ec72dbf
chore(stable-diffusion): bump, set GGML_MAX_NAME ( #5961 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-08-03 10:47:02 +02:00
LocalAI [bot]
2a9d675d62
chore: ⬆️ Update ggml-org/llama.cpp to 5c0eb5ef544aeefd81c303e03208f768e158d93c ( #5959 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-08-02 23:35:24 +02:00
LocalAI [bot]
c782e8abf1
chore: ⬆️ Update ggml-org/whisper.cpp to 0becabc8d68d9ffa6ddfba5240e38cd7a2642046 ( #5958 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-02 21:04:13 +00:00
LocalAI [bot]
a1e1942d83
docs: ⬆️ update docs version mudler/LocalAI ( #5956 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-01 22:14:23 +02:00
Dedy F. Setyawan
787302b204
fix(docs): Improve responsiveness of tables ( #5954 )
...
Signed-off-by: Dedy F. Setyawan <dedyfajars@gmail.com >
2025-08-01 22:13:53 +02:00
LocalAI [bot]
0b085089b9
chore: ⬆️ Update ggml-org/llama.cpp to daf2dd788066b8b239cb7f68210e090c2124c199 ( #5951 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-08-01 08:25:36 +02:00
LocalAI [bot]
624f3b1fc8
feat(swagger): update swagger ( #5950 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-31 21:04:23 +00:00
Richard Palethorpe
c07bc55fee
fix(intel): Set GPU vendor on Intel images and cleanup ( #5945 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-07-31 19:44:46 +02:00
Ettore Di Giacinto
173e0774c0
chore(model gallery): add flux.1-krea-dev-ggml ( #5949 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-31 18:32:06 +02:00
Ettore Di Giacinto
8ece26ab7c
chore(model gallery): add flux.1-dev-ggml-abliterated-v2-q8_0 ( #5948 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-31 17:23:48 +02:00
Ettore Di Giacinto
d704cc7970
chore(model gallery): add flux.1-dev-ggml-q8_0 ( #5947 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-31 17:13:19 +02:00
Ettore Di Giacinto
ab17baaae1
chore(capability): improve messages ( #5944 )
...
* chore(capability): improve messages
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore: isolate to constants, do not detect from the first gpu
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-31 16:25:19 +02:00
Ettore Di Giacinto
ca358fcdca
feat(stablediffusion-ggml): allow to load loras ( #5943 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-31 16:25:05 +02:00
Ettore Di Giacinto
9aadfd485f
chore: update swagger ( #5946 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-31 16:22:27 +02:00
LocalAI [bot]
da3b0850de
chore: ⬆️ Update ggml-org/whisper.cpp to f7502dca872866a310fe69d30b163fa87d256319 ( #5941 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-31 09:26:30 +02:00
LocalAI [bot]
8b1e8b4cda
chore: ⬆️ Update ggml-org/llama.cpp to e9192bec564780bd4313ad6524d20a0ab92797db ( #5940 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-31 09:26:02 +02:00
Ettore Di Giacinto
3d22bfc27c
feat(stablediffusion-ggml): add support to ref images (flux Kontext) ( #5935 )
...
* feat(stablediffusion-ggml): add support to ref images
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add it to the model gallery
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-30 22:42:34 +02:00
Ettore Di Giacinto
4438b4361e
chore(model gallery): add qwen_qwen3-30b-a3b-thinking-2507 ( #5939 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-30 21:18:56 +02:00
Ettore Di Giacinto
04bad9a2da
chore(model gallery): add arcee-ai_afm-4.5b ( #5938 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-30 15:37:07 +02:00
Ettore Di Giacinto
8235e53602
chore(model gallery): add qwen_qwen3-30b-a3b-instruct-2507 ( #5936 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-30 15:29:34 +02:00
LocalAI [bot]
eb5c3670f1
chore: ⬆️ Update ggml-org/llama.cpp to aa79524c51fb014f8df17069d31d7c44b9ea6cb8 ( #5934 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-29 21:05:00 +00:00
LocalAI [bot]
89e61fca90
chore: ⬆️ Update ggml-org/whisper.cpp to d0a9d8c7f8f7b91c51d77bbaa394b915f79cde6b ( #5932 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-29 08:02:01 +02:00
LocalAI [bot]
9d6efe8842
chore: ⬆️ Update leejet/stable-diffusion.cpp to f6b9aa1a4373e322ff12c15b8a0749e6dd6f0253 ( #5930 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-29 08:01:30 +02:00
LocalAI [bot]
60726d16f2
chore: ⬆️ Update ggml-org/llama.cpp to 8ad7b3e65b5834e5574c2f5640056c9047b5d93b ( #5931 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-29 08:01:03 +02:00
LocalAI [bot]
9d7ec09ec0
docs: ⬆️ update docs version mudler/LocalAI ( #5929 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-28 21:03:44 +00:00
Ettore Di Giacinto
36179ffbed
fix(backend gallery): intel images for python-based backends, re-add exllama2 ( #5928 )
...
chore(backend gallery): fix intel images for python-based backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-28 15:15:19 +02:00
LocalAI [bot]
d25145e641
chore: ⬆️ Update ggml-org/llama.cpp to bf78f5439ee8e82e367674043303ebf8e92b4805 ( #5927 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-27 21:08:32 +00:00
Ettore Di Giacinto
949e5b9be8
feat(rfdetr): add object detection API ( #5923 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-27 22:02:51 +02:00
Ettore Di Giacinto
73ecb7f90b
chore: drop assistants endpoint ( #5926 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-27 21:06:09 +02:00
Ettore Di Giacinto
053bed6e5f
feat: normalize search ( #5925 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-27 11:51:28 +02:00
LocalAI [bot]
932360bf7e
chore: ⬆️ Update ggml-org/llama.cpp to 11dd5a44eb180e1d69fac24d3852b5222d66fb7f ( #5921 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-27 09:50:56 +02:00
LocalAI [bot]
6d0b52843f
chore: ⬆️ Update ggml-org/whisper.cpp to e7bf0294ec9099b5fc21f5ba969805dfb2108cea ( #5922 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-27 09:42:28 +02:00
LocalAI [bot]
078c22f485
docs: ⬆️ update docs version mudler/LocalAI ( #5920 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-26 20:58:54 +00:00
Ettore Di Giacinto
6ef3852de5
chore(docs): fixup tag
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-26 21:25:07 +02:00
Ettore Di Giacinto
a8057b952c
fix(cuda): be consistent with image tag naming ( #5916 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-26 08:30:59 +02:00
Ettore Di Giacinto
fd5c1d916f
chore(docs): add documentation on backend detection override ( #5915 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-26 08:18:31 +02:00
LocalAI [bot]
5ce982b9c9
chore: ⬆️ Update ggml-org/llama.cpp to c7f3169cd523140a288095f2d79befb20a0b73f4 ( #5913 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-25 23:08:20 +02:00
Ettore Di Giacinto
47ccfccf7a
fix(ci): add nvidia-l4t capability to l4t images ( #5914 )
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-07-25 22:45:09 +02:00
LocalAI [bot]
a760f7ff39
docs: ⬆️ update docs version mudler/LocalAI ( #5912 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-25 22:15:16 +02:00
Ettore Di Giacinto
facf7625f3
fix(vulkan): use correct image suffix ( #5911 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-25 19:20:20 +02:00
Ettore Di Giacinto
b3600b3c50
feat(backend gallery): add mirrors ( #5910 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-25 19:20:08 +02:00
Ettore Di Giacinto
f0b47cfe6a
fix(backends gallery): trim string when reading cap from file ( #5909 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-25 18:10:02 +02:00
Ettore Di Giacinto
ee625fc34e
fix(backends gallery): pass-by backend galleries to the model service ( #5906 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-25 16:38:09 +02:00
Ettore Di Giacinto
693aa0b5de
Update README.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-07-25 11:51:23 +02:00
Ettore Di Giacinto
3973e6e5da
fix(install.sh): update to use the new binary naming ( #5903 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-25 10:43:22 +02:00
LocalAI [bot]
fb6ec68090
chore: ⬆️ Update ggml-org/whisper.cpp to 7de8dd783f7b2eab56bff6bbc5d3369e34f0e77f ( #5902 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-25 08:40:24 +02:00
LocalAI [bot]
0301fc7c46
chore: ⬆️ Update leejet/stable-diffusion.cpp to eed97a5e1d054f9c1e7ac01982ae480411d4157e ( #5901 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-25 08:40:06 +02:00
LocalAI [bot]
813cb4296d
chore: ⬆️ Update ggml-org/llama.cpp to 3f4fc97f1d745f1d5d3c853949503136d419e6de ( #5900 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-25 08:39:44 +02:00
Ettore Di Giacinto
deda3a4972
Update build documentation
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-24 22:53:08 +02:00
Ettore Di Giacinto
a28f27604a
Update backends.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-07-24 16:18:25 +02:00
Richard Palethorpe
8fe9fa98f2
fix(stablediffusion-cpp): Switch back to upstream and update ( #5880 )
...
* sync(stablediffusion-cpp): Switch back to upstream and update
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(stablediffusion-ggml): NULL terminate options array to prevent segfault
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(build): Add BUILD_TYPE and BASE_IMAGE to all backends
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-07-24 16:03:18 +02:00
Nathaniel Hyson
4db1b80278
Update quickstart.md ( #5898 )
...
Fixed spelling mistake
Signed-off-by: Nathaniel Hyson <Shinrai@users.noreply.github.com >
2025-07-24 15:04:02 +02:00
Dave
b3c2a3c257
fix: untangle pkg and core ( #5896 )
...
* migrate core/system to pkg/system - it has no dependencies FROM core, and IS USED in pkg
Signed-off-by: Dave Lee <dave@gray101.com >
* move pkg/templates up to core/templates -- nothing in pkg references it, but it does reference core.
Signed-off-by: Dave Lee <dave@gray101.com >
* remove extra check, len of nil is 0
Signed-off-by: Dave Lee <dave@gray101.com >
* move pkg/startup to core/startup -- it does have important and unfixable dependencies on core
Signed-off-by: Dave Lee <dave@gray101.com >
---------
Signed-off-by: Dave Lee <dave@gray101.com >
2025-07-24 15:03:41 +02:00
LocalAI [bot]
61c2304638
chore: ⬆️ Update ggml-org/llama.cpp to a86f52b2859dae4db5a7a0bbc0f1ad9de6b43ec6 ( #5894 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-24 15:02:37 +02:00
Ettore Di Giacinto
92c5ab97e2
chore(Makefile): drop unused targets ( #5893 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-24 14:49:50 +02:00
LocalAI [bot]
76e471441c
chore: ⬆️ Update richiejp/stable-diffusion.cpp to 10c6501bd05a697e014f1bee3a84e5664290c489 ( #5732 )
...
⬆️ Update richiejp/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-23 21:09:02 +00:00
Dave
9cecf5e7ac
fix: rename Dockerfile.go --> Dockerfile.golang to avoid IDE errors ( #5892 )
...
extract up and out Dockerfile.go --> Dockerfile.golang rename. Prevents syntax highlighting and IDE errors
Signed-off-by: Dave Lee <dave@gray101.com >
2025-07-23 21:33:26 +02:00
Ettore Di Giacinto
b7b3164736
chore: try to speedup build
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 21:21:23 +02:00
Ettore Di Giacinto
5f7ece3e94
fix(p2p): adapt to backend changes, general improvements ( #5889 )
...
The binary is now named "llama-cpp-rpc-server" for p2p workers.
We also decrease the default token rotation interval, in this way
peer discovery is much more responsive.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 12:40:32 +02:00
Ettore Di Giacinto
c717b8d800
chore(model gallery): add qwen3-coder-480b-a35b-instruct ( #5888 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 11:59:58 +02:00
Ettore Di Giacinto
f1d35c4149
chore(model gallery): add qwen3-235b-a22b-instruct-2507 ( #5887 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 11:54:58 +02:00
Ettore Di Giacinto
ee7e77b6c1
chore(model gallery): add menlo_lucy ( #5886 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 11:51:51 +02:00
Ettore Di Giacinto
324fecbb75
chore(model gallery): add entfane_math-genius-7b ( #5885 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 11:45:23 +02:00
Ettore Di Giacinto
a79bfcf0a7
chore(model gallery): add dream-org_dream-v0-instruct-7b ( #5884 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 11:40:53 +02:00
Ettore Di Giacinto
82495e7fb6
chore(model gallery): add omega-qwen3-atom-8b ( #5883 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 11:33:43 +02:00
Ettore Di Giacinto
6030b12283
chore(backend gallery): add name to 'diffusers' meta
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 09:21:04 +02:00
LocalAI [bot]
b5be867e28
chore: ⬆️ Update ggml-org/llama.cpp to acd6cb1c41676f6bbb25c2a76fa5abeb1719301e ( #5882 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-22 21:12:06 +00:00
Ettore Di Giacinto
9b806250d4
chore: drop vllm for cuda 11 ( #5881 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-22 18:47:31 +02:00
Ettore Di Giacinto
5f066e702f
fix(darwin): add dashes on image suffix
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-22 17:08:19 +02:00
dependabot[bot]
47bb3a3db2
chore(deps): bump securego/gosec from 2.22.5 to 2.22.7 ( #5878 )
...
Bumps [securego/gosec](https://github.com/securego/gosec ) from 2.22.5 to 2.22.7.
- [Release notes](https://github.com/securego/gosec/releases )
- [Changelog](https://github.com/securego/gosec/blob/master/.goreleaser.yml )
- [Commits](https://github.com/securego/gosec/compare/v2.22.5...v2.22.7 )
---
updated-dependencies:
- dependency-name: securego/gosec
dependency-version: 2.22.7
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-07-22 16:42:11 +02:00
Richard Palethorpe
51230a801e
fix(build): Add and update ONEAPI_VERSION ( #5874 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-07-22 16:41:49 +02:00
Richard Palethorpe
754bedc3ea
fix(realtime): Reset speech started flag on commit ( #5879 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-07-22 16:41:12 +02:00
Ettore Di Giacinto
98e5291afc
feat: refactor build process, drop embedded backends ( #5875 )
...
* feat: split remaining backends and drop embedded backends
- Drop silero-vad, huggingface, and stores backend from embedded
binaries
- Refactor Makefile and Dockerfile to avoid building grpc backends
- Drop golang code that was used to embed backends
- Simplify building by using goreleaser
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(gallery): be specific with llama-cpp backend templates
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(docs): update
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(ci): minor fixes
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore: drop all ffmpeg references
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: run protogen-go
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Always enable p2p mode
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Update gorelease file
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(stores): do not always load
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fix linting issues
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Simplify
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Mac OS fixup
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-22 16:31:04 +02:00
LocalAI [bot]
e29b2c3aff
chore: ⬆️ Update ggml-org/llama.cpp to 6c9ee3b17e19dcc82ab93d52ae46fdd0226d4777 ( #5877 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-22 08:25:43 +02:00
LocalAI [bot]
8dc574f3c4
chore: ⬆️ Update ggml-org/whisper.cpp to 1f5cf0b2888402d57bb17b2029b2caa97e5f3baf ( #5876 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-22 08:25:13 +02:00
Ettore Di Giacinto
05bf2493a5
fix: do not pass by environ to ffmpeg ( #5871 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-21 14:35:33 +02:00
Max Goltzsche
eae4ca08da
feat(openai): support input_audio chat api field ( #5870 )
...
Improving the chat completion endpoint OpenAI API compatibility by supporting messages of type `input_audio`, e.g.:
```
{
...
"messages": [
{
"role": "user",
"content": [{
"type": "input_audio",
"input_audio": {
"data": "<base64-encoded audio data>",
"format": "wav"
}
}]
}
]
}
```
Closes #5869
Signed-off-by: Max Goltzsche <max.goltzsche@gmail.com >
2025-07-21 09:15:55 +02:00
LocalAI [bot]
fa284f7445
chore: ⬆️ Update ggml-org/llama.cpp to 2be60cbc2707359241c2784f9d2e30d8fc7cdabb ( #5867 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-21 09:14:09 +02:00
Ettore Di Giacinto
8f69b80520
Update index.yaml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-07-20 22:54:12 +02:00
Ettore Di Giacinto
b1fc5acd4a
feat: split whisper from main binary ( #5863 )
...
* feat: split whisper from main binary
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Cleanup makefile
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add backend builds (missing only darwin)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Test CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add whisper backend to test runs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Make sure we have runtime libs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Less grpc on the main Dockerfile
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fix hipblas build
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add whisper to index
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Re-enable CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Adapt auto-bumper
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-20 22:52:45 +02:00
LocalAI [bot]
fab41c29dd
chore(model-gallery): ⬆️ update checksum ( #5865 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-20 20:37:43 +02:00
Ettore Di Giacinto
fb0ec96396
ci: do not upgrade pip
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-20 12:30:12 +02:00
LocalAI [bot]
7659461036
chore: ⬆️ Update ggml-org/llama.cpp to a979ca22db0d737af1e548a73291193655c6be99 ( #5862 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-20 08:43:36 +02:00
Ettore Di Giacinto
580687da46
feat: remove stablediffusion-ggml from main binary ( #5861 )
...
* feat: split stablediffusion-ggml from main binary
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Test CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Adapt ci tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Try to support nvidial4t
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Latest fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-19 21:58:53 +02:00
LocalAI [bot]
1929eb2894
chore: ⬆️ Update ggml-org/llama.cpp to bf9087f59aab940cf312b85a67067ce33d9e365a ( #5860 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-19 08:52:07 +02:00
Ettore Di Giacinto
b29544d747
feat: split piper from main binary ( #5858 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-19 08:31:33 +02:00
Ettore Di Giacinto
7c30e82647
fix: autoload backends when installing models from YAML files ( #5859 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-18 21:31:02 +02:00
Dedy F. Setyawan
a1d061c835
fix(docs): Resolve logo overlap on tablet view ( #5853 )
...
* fix(docs): Resolve logo overlap on tablet view
Signed-off-by: Dedy F. Setyawan <dedyfajars@gmail.com >
* fix(docs): Adjust header logo size
Signed-off-by: Dedy F. Setyawan <dedyfajars@gmail.com >
* refactor(docs): Rework header logo sizing implementation
Signed-off-by: Dedy F. Setyawan <dedyfajars@gmail.com >
---------
Signed-off-by: Dedy F. Setyawan <dedyfajars@gmail.com >
2025-07-18 15:55:44 +02:00
Sijia Lu
851c67019c
fix: dockerfile typo ( #5823 )
...
fix dockerfile typo
Signed-off-by: LeonSijiaLu <leonsijialu1@gmail.com >
2025-07-18 14:59:33 +02:00
Ettore Di Giacinto
53ed5ef189
Makefile fixup
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-18 14:57:14 +02:00
Ettore Di Giacinto
294f7022f3
feat: do not bundle llama-cpp anymore ( #5790 )
...
* Build llama.cpp separately
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* WIP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* WIP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* WIP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Start to try to attach some tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add git and small fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: correctly autoload external backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Try to run AIO tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Slightly update the Makefile helps
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Adapt auto-bumper
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Try to run linux test
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add llama-cpp into build pipelines
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add default capability (for cpu)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Drop llama-cpp specific logic from the backend loader
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* drop grpc install in ci for tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Pass by backends path for tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Build protogen at start
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(tests): set backends path consistently
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Correctly configure the backends path
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Try to build for darwin
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* WIP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Compile for metal on arm64/darwin
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Try to run build off from cross-arch
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add to the backend index nvidia-l4t and cpu's llama-cpp backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Build also darwin-x86 for llama-cpp
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Disable arm64 builds temporary
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Test backend build on PR
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixup build backend reusable workflow
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* pass by skip drivers
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Use crane
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Skip drivers
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* x86 darwin
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add packaging step for llama.cpp
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fix leftover from bark-cpp extraction
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Try to fix hipblas build
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-18 13:24:12 +02:00
Richard Palethorpe
932f6b01a6
feat(realtime): Add speech started and stopped events ( #5856 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-07-18 09:22:23 +02:00
LocalAI [bot]
e96452c5d4
chore: ⬆️ Update ggml-org/llama.cpp to d6fb3f6b49b27ef1c0f4cf5128e041f7e7dc03af ( #5857 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-17 22:56:40 +00:00
Ettore Di Giacinto
5fc8d5bb78
fix: explorer page should not have login ( #5855 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-17 10:54:03 +02:00
LocalAI [bot]
121937ed6f
chore: ⬆️ Update ggml-org/llama.cpp to 496957e1cbcb522abc63aa18521036e40efce985 ( #5854 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-16 22:52:10 +00:00
LocalAI [bot]
2e38f2a054
chore: ⬆️ Update ggml-org/llama.cpp to 4a4f426944e79b79e389f9ed7b34831cb9b637ad ( #5852 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-15 22:56:04 +00:00
LocalAI [bot]
2a6187bc01
chore: ⬆️ Update ggml-org/llama.cpp to bdca38376f7e8dd928defe01ce6a16218a64b040 ( #5850 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-15 08:16:01 +02:00
LocalAI [bot]
584c48df5a
chore: ⬆️ Update ggml-org/whisper.cpp to 032697b9a850dc2615555e2a93a683cc3dd58559 ( #5849 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-14 22:47:30 +00:00
Ettore Di Giacinto
8dd67748a1
chore(model gallery): add sophosympatheia_strawberrylemonade-70b-v1.1 ( #5848 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-14 15:38:41 +02:00
Ettore Di Giacinto
3fd0bf3c88
chore(model gallery): add zhi-create-qwen3-32b-i1 ( #5847 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-14 15:33:40 +02:00
LocalAI [bot]
4062a6c404
chore: ⬆️ Update ggml-org/llama.cpp to 982e347255723fe6d02e60ee30cfdd0559c884c5 ( #5845 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-14 08:21:54 +02:00
Ettore Di Giacinto
354c0b763e
feat(cli): add command to create custom OCI images from directories ( #5844 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-14 08:21:29 +02:00
LocalAI [bot]
40f9065367
chore: ⬆️ Update ggml-org/whisper.cpp to a16da91365700f396da916d16a7f5a2ec99364b9 ( #5846 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-13 22:46:03 +00:00
Ettore Di Giacinto
fc02bc0aba
chore(model gallery): add google_medgemma-27b-it ( #5843 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-13 18:20:21 +02:00
Ettore Di Giacinto
45badb75e8
chore(model gallery): add google_medgemma-4b-it ( #5842 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-13 17:56:44 +02:00
LocalAI [bot]
d7e1922582
chore: ⬆️ Update ggml-org/whisper.cpp to 3775c503d5133d3d8b99d7d062e87a54064b0eb8 ( #5841 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-13 08:49:15 +02:00
LocalAI [bot]
642a39afa0
chore: ⬆️ Update ggml-org/llama.cpp to c31e60647def83d671bac5ab5b35579bf25d9aa1 ( #5840 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-12 22:44:45 +00:00
Ettore Di Giacinto
34d9deaf39
chore(model gallery): add impish_magic_24b-i1 ( #5839 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-12 19:37:10 +02:00
Ettore Di Giacinto
ef37a73e1b
chore(model gallery): add mistral-2x24b-moe-power-coder-magistral-devstral-reasoning-ultimate-neo-max-44b ( #5838 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-12 19:27:46 +02:00
Ettore Di Giacinto
37de945ae8
chore(model gallery): add nvidia_llama-3_3-nemotron-super-49b-genrm-multilingual ( #5837 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-12 19:21:40 +02:00
LocalAI [bot]
468f1f4539
chore: ⬆️ Update ggml-org/llama.cpp to f5e96b368f1acc7f53c390001b936517c4d18999 ( #5835 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-11 22:46:25 +00:00
Ettore Di Giacinto
0640451368
chore(model gallery): add mistralai_devstral-small-2507 ( #5834 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-11 11:51:11 +02:00
Ettore Di Giacinto
99058511cc
chore(model gallery): add huihui-ai_huihui-gemma-3n-e4b-it-abliterated ( #5833 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-11 11:42:01 +02:00
Ettore Di Giacinto
ec293b3b59
chore(model gallery): add microsoft_nextcoder-32b ( #5832 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-11 11:36:11 +02:00
LocalAI [bot]
9b1b6df8e9
chore: ⬆️ Update ggml-org/llama.cpp to 0b8855775c6b873931d40b77a5e42558aacbde52 ( #5830 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-10 22:48:03 +00:00
Ettore Di Giacinto
cd7fbafcd2
chore(model gallery): add thedrummer_tiger-gemma-12b-v3 ( #5827 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-10 14:19:41 +02:00
Ettore Di Giacinto
e5125216cf
chore(model gallery): add thedrummer_big-tiger-gemma-27b-v3 ( #5826 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-10 14:17:34 +02:00
Ettore Di Giacinto
2105f82433
chore(model gallery): add delta-vector_plesio-70b ( #5825 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-10 14:13:14 +02:00
Ettore Di Giacinto
49c0c7881a
chore(model gallery): add huggingfacetb_smollm3-3b ( #5820 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-09 18:27:52 +02:00
Ettore Di Giacinto
f8829376d8
chore(model gallery): add zerofata_l3.3-geneticlemonade-opus-70b ( #5819 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-09 18:23:01 +02:00
Ettore Di Giacinto
0475f63675
chore(model gallery): add lyranovaheart_starfallen-snow-fantasy-24b-ms3.2-v0.0 ( #5818 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-09 18:20:16 +02:00
Ettore Di Giacinto
ec206cc67c
feat(cli): allow to install backends from OCI tar files ( #5816 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-09 18:19:51 +02:00
LocalAI [bot]
34171fcf94
chore: ⬆️ Update ggml-org/llama.cpp to 6efcd65945a98cf6883cdd9de4c8ccd8c79d219a ( #5817 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-08 22:46:10 +00:00
LocalAI [bot]
238c334aa7
chore: ⬆️ Update ggml-org/whisper.cpp to 869335f2d58d04010535be9ae23a69a9da12a169 ( #5809 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-08 17:46:08 +02:00
Ettore Di Giacinto
d2df0a1769
chore(model gallery): add qwen3-8b-shiningvaliant3 ( #5815 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-08 13:17:43 +02:00
Ettore Di Giacinto
d58647ac31
chore(model gallery): add ockerman0_anubislemonade-70b-v1.1 ( #5814 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-08 13:14:46 +02:00
Ettore Di Giacinto
c1d3ce9a93
chore(model gallery): add cognitivecomputations_dolphin-mistral-24b-venice-edition ( #5813 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-08 13:09:29 +02:00
Richard Palethorpe
c1dd4ff5d5
feat(whisper): Enable SYCL ( #5802 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-07-08 12:54:20 +02:00
LocalAI [bot]
48118b9582
chore: ⬆️ Update ggml-org/llama.cpp to 12f55c302b35cfe900b84c5fe67c262026af9c44 ( #5808 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-07 22:50:16 +00:00
Ettore Di Giacinto
ceda2e69db
chore(model gallery): add huihui-jan-nano-abliterated ( #5806 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-07 11:35:39 +02:00
Ettore Di Giacinto
cea1703acc
chore(model gallery): add zonui-3b-i1 ( #5805 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-07 11:32:58 +02:00
Ettore Di Giacinto
33fc9b9922
chore(model gallery): add mini-hydra ( #5804 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-07 11:27:42 +02:00
Ettore Di Giacinto
b783997c52
chore(model gallery): add compumacy-experimental-32b ( #5803 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-07 11:21:44 +02:00
LocalAI [bot]
f6ec06d21c
chore: ⬆️ Update ggml-org/llama.cpp to 6491d6e4f1caf0ad2221865b4249ae6938a6308c ( #5801 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-06 22:45:50 +00:00
Ettore Di Giacinto
7e1f2657d5
Update GPU-acceleration.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-07-06 19:03:34 +02:00
Ettore Di Giacinto
9589097252
chore(model gallery): add nano_imp_1b-q8_0 ( #5800 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-06 18:58:56 +02:00
Ettore Di Giacinto
cb87d331a9
chore(model gallery): add sicariussicariistuff_impish_llama_4b ( #5799 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-06 18:57:18 +02:00
Ettore Di Giacinto
6dfc96249a
Update README.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-07-06 18:07:36 +02:00
LocalAI [bot]
a2564ed654
chore: ⬆️ Update ggml-org/llama.cpp to a0374a67e2924f2e845cdc59dd67d9a44065a89c ( #5798 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-05 22:48:28 +00:00
LocalAI [bot]
6c747caa34
chore: ⬆️ Update ggml-org/llama.cpp to ef797db357e44ecb7437fa9d22f4e1614104b342 ( #5795 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-04 22:46:51 +00:00
Ettore Di Giacinto
8ae5e0feb9
chore(model gallery): add ockerman0_anubislemonade-70b-v1 ( #5794 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-04 18:43:35 +02:00
Ettore Di Giacinto
c35dd0a7b8
chore(model gallery): add zerofata_ms3.2-paintedfantasy-visage-33b ( #5793 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-04 18:42:01 +02:00
Ettore Di Giacinto
2f5af6b246
chore(model gallery): add agentica-org_deepswe-preview ( #5792 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-04 18:39:36 +02:00
Ettore Di Giacinto
00cf2e0e0a
chore(model gallery): add helpingai_dhanishtha-2.0-preview ( #5791 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-04 18:38:18 +02:00
LocalAI [bot]
c7a1d9c089
chore: ⬆️ Update ggml-org/llama.cpp to bee28421be25fd447f61cb6db64d556cbfce32ec ( #5788 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-04 08:46:56 +02:00
LocalAI [bot]
ad7ba52166
chore: ⬆️ Update PABannier/bark.cpp to 5d5be84f089ab9ea53b7a793f088d3fbf7247495 ( #4786 )
...
⬆️ Update PABannier/bark.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-03 22:44:53 +00:00
Ettore Di Giacinto
c5b9f45166
chore(cli): add backends CLI to manipulate and install backends ( #5787 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-03 19:31:27 +02:00
Ettore Di Giacinto
61b64a65ab
chore(bark-cpp): generalize and move to bark-cpp ( #5786 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-03 19:31:10 +02:00
Ettore Di Giacinto
8276952920
feat(system): detect and allow to override capabilities ( #5785 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-03 19:30:52 +02:00
Ettore Di Giacinto
b7cd5bfaec
feat(backends): add metas in the gallery ( #5784 )
...
* chore(backends): add metas in the gallery
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore: correctly handle aliases and metas with same names
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-03 18:01:55 +02:00
LocalAI [bot]
da4312e4d3
chore: ⬆️ Update ggml-org/llama.cpp to e75ba4c0434eb759eb7ff74e034ebe729053e575 ( #5783 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-03 10:01:17 +02:00
LocalAI [bot]
7d507c54ed
chore: ⬆️ Update ggml-org/whisper.cpp to d9999d54c868b8bfcd376aa26067e787d53e679e ( #5782 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-03 09:57:36 +02:00
LocalAI [bot]
df7ed49889
docs: ⬆️ update docs version mudler/LocalAI ( #5781 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-02 22:45:21 +00:00
Ettore Di Giacinto
bfdc29d316
fix(gallery): correctly show status for downloading OCI images ( #5774 )
...
We can't use the mutate.Extract written bytes as current status as that
will be bigger than the compressed image size. Image manifest don't have
any guarantee of the type of artifact (can be compressed or not) when
showing the layer size.
Split the extraction process in two parts: Downloading and extracting as
a flattened system, in this way we can display the status of downloading
and extracting accordingly.
This change also fixes a small nuance in detecting installed backends,
now it's more consistent and looks if a metadata.json and/or a path with
a `run.sh` file is present.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-02 08:25:48 +02:00
LocalAI [bot]
7fdc006071
chore: ⬆️ Update ggml-org/llama.cpp to de569441470332ff922c23fb0413cc957be75b25 ( #5777 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-02 08:25:29 +02:00
LocalAI [bot]
615830245b
chore: ⬆️ Update ggml-org/whisper.cpp to bca021c9740b267c2973fba56555be052006023a ( #5776 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-02 08:24:58 +02:00
LocalAI [bot]
61376c0fa7
docs: ⬆️ update docs version mudler/LocalAI ( #5775 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-01 22:45:24 +00:00
Ettore Di Giacinto
d0fb23514f
Revert "fix(gallery): correctly show status for downloading OCI images"
...
This reverts commit 780d034ac9 .
2025-07-01 21:32:04 +02:00
Ettore Di Giacinto
780d034ac9
fix(gallery): correctly show status for downloading OCI images
...
We can't use the mutate.Extract written bytes as current status as that
will be bigger than the compressed image size. Image manifest don't have
any guarantee of the type of artifact (can be compressed or not) when
showing the layer size.
Split the extraction process in two parts: Downloading and extracting as
a flattened system, in this way we can display the status of downloading
and extracting accordingly.
This change also fixes a small nuance in detecting installed backends,
now it's more consistent and looks if a metadata.json and/or a path with
a `run.sh` file is present.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-01 19:56:28 +02:00
Ettore Di Giacinto
ec2a044c7e
chore(model gallery): add pinkpixel_crystal-think-v2 ( #5773 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-01 16:18:19 +02:00
Ettore Di Giacinto
ad6fdd21fd
chore(model gallery): add steelskull_l3.3-shakudo-70b ( #5772 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-01 16:15:22 +02:00
Ettore Di Giacinto
cd94e6b352
chore(model gallery): add thedrummer_anubis-70b-v1.1 ( #5771 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-01 13:54:29 +02:00
Richard Palethorpe
b37cef3718
fix: Diffusers and XPU fixes ( #5737 )
...
* fix(README): Add device flags for Intel/XPU
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(diffusers/xpu): Set device to XPU and ignore CUDA request when on Intel
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-07-01 12:36:17 +02:00
Dedy F. Setyawan
9f957d547d
fix(docs): Improve Header Responsiveness - Hide "Star us on GitHub!" on Mobile ( #5770 )
2025-07-01 12:15:16 +02:00
LocalAI [bot]
f0d9f0c5d8
chore: ⬆️ Update ggml-org/llama.cpp to 0a5a3b5cdfd887cf0f8e09d9ff89dee130cfcdde ( #5759 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-30 22:46:55 +00:00
LocalAI [bot]
d33e1c72a3
chore: ⬆️ Update ggml-org/llama.cpp to caf5681fcb47dfe9bafee94ef9aa8f669ac986c7 ( #5758 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-29 22:49:47 +00:00
Ettore Di Giacinto
33f9ee06c9
fix(gallery): automatically install model from name ( #5757 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-29 17:42:58 +02:00
Ettore Di Giacinto
c54677402d
chore(model gallery): add qwen3-33b-a3b-stranger-thoughts-abliterated-uncensored ( #5755 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-29 10:04:33 +02:00
LocalAI [bot]
3fe3a7b23d
chore: ⬆️ Update ggml-org/llama.cpp to 27208bf657cfe7262791df473927225e48efe482 ( #5753 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-29 09:06:37 +02:00
LocalAI [bot]
f8ff6fa1fd
docs: ⬆️ update docs version mudler/LocalAI ( #5752 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-28 22:17:49 +02:00
Ettore Di Giacinto
dfadc3696e
feat(llama.cpp): allow to set kv-overrides ( #5745 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-28 21:26:07 +02:00
Ettore Di Giacinto
dbcf5fb4fc
chore(model gallery): add gemma-3-4b-it-max-horror-uncensored-dbl-x-imatrix ( #5751 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-28 18:18:14 +02:00
Ettore Di Giacinto
2633137a17
chore(model gallery): add qwen3-22b-a3b-the-harley-quinn ( #5750 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-28 18:17:10 +02:00
Ettore Di Giacinto
d9c17dd23b
chore(model gallery): add mistral-small-3.2-46b-the-brilliant-raconteur-ii-instruct-2506 ( #5749 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-28 18:15:25 +02:00
Ettore Di Giacinto
d8b7bd4860
chore(model gallery): add qwen3-42b-a3b-stranger-thoughts-deep20x-abliterated-uncensored-i1 ( #5748 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-28 18:12:17 +02:00
Ettore Di Giacinto
a611cbc0f4
chore(model gallery): add qwen3-55b-a3b-total-recall-deep-40x ( #5747 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-28 17:54:32 +02:00
Ettore Di Giacinto
850b525159
chore(model gallery): add qwen3-55b-a3b-total-recall-v1.3-i1 ( #5746 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-28 17:47:46 +02:00
Ettore Di Giacinto
35b3426a2a
Update README.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-28 09:16:25 +02:00
LocalAI [bot]
cd2b0c0e7c
chore: ⬆️ Update ggml-org/llama.cpp to 72babea5dea56c8a8e8420ccf731b12a5cf37854 ( #5743 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-27 23:46:27 +02:00
LocalAI [bot]
73d80c43a8
chore: ⬆️ Update ggml-org/whisper.cpp to c88ffbf9baeaae8c2cc0a4f496618314bb2ee9e0 ( #5742 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-27 23:45:57 +02:00
LocalAI [bot]
665562b850
docs: ⬆️ update docs version mudler/LocalAI ( #5741 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-27 22:23:43 +02:00
Ettore Di Giacinto
7a78e4f482
fix(backends gallery): meta packages do not have URIs ( #5740 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-27 22:23:14 +02:00
Ettore Di Giacinto
6f41a6f934
fix(backends gallery): correctly identify gpu vendor ( #5739 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-27 22:22:58 +02:00
Ettore Di Giacinto
bb54f2da2b
feat(gallery): automatically install missing backends along models ( #5736 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-27 18:25:44 +02:00
Ettore Di Giacinto
e1cc7ee107
fix(ci): enable tag-latest to auto ( #5738 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-27 18:17:01 +02:00
Ettore Di Giacinto
cfc9dfa3d5
fix(ci): better handling of latest images for backends ( #5735 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-27 10:32:58 +02:00
LocalAI [bot]
6a650e68cb
chore: ⬆️ Update ggml-org/whisper.cpp to 32cf4e2aba799aff069011f37ca025401433cf9f ( #5733 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-26 22:09:26 +02:00
LocalAI [bot]
5e1373877a
chore: ⬆️ Update ggml-org/llama.cpp to 8846aace4934ad29651ea61b8c7e3f6b0556e3d2 ( #5734 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-26 22:09:03 +02:00
Ettore Di Giacinto
b5b0ab26e7
fix(ci): remove non-existant input from build matrix
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-26 21:42:27 +02:00
Ettore Di Giacinto
9725bb4bbd
chore(model gallery): add gemma-3n-e4b-it ( #5731 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-26 19:36:50 +02:00
Ettore Di Giacinto
33b4275bbc
chore(model gallery): add gemma-3n-e2b-it ( #5730 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-26 19:35:49 +02:00
Ettore Di Giacinto
6644af10c6
feat: ⚠️ reduce images size and stop bundling sources ( #5721 )
...
feat: reduce images size and stop bundling sources
Do not copy sources anymore, and reduce packages of the base images by
not using builder images.
If needed to rebuild, just build the container image from scratch by
following the docs. We will slowly try to migrate all backends to the
gallery to keep the core small.
This PR is a breaking change, it also sets the base folders to /models
and /backends instead of /build/models and /build/backends.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-26 18:41:38 +02:00
Ettore Di Giacinto
7c4a2e9b85
chore(ci): ⚠️ fix latest tag by using docker meta action ( #5722 )
...
chore(ci): fix latest tag by using docker meta action
Also uniform tagging names
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-26 18:40:25 +02:00
Ettore Di Giacinto
bcccee3909
fix(backends gallery): delete dangling dirs if installation failed ( #5729 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-26 17:38:03 +02:00
Ettore Di Giacinto
c6f50ddd0c
Revert "chore: ⬆️ Update leejet/stable-diffusion.cpp to 10c6501bd05a697e014f1bee3a84e5664290c489" ( #5727 )
...
Revert "chore: ⬆️ Update leejet/stable-diffusion.cpp to `10c6501bd05a…"
This reverts commit 30600dd5cb .
2025-06-26 13:25:25 +02:00
LocalAI [bot]
6613373b1b
chore: ⬆️ Update ggml-org/whisper.cpp to 4daf7050ca2bf17f5166f45ac6da651c4e33f293 ( #5725 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-26 13:00:20 +02:00
LocalAI [bot]
1659b3f795
chore: ⬆️ Update ggml-org/llama.cpp to 2bf9d539dd158345e3a3b096e16474af535265b4 ( #5724 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-26 12:59:57 +02:00
LocalAI [bot]
30600dd5cb
chore: ⬆️ Update leejet/stable-diffusion.cpp to 10c6501bd05a697e014f1bee3a84e5664290c489 ( #4925 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-25 22:45:15 +00:00
Ettore Di Giacinto
179fcf5541
chore(model gallery): add menlo_jan-nano-128k ( #5723 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-25 12:14:11 +02:00
LocalAI [bot]
9cb75086bb
chore: ⬆️ Update ggml-org/whisper.cpp to 0083335ba0e9d6becbe0958903b0a27fc2ebaeed ( #5718 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-25 09:47:33 +02:00
LocalAI [bot]
594bb462ab
chore: ⬆️ Update ggml-org/llama.cpp to 73e53dc834c0a2336cd104473af6897197b96277 ( #5719 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-24 22:47:48 +00:00
Ettore Di Giacinto
aa730a7b96
chore(model gallery): add delta-vector_austral-24b-winton ( #5717 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-24 18:37:28 +02:00
Ettore Di Giacinto
0a454c527a
chore(model gallery): add astrosage-70b ( #5716 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-24 18:34:37 +02:00
Ettore Di Giacinto
cf86bcb984
chore(model gallery): add skywork_skywork-swe-32b ( #5715 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-24 18:29:36 +02:00
Ettore Di Giacinto
a6d9988e84
feat(backend gallery): add meta packages ( #5696 )
...
* feat(backend gallery): add meta packages
So we can have meta packages such as "vllm" that automatically installs
the corresponding package depending on the GPU that is being currently
detected in the system.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat: use a metadata file
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-24 17:08:27 +02:00
Ettore Di Giacinto
f3a114342e
chore(model gallery): add mistralai_mistral-small-3.2-24b-instruct-2506 ( #5714 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-24 13:59:14 +02:00
LocalAI [bot]
0d275ccc03
chore: ⬆️ Update ggml-org/llama.cpp to ce82bd0117bd3598300b3a089d13d401b90279c7 ( #5712 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-24 08:37:32 +02:00
LocalAI [bot]
58dba3f01c
chore: ⬆️ Update ggml-org/whisper.cpp to a422176937c5bb20eb58d969995765f90d3c1a9b ( #5713 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-23 22:45:31 +00:00
kilavvy
b68d6e8088
Docs: Fix typos ( #5709 )
...
* Update GPU-acceleration.md
Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com >
* Update image-generation.md
Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com >
---------
Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com >
2025-06-23 18:15:06 +02:00
LocalAI [bot]
2352cec7e6
chore: ⬆️ Update ggml-org/llama.cpp to 238005c2dc67426cf678baa2d54c881701693288 ( #5710 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-22 22:43:41 +00:00
Ettore Di Giacinto
de72ae79b5
chore(model gallery): add ds-r1-qwen3-8b-arliai-rpr-v4-small-iq-imatrix ( #5708 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-22 09:05:55 +02:00
Ettore Di Giacinto
884c07d5f9
chore(model gallery): add allura-org_q3-8b-kintsugi ( #5707 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-22 09:04:49 +02:00
Ettore Di Giacinto
cca7cbef1e
chore(model gallery): add qwen3-the-xiaolong-omega-directive-22b-uncensored-abliterated-i1 ( #5706 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-22 09:01:08 +02:00
Ettore Di Giacinto
32cd0d03d4
chore(model gallery): add menlo_jan-nano ( #5705 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-22 08:57:33 +02:00
Ettore Di Giacinto
ee4d9e83d0
Update stalebot.yml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-22 08:51:13 +02:00
LocalAI [bot]
5547e08a30
chore: ⬆️ Update ggml-org/llama.cpp to aa0ef5c578eef4c2adc7be1282f21bab5f3e8d26 ( #5703 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-21 23:54:53 +02:00
LocalAI [bot]
ca7385c303
chore: ⬆️ Update ggml-org/whisper.cpp to e6c10cf3d5d60dc647eb6cd5e73d3c347149f746 ( #5702 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-21 23:54:28 +02:00
Ettore Di Giacinto
28759e79d3
chore(model gallery): add qwen3-the-josiefied-omega-directive-22b-uncensored-abliterated-i1 ( #5704 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-21 23:54:05 +02:00
Ettore Di Giacinto
40249b6b84
Update stalebot.yml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-21 22:38:23 +02:00
Ettore Di Giacinto
e09e47bada
chore(ci): add stale bot ( #5700 )
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-21 20:12:08 +02:00
Ettore Di Giacinto
3796558aeb
Update quickstart.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-21 20:11:57 +02:00
LocalAI [bot]
cca4f010f8
chore: ⬆️ Update ggml-org/llama.cpp to 06cbedfca1587473df9b537f1dd4d6bfa2e3de13 ( #5697 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-20 22:44:39 +00:00
Ettore Di Giacinto
be3ff482d0
chore(ci): try to optimize disk space when tagging latest ( #5695 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-20 15:54:14 +02:00
LocalAI [bot]
af255cd0be
chore: ⬆️ Update ggml-org/llama.cpp to 8f71d0f3e86ccbba059350058af8758cafed73e6 ( #5692 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-20 15:53:55 +02:00
LocalAI [bot]
8000228d1b
chore: ⬆️ Update ggml-org/whisper.cpp to 3e65f518ddf840b13b74794158aa95a2c8aa30cc ( #5691 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-20 15:53:38 +02:00
Ettore Di Giacinto
79abe0ad77
Drop latest references to extras images
2025-06-20 15:51:16 +02:00
Ettore Di Giacinto
8131d11d1f
Update quickstart.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-19 22:42:38 +02:00
LocalAI [bot]
beb01c91f3
docs: ⬆️ update docs version mudler/LocalAI ( #5690 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-19 22:13:16 +02:00
Ettore Di Giacinto
1ccd64ff6a
chore: drop extras references from docs
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-19 22:04:28 +02:00
Ettore Di Giacinto
fc7681c68c
Update README.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-19 21:46:09 +02:00
Ettore Di Giacinto
49d026a229
Update backends.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-19 19:47:09 +02:00
leopardracer
f9b968e19d
Fix Typos and Improve Clarity in GPU Acceleration Documentation ( #5688 )
...
Update GPU-acceleration.md
Signed-off-by: leopardracer <136604165+leopardracer@users.noreply.github.com >
2025-06-19 15:41:13 +02:00
LocalAI [bot]
022d4a5ecb
chore: ⬆️ Update ggml-org/whisper.cpp to ecb8f3c2b4e282d5ef416516bcbfb92821f06bf6 ( #5686 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-19 08:23:42 +02:00
LocalAI [bot]
0e917eb01d
chore: ⬆️ Update ggml-org/llama.cpp to 8d947136546773f6410756f37fcc5d3e65b8135d ( #5685 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-19 08:23:23 +02:00
Ettore Di Giacinto
efde0eaf83
feat(backend gallery): display download progress ( #5687 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-18 23:49:44 +02:00
Maxim Evtush
add8fc35a2
Fix Typos in Documentation and Python Comments ( #5658 )
...
* Update istftnet.py
Signed-off-by: Maxim Evtush <154841002+maximevtush@users.noreply.github.com >
* Update GPU-acceleration.md
Signed-off-by: Maxim Evtush <154841002+maximevtush@users.noreply.github.com >
---------
Signed-off-by: Maxim Evtush <154841002+maximevtush@users.noreply.github.com >
2025-06-18 22:11:13 +02:00
Ettore Di Giacinto
9bcf4c56f1
fix(backends gallery): propagate p2p settings to correctly draw menu ( #5684 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-18 22:06:12 +02:00
Ettore Di Giacinto
3fcfaec7c8
chore(ci): move also other jobs to public runner ( #5683 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-18 22:00:12 +02:00
Ettore Di Giacinto
a463d40a3e
chore(ci): try to use public runners also for release builds ( #5681 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-18 21:51:54 +02:00
Ettore Di Giacinto
1e1f0ee321
chore(backends): move bark-cpp to the backend gallery ( #5682 )
...
chore(bark-cpp): move outside from binary
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-18 19:48:50 +02:00
Ettore Di Giacinto
80b3139fa0
Update landing.yaml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-18 19:48:17 +02:00
LocalAI [bot]
5173d37acb
chore: ⬆️ Update ggml-org/llama.cpp to 860a9e4eeff3eb2e7bd1cc38f65787cc6c8177af ( #5678 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-18 10:01:46 +02:00
LocalAI [bot]
470e48a900
chore: ⬆️ Update ggml-org/whisper.cpp to f3ff80ea8da044e5b8833e7ba54ee174504c518d ( #5677 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-18 10:01:08 +02:00
Ettore Di Giacinto
b706dddc93
chore(ci): switch to public runners for base images ( #5680 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-17 22:38:50 +02:00
Ettore Di Giacinto
867db3f888
chore(docs): add backend url
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-17 22:35:21 +02:00
Ettore Di Giacinto
b79aa31398
chore: move backends docs
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-17 22:26:40 +02:00
Ettore Di Giacinto
fb9a09d49c
chore(backend gallery): add description for remaining backends ( #5679 )
...
* chore(backend gallery): add description for remaining backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(backend gallery): add linter
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-17 22:21:44 +02:00
Ettore Di Giacinto
0a78f0ad2d
chore(backend gallery): re-order and add description for vLLM ( #5676 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-17 17:31:53 +02:00
Ettore Di Giacinto
d68660bd5a
chore(deps): bump llama.cpp to 'e434e69183fd9e1031f4445002083178c331a28b ( #5665 )
...
chore(deps): bump llama.cpp to 'e434e69183fd9e1031f4445002083178c331a28b'
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-17 17:00:10 +02:00
LocalAI [bot]
30ceee2dec
chore: ⬆️ Update ggml-org/whisper.cpp to 2a4d6db7d90899aff3d58d70996916968e4e0d27 ( #5661 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-17 09:21:05 +02:00
dependabot[bot]
18c38335fc
chore(deps): bump securego/gosec from 2.22.4 to 2.22.5 ( #5663 )
...
Bumps [securego/gosec](https://github.com/securego/gosec ) from 2.22.4 to 2.22.5.
- [Release notes](https://github.com/securego/gosec/releases )
- [Changelog](https://github.com/securego/gosec/blob/master/.goreleaser.yml )
- [Commits](https://github.com/securego/gosec/compare/v2.22.4...v2.22.5 )
---
updated-dependencies:
- dependency-name: securego/gosec
dependency-version: 2.22.5
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-06-16 23:12:27 +00:00
Ettore Di Giacinto
89040ff6f7
fix: add python symlink, use absolute python env path when running backends ( #5664 )
...
* fix: add python symlink, use absolute python env path when running backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(ci): do not push images when building PRs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-16 23:00:53 +02:00
Ettore Di Giacinto
de343700fd
Don't run python_backend workflow on PR
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-16 11:06:56 +02:00
Ettore Di Giacinto
87d18ad951
chore: Add python3 to images ( #5660 )
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-16 11:05:44 +02:00
Ettore Di Giacinto
912c8eff04
chore(ci): use public runner for extra backends ( #5657 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-16 08:21:18 +02:00
LocalAI [bot]
481f30bde8
chore: ⬆️ Update ggml-org/llama.cpp to 30e5b01de2a0bcddc7c063c8ef0802703a958417 ( #5659 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-15 23:03:40 +00:00
Ettore Di Giacinto
236ac30252
chore(ci): do not specify image-type anymore
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-15 17:28:40 +02:00
Ettore Di Giacinto
6f761e62e4
update README
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-15 16:06:43 +02:00
FT
1f29b5f38e
Fix Typos and Improve Documentation Clarity ( #5648 )
...
* Update p2p.go
Signed-off-by: FT <140458077+zeevick10@users.noreply.github.com >
* Update GPU-acceleration.md
Signed-off-by: FT <140458077+zeevick10@users.noreply.github.com >
---------
Signed-off-by: FT <140458077+zeevick10@users.noreply.github.com >
2025-06-15 16:04:44 +02:00
LocalAI [bot]
33d702c5e0
chore: ⬆️ Update ggml-org/llama.cpp to 3cb203c89f60483e349f841684173446ed23c28f ( #5644 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-15 16:03:13 +02:00
Ettore Di Giacinto
95ff236127
ci: do not fire python_backend on PRs
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-15 16:02:30 +02:00
Ettore Di Giacinto
2d64269763
feat: Add backend gallery ( #5607 )
...
* feat: Add backend gallery
This PR add support to manage backends as similar to models. There is
now available a backend gallery which can be used to install and remove
extra backends.
The backend gallery can be configured similarly as a model gallery, and
API calls allows to install and remove new backends in runtime, and as
well during the startup phase of LocalAI.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add backends docs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* wip: Backend Dockerfile for python backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat: drop extras images, build python backends separately
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixup on all backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* test CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Tweaks
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Drop old backends leftovers
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixup CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Move dockerfile upper
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fix proto
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Feature dropped for consistency - we prefer model galleries
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add missing packages in the build image
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* exllama is ponly available on cublas
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* pin torch on chatterbox
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups to index
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Debug CI
* Install accellerators deps
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add target arch
* Add cuda minor version
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Use self-hosted runners
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* ci: use quay for test images
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixups for vllm and chatterbox
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Small fixups on CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chatterbox is only available for nvidia
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Simplify CI builds
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Adapt test, use qwen3
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(model gallery): add jina-reranker-v1-tiny-en-gguf
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(gguf-parser): recover from potential panics that can happen while reading ggufs with gguf-parser
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Use reranker from llama.cpp in AIO images
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Limit concurrent jobs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-15 14:56:52 +02:00
LocalAI [bot]
a7a6020328
chore: ⬆️ Update ggml-org/whisper.cpp to 705db0f728310c32bc96f4e355e2b18076932f75 ( #5643 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-15 08:39:00 +02:00
Ettore Di Giacinto
40618164b2
chore: improve tests ( #5646 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-14 10:07:05 +02:00
fuder.eth
eb8c29f90a
Minor Documentation Updates: Clarified Comments in Python and Go Files ( #5641 )
...
* Update ui.go
Signed-off-by: fuder.eth <139509124+vtjl10@users.noreply.github.com >
* Update backend.py
Signed-off-by: fuder.eth <139509124+vtjl10@users.noreply.github.com >
---------
Signed-off-by: fuder.eth <139509124+vtjl10@users.noreply.github.com >
2025-06-13 19:55:25 +02:00
Gavin Mogan
63116a2c6a
docs: Update docs metadata headers so when mentioned on slack it doesn't say hugo ( #5642 )
...
Update docs metadata headers so when mentioned on slack it doesn't say hugo
Signed-off-by: Gavin Mogan <github@gavinmogan.com >
2025-06-13 19:54:57 +02:00
LocalAI [bot]
311c2cf539
chore: ⬆️ Update ggml-org/llama.cpp to ed52f3668e633423054a4eab61bb7efee47025ab ( #5636 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-12 23:33:33 +02:00
Ettore Di Giacinto
a6fcbd991d
chore(model gallery): add yanfei-v2-qwen3-32b ( #5639 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-12 22:24:13 +02:00
kilavvy
2e1dc8deef
Fix Typos in Comments and Error Messages ( #5637 )
...
* Update initializers.go
Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com >
* Update base.go
Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com >
---------
Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com >
2025-06-12 18:34:32 +02:00
LocalAI [bot]
282e017b22
chore: ⬆️ Update ggml-org/whisper.cpp to ebbc874e85b518f963a87612f6d79f5c71a55e84 ( #5635 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-11 23:47:00 +02:00
Ettore Di Giacinto
f86cb8be2d
chore(model gallery): add qwen3-embedding-0.6b ( #5634 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:40:41 +02:00
Ettore Di Giacinto
5c56ec4f87
chore(model gallery): add qwen3-embedding-8b ( #5633 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:38:44 +02:00
Ettore Di Giacinto
dd2845a034
chore(model gallery): add qwen3-embedding-4b ( #5632 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:31:43 +02:00
Ettore Di Giacinto
2e7db014b6
chore(model gallery): add openbuddy_openbuddy-r1-0528-distill-qwen3-32b-preview0-qat ( #5631 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:27:30 +02:00
Ettore Di Giacinto
6faeee1d92
chore(model gallery): add baai_robobrain2.0-7b ( #5630 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:17:32 +02:00
Ettore Di Giacinto
31d73eb934
chore(model gallery): add mistralai_magistral-small-2506 ( #5629 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:11:44 +02:00
Ettore Di Giacinto
60863b9e52
chore(model gallery): add sophosympatheia_strawberrylemonade-l3-70b-v1.0 ( #5628 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:08:17 +02:00
Ettore Di Giacinto
a9fc71e2f3
chore(model gallery): add kwaipilot_kwaicoder-autothink-preview ( #5627 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:06:38 +02:00
leopardracer
ce9a9a30e0
Improve Comments and Documentation for MixedMode and ParseJSON Functions ( #5626 )
...
Update parse.go
Signed-off-by: leopardracer <136604165+leopardracer@users.noreply.github.com >
2025-06-11 09:46:53 +02:00
LocalAI [bot]
2693a21da5
chore: ⬆️ Update ggml-org/whisper.cpp to 2679bec6e09231c6fd59715fcba3eebc9e2f6076 ( #5625 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-11 09:35:28 +02:00
LocalAI [bot]
d460eab18e
chore: ⬆️ Update ggml-org/llama.cpp to 3678b838bb71eaccbaeb479ff38c2e12bfd2f960 ( #5620 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-11 09:00:39 +02:00
LocalAI [bot]
c61e5fe266
chore: ⬆️ Update ggml-org/whisper.cpp to d78f08142381c1460604713e2f2ddf3331c7d816 ( #5619 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-10 17:29:58 +02:00
Ettore Di Giacinto
88e570b5de
fix(deps): pin grpcio ( #5621 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-10 14:21:51 +02:00
Ettore Di Giacinto
6efa97ce0b
chore(model gallery): add qwen2.5-omni-3b ( #5606 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-09 10:54:42 +02:00
LocalAI [bot]
41cde5468a
chore: ⬆️ Update ggml-org/llama.cpp to 247e5c6e447707bb4539bdf1913d206088a8fc69 ( #5605 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-09 00:11:46 +02:00
Richard Palethorpe
d650647db9
fix(realtime): Use updated model on session update ( #5604 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-06-09 00:11:05 +02:00
LocalAI [bot]
5bc7ef37a2
chore: ⬆️ Update ggml-org/llama.cpp to 5787b5da57e54dba760c2deeac1edf892e8fc450 ( #5601 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-08 08:44:24 +02:00
Ettore Di Giacinto
e0a52807c8
chore(model gallery): add akhil-theerthala_kuvera-8b-v0.1.0 ( #5600 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-07 08:59:20 +02:00
LocalAI [bot]
1a95a19f87
chore: ⬆️ Update ggml-org/llama.cpp to 745aa5319b9930068aff5e87cf5e9eef7227339b ( #5598 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-07 08:59:05 +02:00
LocalAI [bot]
bcfc08e5bf
chore: ⬆️ Update ggml-org/whisper.cpp to b175baa665bc35f97a2ca774174f07dfffb84e19 ( #5597 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-07 08:57:52 +02:00
Ettore Di Giacinto
4d282ca963
chore(model gallery): add nbeerbower_qwen3-gutenberg-encore-14b ( #5596 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-06 10:20:48 +02:00
Ettore Di Giacinto
525f49b69d
chore(model gallery): add open-thoughts_openthinker3-7b ( #5595 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-06 10:14:00 +02:00
LocalAI [bot]
786aa1de05
chore: ⬆️ Update ggml-org/llama.cpp to 1caae7fc6c77551cb1066515e0f414713eebb367 ( #5593 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-06 00:10:02 +02:00
Ettore Di Giacinto
ea82deb16b
chore(model gallery): add ultravox-v0_5-llama-3_1-8b ( #5592 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-05 19:23:51 +02:00
Ettore Di Giacinto
b0891309ba
chore(model gallery): add ultravox-v0_5-llama-3_2-1b ( #5591 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-05 19:22:01 +02:00
Ettore Di Giacinto
b034cff149
feat: improve RAM estimation by using values from summary ( #5525 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-05 19:16:26 +02:00
Ettore Di Giacinto
432f34f001
chore(model gallery): add goekdeniz-guelmez_josiefied-qwen3-14b-abliterated-v3 ( #5590 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-05 19:16:04 +02:00
Gavin Mogan
cbd61dccd4
fix(install.sh): vulkan docker tag ( #5589 )
...
vulkan docker tag is not prefixed with gpu
```
regctl tag ls localai/localai | grep 2.29 | grep vulkan
v2.29.0-vulkan
```
Signed-off-by: Gavin Mogan <github@gavinmogan.com >
2025-06-05 08:12:16 +02:00
LocalAI [bot]
0de0817d71
chore: ⬆️ Update ggml-org/whisper.cpp to 799eacdde40b3c562cfce1508da1354b90567f8f ( #5586 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-05 08:11:38 +02:00
LocalAI [bot]
bf57d6e5ac
chore: ⬆️ Update ggml-org/llama.cpp to 0d3984424f2973c49c4bcabe4cc0153b4f90c601 ( #5585 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-05 08:11:12 +02:00
Ettore Di Giacinto
0b9603e010
chore(model gallery): add deepseek-ai_deepseek-r1-0528-qwen3-8b ( #5580 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-04 15:28:45 +02:00
Ettore Di Giacinto
8d925217f6
chore(model gallery): add e-n-v-y_legion-v2.1-llama-70b-elarablated-v0.8-hf ( #5579 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-04 11:12:37 +02:00
Ettore Di Giacinto
669a1ccae6
chore(model gallery): add nvidia_nemotron-research-reasoning-qwen-1.5b ( #5578 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-04 11:07:10 +02:00
Ettore Di Giacinto
7a7d36ad63
chore(model gallery): add arcee-ai_homunculus ( #5577 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-04 10:02:15 +02:00
Ettore Di Giacinto
8b889955b4
chore(deps): bump pytorch to 2.7 in vllm ( #5576 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-04 08:56:45 +02:00
dependabot[bot]
a226555949
chore(deps): bump GrantBirki/git-diff-action from 2.8.0 to 2.8.1 ( #5564 )
...
Bumps [GrantBirki/git-diff-action](https://github.com/grantbirki/git-diff-action ) from 2.8.0 to 2.8.1.
- [Release notes](https://github.com/grantbirki/git-diff-action/releases )
- [Commits](https://github.com/grantbirki/git-diff-action/compare/v2.8.0...v2.8.1 )
---
updated-dependencies:
- dependency-name: GrantBirki/git-diff-action
dependency-version: 2.8.1
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-06-04 08:41:47 +02:00
LocalAI [bot]
f38f17865a
chore: ⬆️ Update ggml-org/whisper.cpp to 82f461eaa4e6a1ba29fc0dbdaa415a9934ee8a1d ( #5575 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-04 08:41:26 +02:00
LocalAI [bot]
03f380701b
chore: ⬆️ Update ggml-org/llama.cpp to 7e00e60ef86645a01fda738fef85b74afa016a34 ( #5574 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-04 08:26:36 +02:00
Ettore Di Giacinto
65e2866c97
fix(chatterbox): install only with cuda 12 ( #5573 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-03 14:57:47 +02:00
Ettore Di Giacinto
cd3cd899ad
chore(deps): bump llama.cpp to '363757628848a27a435bbf22ff9476e9aeda5f40' ( #5571 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-03 12:19:16 +02:00
LocalAI [bot]
c2ae3100e7
chore: ⬆️ Update ggml-org/whisper.cpp to e05af2457b7b4134ee626dc044294a19b096e62f ( #5569 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-03 11:29:18 +02:00
Ettore Di Giacinto
ec0868e691
chore(deps): bump grpcio from 1.72.0 to 1.72.1 ( #5570 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-03 09:59:43 +02:00
Ettore Di Giacinto
489c289916
Revert "fix(ci): try to add different mirrors to avoid 403 issues" ( #5555 )
...
Revert "fix(ci): try to add different mirrors to avoid 403 issues (#5554 )"
This reverts commit 7c9f011d91 .
2025-06-02 08:46:29 +02:00
LocalAI [bot]
ac5fb50bcc
chore: ⬆️ Update ggml-org/whisper.cpp to 7fd6fa809749078aa00edf945e959c898f2bd1af ( #5556 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-02 08:45:47 +02:00
Ettore Di Giacinto
7c9f011d91
fix(ci): try to add different mirrors to avoid 403 issues ( #5554 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-01 08:48:53 +02:00
Ettore Di Giacinto
80f7f17843
chore(deps): bump llama.cpp to 'e562eece7cb476276bfc4cbb18deb7c0369b2233' ( #5552 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-31 12:46:32 +02:00
LocalAI [bot]
f0c41d6405
chore: ⬆️ Update ggml-org/whisper.cpp to 98dfe8dc264b7d0d1daccfff9a9c043bcc2ece4b ( #5542 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-31 08:51:15 +02:00
Ettore Di Giacinto
8472321a81
feat(ui): display thinking tags appropriately ( #5540 )
...
* fix(streaming): stream complete runes
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat(ui): display thinking tags separately
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-05-31 08:50:46 +02:00
Ettore Di Giacinto
3bac4724ac
fix(streaming): stream complete runes ( #5539 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-31 08:48:05 +02:00
Ettore Di Giacinto
59db154cbc
feat(ui): allow to upload PDF and text files, also add support to multiple input files ( #5538 )
...
* Support file inputs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat: support multiple files
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* show preview of files
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-31 08:47:48 +02:00
Ettore Di Giacinto
1cc4525f15
fix: adapt test to error changes
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-30 17:43:59 +02:00
Ettore Di Giacinto
45c58752e5
feat(ui): add audio upload button in chat view ( #5526 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-30 16:47:31 +02:00
Ettore Di Giacinto
d5c9c717b5
feat(chatterbox): add new backend ( #5524 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-30 10:52:55 +02:00
Ettore Di Giacinto
dd7fa6b9f7
chore(deps): bump llama.cpp to 'e83ba3e460651b20a594e9f2f0f0bffb998d3ce1 ( #5527 )
...
chore(deps): bump llama.cpp to 'e83ba3e460651b20a594e9f2f0f0bffb998d3ce1'
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-30 10:29:01 +02:00
LocalAI [bot]
039c318607
chore: ⬆️ Update ggml-org/whisper.cpp to e5e900dd00747f747143ad30a697c8f21ddcd59e ( #5522 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-30 08:34:52 +02:00
Ettore Di Giacinto
0870bf5af6
fix(input): handle correctly case where we pass by string list as inputs ( #5521 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-29 22:06:42 +02:00
Ettore Di Giacinto
6073b9944e
chore(model gallery): add moondream2-20250414 ( #5518 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-29 10:47:11 +02:00
LocalAI [bot]
ef0e0f3777
chore: ⬆️ Update ggml-org/whisper.cpp to 1f5fdbecb411a61b8576242e5170c5ecef24b05a ( #5515 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-29 09:45:23 +02:00
LocalAI [bot]
b7de9e0aa0
chore: ⬆️ Update ggml-org/llama.cpp to d98f2a35fcf4a8d3e660ad48cd19e2a1f3d5b2ef ( #5514 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-29 09:44:36 +02:00
Ettore Di Giacinto
39292407a1
chore(model gallery): add pku-ds-lab_fairyr1-32b ( #5517 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-29 09:43:45 +02:00
Ettore Di Giacinto
f257bf8d14
chore(model gallery): add pku-ds-lab_fairyr1-14b-preview ( #5516 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-29 09:37:08 +02:00
Ettore Di Giacinto
8ca2fb5ef1
chore(model gallery): add qwen2.5-omni-7b ( #5513 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-28 18:15:09 +02:00
LocalAI [bot]
3a790fed13
chore: ⬆️ Update ggml-org/whisper.cpp to 0ed00d9d30e8c984936ff9ed9a4fcd475d6d82e5 ( #5510 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-28 09:00:22 +02:00
LocalAI [bot]
a334f28a07
chore: ⬆️ Update ggml-org/llama.cpp to a3c30846e410c91c11d7bf80978795a03bb03dee ( #5509 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-28 01:39:38 +00:00
Ettore Di Giacinto
dc6663d121
fix(template): we do not always have .Name ( #5508 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-27 18:44:24 +02:00
LocalAI [bot]
103caf9823
chore: ⬆️ Update ggml-org/llama.cpp to a26c4cc11ec7c6574e3691e90ecdbd67deeea35b ( #5500 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-27 17:13:55 +02:00
Ettore Di Giacinto
4226d2d837
Update index.yaml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-05-27 10:24:37 +02:00
Ettore Di Giacinto
7434256fc9
chore(model gallery): add ms-24b-mullein-v0 ( #5506 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-27 10:14:52 +02:00
Ettore Di Giacinto
86a0563ae1
chore(model gallery): add llama3-24b-mullein-v1 ( #5505 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-27 10:13:40 +02:00
Ettore Di Giacinto
c68951cbfe
chore(model gallery): add mrm8488_qwen3-14b-ft-limo ( #5504 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-27 10:04:16 +02:00
Ettore Di Giacinto
8408084120
chore(model gallery): add luckyrp-24b ( #5503 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-27 10:02:25 +02:00
Ettore Di Giacinto
0f2f4c7e23
chore(model gallery): add allura-org_q3-30b-a3b-designant ( #5502 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-27 09:59:56 +02:00
Ettore Di Giacinto
5ffad3b004
chore(deps): remove pin on transformers ( #5501 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-27 09:24:27 +02:00
Ettore Di Giacinto
e5ccd97b8c
Update README.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-05-26 20:00:31 +02:00
LocalAI [bot]
a3b08d46ec
chore: ⬆️ Update ggml-org/whisper.cpp to ea9f206f18d86c4eb357db9fdc52e4d9dc24435e ( #5464 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-05-26 19:56:44 +02:00
Ettore Di Giacinto
090f5065fc
chore(deps): bump llama.cpp to 'fef693dc6b959a8e8ba11558fbeaad0b264dd457' ( #5467 )
...
Also try to use a smaller model for integration tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-26 17:19:46 +02:00
Ettore Di Giacinto
88de2ea01a
feat(llama.cpp): add support for audio input ( #5466 )
...
* feat(llama.cpp): add support for audio input
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Adapt tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-26 16:06:03 +02:00
Ettore Di Giacinto
9650d490d4
chore(model gallery): add nvidia_acereason-nemotron-14b ( #5463 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-26 10:08:10 +02:00
Ettore Di Giacinto
4de1c83764
chore(model gallery): add allura-org_q3-30b-a3b-pentiment ( #5462 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-26 09:46:44 +02:00
Ettore Di Giacinto
e5978dc714
chore(model gallery): add medgemma-27b-text-it ( #5461 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-26 09:44:13 +02:00
Ettore Di Giacinto
f784986e19
chore(model gallery): add medgemma-4b-it ( #5460 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-26 09:41:09 +02:00
Richard Palethorpe
bf6426aef2
feat: Realtime API support reboot ( #5392 )
...
* feat(realtime): Initial Realtime API implementation
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore: go mod tidy
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* feat: Implement transcription only mode for realtime API
Reduce the scope of the real time API for the initial realease and make
transcription only mode functional.
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* chore(build): Build backends on a separate layer to speed up core only changes
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Signed-off-by: Richard Palethorpe <io@richiejp.com >
Co-authored-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-25 22:25:05 +02:00
LocalAI [bot]
4a91950848
chore: ⬆️ Update ggml-org/llama.cpp to d13d0f6135803822ec1cd7e3efb49360b88a1bdf ( #5448 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-24 08:50:41 +02:00
LocalAI [bot]
4614ea1685
chore: ⬆️ Update ggml-org/whisper.cpp to 13d92d08ae26031545921243256aaaf0ee057943 ( #5449 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-23 23:44:06 +00:00
Ettore Di Giacinto
f0bf59d1d9
chore(model gallery): add vulpecula-4b ( #5445 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-23 09:51:21 +02:00
Ettore Di Giacinto
83dd678959
chore(model gallery): add whiterabbitneo_whiterabbitneo-v3-7b ( #5444 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-23 09:46:28 +02:00
Ettore Di Giacinto
9d6c9f874a
chore(model gallery): add arliai_qwq-32b-arliai-rpr-v4 ( #5443 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-23 09:34:33 +02:00
LocalAI [bot]
c62f2bb336
chore: ⬆️ Update ggml-org/llama.cpp to 8a1d206f1d2b4e45918b589f3165b4be232f7ba8 ( #5440 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-23 09:22:38 +02:00
LocalAI [bot]
38aeca6f9c
chore: ⬆️ Update ggml-org/whisper.cpp to 78b31ca7824500e429ba026c1a9b48e0b41c50cb ( #5439 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-23 06:22:41 +00:00
Ettore Di Giacinto
3b0cf52f6a
feat(llama.cpp): add reranking ( #5396 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-22 21:49:30 +02:00
LocalAI [bot]
bac3022044
chore: ⬆️ Update ggml-org/whisper.cpp to bd1cb0c8e3a04baa411dc12c1325b6a9f12ee7f4 ( #5424 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-05-22 21:49:06 +02:00
LocalAI [bot]
cd41701524
chore: ⬆️ Update ggml-org/llama.cpp to 8e186ef0e764c7a620e402d1f76ebad60bf31c49 ( #5423 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-05-22 21:48:51 +02:00
Ettore Di Giacinto
6a382a1afe
fix(transformers): try to pin to working release ( #5426 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-22 12:50:51 +02:00
Ettore Di Giacinto
8dcab2f9c7
chore(scripts): allow to specify quants ( #5430 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-22 11:53:30 +02:00
Ettore Di Giacinto
1d1d5627f0
chore(model gallery): add delta-vector_archaeo-12b-v2 ( #5429 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-22 11:38:48 +02:00
Ettore Di Giacinto
233b3369ad
chore(model gallery): add mistralai_devstral-small-2505 ( #5428 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-22 11:37:17 +02:00
Ettore Di Giacinto
c587ac0aef
chore(model gallery): add nvidia_llama-3.1-nemotron-nano-4b-v1.1 ( #5427 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-22 11:33:33 +02:00
David Thole
38c5d16b57
feat(docs): updating the documentation on fine tuning and advanced guide. ( #5420 )
...
updating the documentation on fine tuning and advanced guide. This mirrors how modern version of llama.cpp operate
2025-05-21 19:11:00 +02:00
LocalAI [bot]
ef6fc052eb
chore: ⬆️ Update ggml-org/llama.cpp to b7a17463ec190aeee7b9077c606c910fb4688b84 ( #5399 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-21 09:06:09 +02:00
LocalAI [bot]
7ff35c08ac
chore: ⬆️ Update ggml-org/whisper.cpp to 62dc8f7d7b72ca8e75c57cd6a100712c631fa5d5 ( #5398 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-21 09:00:42 +02:00
LocalAI [bot]
43f75ee7f3
chore(model-gallery): ⬆️ update checksum ( #5422 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-21 03:52:39 +00:00
Ettore Di Giacinto
82811a9630
fix(transformers): pin protobuf ( #5421 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 20:28:31 +02:00
Ettore Di Giacinto
04a3d8e5ac
feat(ui): add error page to display errors ( #5418 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 12:17:27 +02:00
Ettore Di Giacinto
9af09b3f8c
chore(model gallery): fixup
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 12:17:21 +02:00
Ettore Di Giacinto
0d590a4044
chore(model gallery): add smolvlm2-256m-video-instruct ( #5417 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 12:03:02 +02:00
Ettore Di Giacinto
e0a54de4f5
chore(model gallery): add smolvlm2-500m-video-instruct ( #5416 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 11:42:30 +02:00
Ettore Di Giacinto
6bc2ae5467
chore(model gallery): add smolvlm2-2.2b-instruct ( #5415 )
...
chore(model gallery): add smolvlm-instruct
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 11:36:22 +02:00
Ettore Di Giacinto
8caaf49f5d
chore(model gallery): add smolvlm-instruct ( #5414 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 11:35:01 +02:00
Ettore Di Giacinto
1db51044bb
chore(model gallery): add smolvlm-500m-instruct ( #5413 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 11:25:32 +02:00
Ettore Di Giacinto
ec21b58008
chore(model gallery): add smolvlm-256m-instruct ( #5412 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 11:15:09 +02:00
Ettore Di Giacinto
996259b529
chore(model gallery): add facebook_kernelllm ( #5411 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 10:31:09 +02:00
Ettore Di Giacinto
f2942cc0e1
chore(model gallery): add thedrummer_valkyrie-49b-v1 ( #5410 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 10:28:27 +02:00
Ettore Di Giacinto
f8fbfd4fa3
chore(model gallery): add a-m-team_am-thinking-v1 ( #5395 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-19 17:31:38 +02:00
Ettore Di Giacinto
41e239c67e
chore(model gallery): add soob3123_grayline-qwen3-8b ( #5394 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-19 17:02:43 +02:00
Ettore Di Giacinto
587827e779
chore(model gallery): add soob3123_grayline-qwen3-14b ( #5393 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-19 15:59:07 +02:00
LocalAI [bot]
456b4982ef
chore: ⬆️ Update ggml-org/llama.cpp to 6a2bc8bfb7cd502e5ebc72e36c97a6f848c21c2c ( #5390 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-19 01:25:22 +00:00
Ettore Di Giacinto
159388cce8
chore: memoize detected GPUs ( #5385 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-18 08:55:44 +02:00
LocalAI [bot]
cfc73c7773
chore: ⬆️ Update ggml-org/llama.cpp to e3a7cf6c5bf6a0a24217f88607b06e4405a2b5d9 ( #5384 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-18 01:21:13 +00:00
Ettore Di Giacinto
6d5bde860b
feat(llama.cpp): upgrade and use libmtmd ( #5379 )
...
* WIP
* wip
* wip
* Make it compile
* Update json.hpp
* this shouldn't be private for now
* Add logs
* Reset auto detected template
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Re-enable grammars
* This seems to be broken - 360a9c98e1 (diff-a18a8e64e12a01167d8e98fc) […]cccf0d4eed09d76d879L2998-L3207
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Placeholder
* Simplify image loading
* use completion type
* disable streaming
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* correctly return timings
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Remove some debug logging
* Adapt tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Keep header
* embedding: do not use oai type
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Sync from server.cpp
* Use utils and json directly from llama.cpp
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Sync with upstream
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: copy json.hpp from the correct location
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: add httplib
* sync llama.cpp
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Embeddiongs: set OAICOMPAT_TYPE_EMBEDDING
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat: sync with server.cpp by including it
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* make it darwin-compatible
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-17 16:02:53 +02:00
LocalAI [bot]
6ef383033b
chore: ⬆️ Update ggml-org/whisper.cpp to d1f114da61b1ae1e70b03104fad42c9dd666feeb ( #5381 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-17 00:35:17 +00:00
Richard Palethorpe
cd494089d9
fix(flux): Set CFG=1 so that prompts are followed ( #5378 )
...
The recommendation with Flux is to set CFG to 1 as shown in the
stablediffusion-cpp README.
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-05-16 17:53:54 +02:00
LocalAI [bot]
3033845f94
chore: ⬆️ Update ggml-org/whisper.cpp to 20a20decd94badfd519a07ea91f0bba8b8fc4dea ( #5374 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-16 12:46:16 +02:00
omahs
0f365ac204
fix: typos ( #5376 )
...
Signed-off-by: omahs <73983677+omahs@users.noreply.github.com >
2025-05-16 12:45:48 +02:00
Ettore Di Giacinto
525cf198be
chore(model gallery): add primeintellect_intellect-2 ( #5373 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-15 10:53:52 +02:00
Ettore Di Giacinto
658c2a4f55
chore(model gallery): add thedrummer_rivermind-lux-12b-v1 ( #5372 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-15 10:51:55 +02:00
Ettore Di Giacinto
c987de090d
chore(model gallery): add thedrummer_snowpiercer-15b-v1 ( #5371 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-15 10:04:44 +02:00
Ettore Di Giacinto
04365843e6
chore(model gallery): add skywork_skywork-or1-7b ( #5370 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-15 10:02:07 +02:00
Ettore Di Giacinto
1dc5781679
chore(model gallery): add skywork_skywork-or1-32b ( #5369 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-15 09:58:51 +02:00
LocalAI [bot]
30704292de
chore: ⬆️ Update ggml-org/whisper.cpp to f389d7e3e56bbbfec49fd333551927a0fcbb7213 ( #5367 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-15 00:34:16 +00:00
Ettore Di Giacinto
e52c66c76e
chore(docs/install.sh): image changes ( #5354 )
...
chore(docs): image changes
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-14 19:28:30 +02:00
LocalAI [bot]
cb28aef93b
chore: ⬆️ Update ggml-org/whisper.cpp to f89056057511a1657af90bb28ef3f21e5b1f33cd ( #5364 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-14 09:24:16 +02:00
LocalAI [bot]
029f97c2a2
docs: ⬆️ update docs version mudler/LocalAI ( #5363 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-14 01:54:34 +00:00
Ettore Di Giacinto
3be71be696
fix(ci): tag latest against cpu-only image ( #5362 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-13 22:00:41 +02:00
LocalAI [bot]
6adb019f8f
chore: ⬆️ Update ggml-org/llama.cpp to de4c07f93783a1a96456a44dc16b9db538ee1618 ( #5358 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-13 22:00:19 +02:00
LocalAI [bot]
fcaa0a2f01
chore: ⬆️ Update ggml-org/whisper.cpp to e41bc5c61ae66af6be2bd7011769bb821a83e8ae ( #5357 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-13 21:59:50 +02:00
dependabot[bot]
fd17a3312c
chore(deps): bump securego/gosec from 2.22.3 to 2.22.4 ( #5356 )
...
Bumps [securego/gosec](https://github.com/securego/gosec ) from 2.22.3 to 2.22.4.
- [Release notes](https://github.com/securego/gosec/releases )
- [Changelog](https://github.com/securego/gosec/blob/master/.goreleaser.yml )
- [Commits](https://github.com/securego/gosec/compare/v2.22.3...v2.22.4 )
---
updated-dependencies:
- dependency-name: securego/gosec
dependency-version: 2.22.4
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-05-12 22:01:43 +02:00
dependabot[bot]
12d0fe610b
chore(deps): bump dependabot/fetch-metadata from 2.3.0 to 2.4.0 ( #5355 )
...
Bumps [dependabot/fetch-metadata](https://github.com/dependabot/fetch-metadata ) from 2.3.0 to 2.4.0.
- [Release notes](https://github.com/dependabot/fetch-metadata/releases )
- [Commits](https://github.com/dependabot/fetch-metadata/compare/v2.3.0...v2.4.0 )
---
updated-dependencies:
- dependency-name: dependabot/fetch-metadata
dependency-version: 2.4.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-05-12 22:01:19 +02:00
Ettore Di Giacinto
11c67d16b8
chore(ci): strip 'core' in the image suffix, identify python-based images with 'extras' ( #5353 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-12 09:36:59 +02:00
LocalAI [bot]
63f7c86c4d
chore: ⬆️ Update ggml-org/llama.cpp to 9a390c4829cd3058d26a2e2c09d16e3fd12bf1b1 ( #5351 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-12 09:24:54 +02:00
LocalAI [bot]
ac89bf77bf
chore: ⬆️ Update ggml-org/whisper.cpp to 2e310b841e0b4e7cf00890b53411dd9f8578f243 ( #4785 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-12 01:30:35 +00:00
Ettore Di Giacinto
0395cc02fb
chore(model gallery): add qwen_qwen2.5-vl-72b-instruct ( #5349 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-11 09:46:32 +02:00
Ettore Di Giacinto
616972fca0
chore(model gallery): add qwen_qwen2.5-vl-7b-instruct ( #5348 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-11 09:44:58 +02:00
Ettore Di Giacinto
942fbff62d
chore(model gallery): add gryphe_pantheon-proto-rp-1.8-30b-a3b ( #5347 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-11 09:39:28 +02:00
LocalAI [bot]
2612a0c910
chore: ⬆️ Update ggml-org/llama.cpp to 15e6125a397f6086c1dfdf7584acdb7c730313dc ( #5345 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-11 09:21:46 +02:00
LocalAI [bot]
2dcb6d7247
chore(model-gallery): ⬆️ update checksum ( #5346 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-10 22:24:04 +02:00
Ettore Di Giacinto
6978eec69f
feat(whisper.cpp): gpu support ( #5344 )
...
* fix(whisper.cpp): gpu support
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Try to fix apple tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-10 22:02:40 +02:00
LocalAI [bot]
2fcfe54466
chore: ⬆️ Update ggml-org/llama.cpp to 33eff4024084d1f0c8441b79f7208a52fad79858 ( #5343 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-10 10:07:39 +02:00
Ettore Di Giacinto
4e7506a3be
fix(whisper): add vulkan flag
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-05-10 08:46:21 +02:00
Ettore Di Giacinto
2a46217f90
Update Makefile
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-05-09 23:17:18 +02:00
Ettore Di Giacinto
31ff9dbd52
chore(Makefile): small cleanups, disable openmp on whisper
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-09 22:37:18 +02:00
Ettore Di Giacinto
9483abef03
fix(whisper/sycl): disable
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-09 22:36:09 +02:00
Ettore Di Giacinto
ce3e8b3e31
fix(whisper/sycl): use icx when running go build
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-09 21:48:09 +02:00
Ettore Di Giacinto
f3bb84c9a7
feat(whisper): link vulkan, hipblas and sycl
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-09 19:25:26 +02:00
Ettore Di Giacinto
ecb1297582
fix: specify icx and icpx only on whisper.cpp
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-09 10:58:30 +02:00
Ettore Di Giacinto
73fc702b3c
fix: this is not needed
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-09 10:28:53 +02:00
Ettore Di Giacinto
e3af62ae1a
feat: Add sycl support for whisper.cpp ( #5341 )
2025-05-09 09:31:02 +02:00
Ettore Di Giacinto
dc21604741
chore(deps): bump whisper.cpp ( #5338 )
...
* chore(deps): bump whisper.cpp
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* add libggml-metal
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups macOS arm64
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* adjust cublas for whisper.cpp
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-09 08:17:45 +02:00
LocalAI [bot]
5433f1a70e
chore: ⬆️ Update ggml-org/llama.cpp to f05a6d71a0f3dbf0730b56a1abbad41c0f42e63d ( #5340 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-08 23:13:28 +00:00
Ettore Di Giacinto
d5e032bdcd
chore(model gallery): add gemma-3-12b-fornaxv.2-qat-cot ( #5337 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-08 12:07:25 +02:00
Ettore Di Giacinto
de786f6586
chore(model gallery): add symiotic-14b-i1 ( #5336 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-08 12:03:35 +02:00
Ettore Di Giacinto
8b9bc4aa6e
chore(model gallery): add qwen3-14b-uncensored ( #5335 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-08 11:59:26 +02:00
Ettore Di Giacinto
e6cea7d28e
chore(model gallery): add cognition-ai_kevin-32b ( #5334 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-08 11:57:12 +02:00
Ettore Di Giacinto
7d7d56f2ce
chore(model gallery): add servicenow-ai_apriel-nemotron-15b-thinker ( #5333 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-08 11:55:35 +02:00
Ettore Di Giacinto
1caae91ab6
chore(model gallery): add qwen3-4b-esper3-i1 ( #5332 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-08 11:52:02 +02:00
LocalAI [bot]
e90f2cb0ca
chore: ⬆️ Update ggml-org/llama.cpp to 814f795e063c257f33b921eab4073484238a151a ( #5331 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-08 09:25:13 +02:00
Ettore Di Giacinto
5a4291fadd
docs: update README badges
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-05-07 22:20:06 +02:00
Ettore Di Giacinto
91ef58ee5a
chore(model gallery): add qwen3-14b-griffon-i1 ( #5330 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-07 11:07:38 +02:00
LocalAI [bot]
a86e8c78f1
chore: ⬆️ Update ggml-org/llama.cpp to 91a86a6f354aa73a7aab7bc3d283be410fdc93a5 ( #5329 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-06 23:39:10 +00:00
Ettore Di Giacinto
adb24214c6
chore(deps): bump llama.cpp to b34c859146630dff136943abc9852ca173a7c9d6 ( #5323 )
...
chore(deps): bump llama.cpp to 'b34c859146630dff136943abc9852ca173a7c9d6'
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-06 11:21:25 +02:00
Ettore Di Giacinto
f03a0430aa
chore(model gallery): add claria-14b ( #5326 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-06 10:48:03 +02:00
Ettore Di Giacinto
73bc12abc0
chore(model gallery): add goekdeniz-guelmez_josiefied-qwen3-8b-abliterated-v1 ( #5325 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-06 10:38:20 +02:00
Ettore Di Giacinto
7fa437bbcc
chore(model gallery): add huihui-ai_qwen3-14b-abliterated ( #5324 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-06 10:35:55 +02:00
LocalAI [bot]
4a27c99928
chore(model-gallery): ⬆️ update checksum ( #5321 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-06 10:01:28 +02:00
Ettore Di Giacinto
6ce94834b6
fix(hipblas): do not build all cpu-specific flags ( #5322 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-06 10:00:50 +02:00
dependabot[bot]
84a26458dc
chore(deps): bump mxschmitt/action-tmate from 3.21 to 3.22 ( #5319 )
...
Bumps [mxschmitt/action-tmate](https://github.com/mxschmitt/action-tmate ) from 3.21 to 3.22.
- [Release notes](https://github.com/mxschmitt/action-tmate/releases )
- [Changelog](https://github.com/mxschmitt/action-tmate/blob/master/RELEASE.md )
- [Commits](https://github.com/mxschmitt/action-tmate/compare/v3.21...v3.22 )
---
updated-dependencies:
- dependency-name: mxschmitt/action-tmate
dependency-version: '3.22'
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-05-05 22:17:59 +00:00
Ettore Di Giacinto
7aa377b6a9
fix(arm64): do not build instructions which are not available ( #5318 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-05 17:30:00 +02:00
Ettore Di Giacinto
64e66dda4a
chore(model gallery): add allura-org_remnant-qwen3-8b ( #5317 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-05 11:09:07 +02:00
LocalAI [bot]
a085f61fdc
chore: ⬆️ Update ggml-org/llama.cpp to 9fdfcdaeddd1ef57c6d041b89cd8fb7048a0f028 ( #5316 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-04 23:00:25 +00:00
Ettore Di Giacinto
21bdfe5fa4
fix: use rice when embedding large binaries ( #5309 )
...
* fix(embed): use go-rice for large backend assets
Golang embed FS has a hard limit that we might exceed when providing
many binary alternatives.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* simplify golang deps
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(tests): switch to testcontainers and print logs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(tests): do not build a test binary
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* small fixup
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-04 16:42:42 +02:00
Ettore Di Giacinto
7ebd7b2454
chore(model gallery): add rei-v3-kto-12b ( #5313 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-04 09:41:35 +02:00
Ettore Di Giacinto
6984749ea1
chore(model gallery): add kalomaze_qwen3-16b-a3b ( #5312 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-04 09:39:38 +02:00
Ettore Di Giacinto
c0a206bc7a
chore(model gallery): add qwen3-30b-a1.5b-high-speed ( #5311 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-04 09:38:01 +02:00
LocalAI [bot]
01bbb31fb3
chore: ⬆️ Update ggml-org/llama.cpp to 36667c8edcded08063ed51c7d57e9e086bbfc903 ( #5300 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-04 09:23:01 +02:00
Ettore Di Giacinto
72111c597d
fix(gpu): do not assume gpu being returned has node and mem ( #5310 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-03 19:00:24 +02:00
Ettore Di Giacinto
b2f9fc870b
chore(defaults): enlarge defaults, drop gpu layers which is infered ( #5308 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-03 18:44:51 +02:00
Ettore Di Giacinto
1fc6d469ac
chore(deps): bump llama.cpp to '1d36b3670b285e69e58b9d687c770a2a0a192194 ( #5307 )
...
chore(deps): bump llama.cpp to '1d36b3670b285e69e58b9d687c770a2a0a192194'
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-03 18:44:40 +02:00
Ettore Di Giacinto
05848b2027
chore(model gallery): add smoothie-qwen3-8b ( #5306 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-03 10:35:20 +02:00
Ettore Di Giacinto
1da0644aa3
chore(model gallery): add qwen-3-32b-medical-reasoning-i1 ( #5305 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-03 10:24:07 +02:00
Ettore Di Giacinto
c087cd1377
chore(model gallery): add amoral-qwen3-14b ( #5304 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-03 10:21:48 +02:00
Ettore Di Giacinto
c621412f6a
chore(model gallery): add comet_12b_v.5-i1 ( #5303 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-03 10:20:03 +02:00
Ettore Di Giacinto
5a8b1892cd
chore(model gallery): add genericrpv3-4b ( #5302 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-03 10:18:31 +02:00
Ettore Di Giacinto
5b20426863
chore(model gallery): add planetoid_27b_v.2 ( #5301 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-03 10:14:33 +02:00
Ettore Di Giacinto
5c6cd50ed6
feat(llama.cpp): estimate vram usage ( #5299 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-02 17:40:26 +02:00
Ettore Di Giacinto
bace6516f1
chore(model gallery): add webthinker-qwq-32b-i1 ( #5298 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-02 09:57:49 +02:00
Ettore Di Giacinto
3baadf6f27
chore(model gallery): add shuttleai_shuttle-3.5 ( #5297 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-02 09:48:11 +02:00
Ettore Di Giacinto
8804c701b8
chore(model gallery): add microsoft_phi-4-reasoning ( #5296 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-02 09:46:20 +02:00
Ettore Di Giacinto
7b3ceb19bb
chore(model gallery): add microsoft_phi-4-reasoning-plus ( #5295 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-02 09:43:38 +02:00
Ettore Di Giacinto
e7f3effea1
chore(model gallery): add furina-8b ( #5294 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-02 09:39:22 +02:00
Ettore Di Giacinto
61694a2ffb
chore(model gallery): add josiefied-qwen3-8b-abliterated-v1 ( #5293 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-02 09:36:35 +02:00
LocalAI [bot]
573a3f104c
chore: ⬆️ Update ggml-org/llama.cpp to d7a14c42a1883a34a6553cbfe30da1e1b84dfd6a ( #5292 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-02 09:21:38 +02:00
Ettore Di Giacinto
0e8af53a5b
chore: update quickstart
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-01 22:36:33 +02:00
Ettore Di Giacinto
960ffa808c
chore(model gallery): add microsoft_phi-4-mini-reasoning ( #5288 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-01 10:17:58 +02:00
Ettore Di Giacinto
92719568e5
chore(model gallery): add fast-math-qwen3-14b ( #5287 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-01 10:14:51 +02:00
Ettore Di Giacinto
163939af71
chore(model gallery): add qwen3-8b-jailbroken ( #5286 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-01 10:13:01 +02:00
Ettore Di Giacinto
399f1241dc
chore(model gallery): add qwen3-30b-a3b-abliterated ( #5285 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-01 10:07:42 +02:00
LocalAI [bot]
58c9ade2e8
chore: ⬆️ Update ggml-org/llama.cpp to 3e168bede4d27b35656ab8026015b87659ecbec2 ( #5284 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-01 10:01:39 +02:00
Ettore Di Giacinto
6e1c93d84f
fix(ci): comment out vllm tests
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-05-01 10:01:22 +02:00
Wyatt Neal
4076ea0494
fix: vllm missing logprobs ( #5279 )
...
* working to address missing items
referencing #3436 , #2930 - if i could test it, this might show that the
output from the vllm backend is processed and returned to the user
Signed-off-by: Wyatt Neal <wyatt.neal+git@gmail.com >
* adding in vllm tests to test-extras
Signed-off-by: Wyatt Neal <wyatt.neal+git@gmail.com >
* adding in tests to pipeline for execution
Signed-off-by: Wyatt Neal <wyatt.neal+git@gmail.com >
* removing todo block, test via pipeline
Signed-off-by: Wyatt Neal <wyatt.neal+git@gmail.com >
---------
Signed-off-by: Wyatt Neal <wyatt.neal+git@gmail.com >
2025-04-30 12:55:07 +00:00
Ettore Di Giacinto
26cbf77c0d
chore(model gallery): add mlabonne_qwen3-4b-abliterated ( #5283 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-30 11:09:58 +02:00
Ettore Di Giacinto
640790d628
chore(model gallery): add mlabonne_qwen3-8b-abliterated ( #5282 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-30 11:08:26 +02:00
Ettore Di Giacinto
4132adea2f
chore(model gallery): add mlabonne_qwen3-14b-abliterated ( #5281 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-30 11:04:49 +02:00
LocalAI [bot]
2b2d907a3a
chore: ⬆️ Update ggml-org/llama.cpp to e2e1ddb93a01ce282e304431b37e60b3cddb6114 ( #5278 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-29 21:46:08 +00:00
Ettore Di Giacinto
6e8f4f584b
fix(diffusers): consider options only in form of key/value ( #5277 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-29 17:08:55 +02:00
Richard Palethorpe
662cfc2b48
fix(aio): Fix copypasta in download files for gpt-4 model ( #5276 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-04-29 17:08:16 +02:00
Ettore Di Giacinto
a25d355d66
chore(model gallery): add qwen3-0.6b ( #5275 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-29 10:10:16 +02:00
Ettore Di Giacinto
6d1cfdbefc
chore(model gallery): add qwen3-1.7b ( #5274 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-29 10:06:03 +02:00
Ettore Di Giacinto
5ecc478968
chore(model gallery): add qwen3-4b ( #5273 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-29 10:01:22 +02:00
Ettore Di Giacinto
aef5c4291b
chore(model gallery): add qwen3-8b ( #5272 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-29 09:59:17 +02:00
Ettore Di Giacinto
c059f912b9
chore(model gallery): add qwen3-14b ( #5271 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-29 09:56:50 +02:00
LocalAI [bot]
bc1e059259
chore: ⬆️ Update ggml-org/llama.cpp to 5f5e39e1ba5dbea814e41f2a15e035d749a520bc ( #5267 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-29 09:49:42 +02:00
LocalAI [bot]
38dc07793a
chore(model-gallery): ⬆️ update checksum ( #5268 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-29 09:49:23 +02:00
Ettore Di Giacinto
da6ef0967d
chore(model gallery): add qwen3-32b ( #5270 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-29 09:48:28 +02:00
Ettore Di Giacinto
7a011e60bd
chore(model gallery): add qwen3-30b-a3b ( #5269 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-29 09:44:44 +02:00
dependabot[bot]
e13dd5b09f
chore(deps): bump appleboy/scp-action from 0.1.7 to 1.0.0 ( #5265 )
...
Bumps [appleboy/scp-action](https://github.com/appleboy/scp-action ) from 0.1.7 to 1.0.0.
- [Release notes](https://github.com/appleboy/scp-action/releases )
- [Changelog](https://github.com/appleboy/scp-action/blob/master/.goreleaser.yaml )
- [Commits](https://github.com/appleboy/scp-action/compare/v0.1.7...v1.0.0 )
---
updated-dependencies:
- dependency-name: appleboy/scp-action
dependency-version: 1.0.0
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-04-28 22:36:30 +00:00
Ettore Di Giacinto
86ee303bd6
chore(model gallery): add nvidia_openmath-nemotron-14b-kaggle ( #5264 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-28 19:52:36 +02:00
Ettore Di Giacinto
978ee96fd3
chore(model gallery): add nvidia_openmath-nemotron-14b ( #5263 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-28 19:43:49 +02:00
Ettore Di Giacinto
3ad5691db6
chore(model gallery): add nvidia_openmath-nemotron-7b ( #5262 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-28 19:41:59 +02:00
Ettore Di Giacinto
0027681090
chore(model gallery): add nvidia_openmath-nemotron-1.5b ( #5261 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-28 19:40:09 +02:00
Ettore Di Giacinto
8cba990edc
chore(model gallery): add nvidia_openmath-nemotron-32b ( #5260 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-28 19:36:57 +02:00
Simon Redman
88857696d4
fix(CUDA): Add note for how to run CUDA with SELinux ( #5259 )
...
* Add note to help run nvidia containers with SELinux
* Use correct CUDA container references as noted in the dockerhub overview
* Clean trailing whitespaces
2025-04-28 09:00:52 +02:00
LocalAI [bot]
23f347e687
chore: ⬆️ Update ggml-org/llama.cpp to ced44be34290fab450f8344efa047d8a08e723b4 ( #5258 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-27 21:59:35 +00:00
Mohit Gaur
b6e3dc5f02
docs: update docs for DisableWebUI flag ( #5256 )
...
Signed-off-by: Mohit Gaur <56885276+Mohit-Gaur@users.noreply.github.com >
2025-04-27 16:02:02 +02:00
Alessandro Pirastru
69667521e2
fix(install/gpu):Fix docker not being able to leverage the GPU on systems that have SELinux Enforced ( #5252 )
...
* Update installation script for improved compatibility and clarity
- Renamed VERSION to LOCALAI_VERSION to avoid conflicts with system variables.
- Enhanced NVIDIA and CUDA repository installation for DNF5 compatibility.
- Adjusted default Fedora version handling for CUDA installation.
- Updated Docker image tag handling to use LOCALAI_VERSION consistently.
- Improved logging messages for repository and LocalAI binary downloads.
- Added a temporary bypass for nvidia-smi installation on Fedora Cloud Edition.
* feat: Add SELinux configuration for NVIDIA GPU support in containers
- Introduced `enable_selinux_container_booleans` function to handle SELinux configuration changes for GPU access.
- Included user confirmation prompt to enable SELinux `container_use_devices` boolean due to security implications.
- Added NVIDIA Container Runtime to Docker runtimes and restarted Docker to ensure proper GPU support.
- Applied SELinux adjustments conditionally for Fedora, RHEL, CentOS, Rocky, and openSUSE distributions.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
* fix: Correct SELinux boolean parsing and add loop break
- Fixed incorrect parsing of `container_use_devices` boolean by changing the awk field from `$2` to `$3` to retrieve the correct value.
- Added a `break` statement after enabling the SELinux boolean to prevent unnecessary loop iterations after user prompt.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
* fix: typo in install.sh
Signed-off-by: Alessandro Pirastru <57262788+Bloodis94@users.noreply.github.com >
---------
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
Signed-off-by: Alessandro Pirastru <57262788+Bloodis94@users.noreply.github.com >
2025-04-27 16:01:29 +02:00
LocalAI [bot]
2a92effc5d
chore: ⬆️ Update ggml-org/llama.cpp to 77d5e9a76a7b4a8a7c5bf9cf6ebef91860123cba ( #5254 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-27 09:21:02 +02:00
Simon Redman
a65e012aa2
docs(Vulkan): Add GPU docker documentation for Vulkan ( #5255 )
...
Add GPU docker documentation for Vulkan
2025-04-27 09:20:26 +02:00
Ettore Di Giacinto
8e9b41d05f
chore(ci): build only images with ffmpeg included, simplify tags ( #5251 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-27 08:23:25 +02:00
LocalAI [bot]
078da5c2f0
feat(swagger): update swagger ( #5253 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-26 22:40:35 +00:00
Ettore Di Giacinto
c5af5d139c
Update index.yaml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-04-26 18:42:22 +02:00
Ettore Di Giacinto
2c9279a542
feat(video-gen): add endpoint for video generation ( #5247 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-26 18:05:01 +02:00
Ettore Di Giacinto
a67d22f5f2
chore(model gallery): add mmproj to gemma3 models (now working)
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-26 17:31:40 +02:00
Ettore Di Giacinto
dc7c51dcc7
chore(model gallery): fix correct filename for gemma-3-27b-it-qat
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-26 17:27:50 +02:00
Ettore Di Giacinto
98df65c7aa
chore(model gallery): add l3.3-genetic-lemonade-sunset-70b ( #5250 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-26 17:19:20 +02:00
Ettore Di Giacinto
1559b6b522
chore(model gallery): add l3.3-geneticlemonade-unleashed-v2-70b ( #5249 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-26 17:17:18 +02:00
Alessandro Pirastru
a0244e3fb4
feat(install): added complete process for installing nvidia drivers on fedora without pulling X11 ( #5246 )
...
* Update installation script for improved compatibility and clarity
- Renamed VERSION to LOCALAI_VERSION to avoid conflicts with system variables.
- Enhanced NVIDIA and CUDA repository installation for DNF5 compatibility.
- Adjusted default Fedora version handling for CUDA installation.
- Updated Docker image tag handling to use LOCALAI_VERSION consistently.
- Improved logging messages for repository and LocalAI binary downloads.
- Added a temporary bypass for nvidia-smi installation on Fedora Cloud Edition.
* Enhance log functions with ANSI color formatting
- Added ANSI escape codes for improved log styling: light blue for info, orange for warnings, and red for errors.
- Updated all log functions (`info`, `warn`, `fatal`) to include bold and colored output.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
* feat: Enhance log functions with ANSI color formatting
- Added ANSI escape codes for improved log styling: light blue for info, orange for warnings, and red for errors.
- Updated all log functions (`info`, `warn`, `fatal`) to include bold and colored output.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
* chore: ⬆️ Update ggml-org/llama.cpp to `ecda2ec4b347031a9b8a89ee2efc664ce63f599c` (#5238 )
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* fix(stablediffusion-ggml): Build with DSD CUDA, HIP and Metal flags (#5236 )
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* feat(install): enhance script with choice functions and logs
- Added custom `choice_info`, `choice_warn`, and `choice_fatal` functions for interactive input logging.
- Adjusted Docker volume creation message for better clarity.
- Included NVIDIA driver check log for improved feedback to users.
- Added consistent logging before starting LocalAI Docker containers across configurations.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
* feat(install): add Fedora NVIDIA driver installation option
- Introduced a new function to install NVIDIA kernel drivers on Fedora using akmod packages.
- Added user prompt to choose between installing drivers automatically or exiting for manual setup.
- Integrated the new function into the existing Fedora-specific CUDA toolkit installation workflow.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
* fix(install): correct repository ID for DNF5 configuration
- Update repository ID from 'nome-repo' to 'nvidia-cuda' for DNF5.
- Ensures the correct repository name matches expected configuration.
- Fix prevents potential misconfiguration during installation process.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
* feat(install): enhance NVIDIA driver handling on Fedora
- fixed `install_cuda_driver_yum` function call in `install_fedora_nvidia_kernel_drivers`
- Added `cuda-toolkit` for Fedora installations, as recommended by RPM Fusion.
- Adjusted driver repository commands for compatibility with DNF5.
- Improved URL and version handling for package manager installations.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
* Refactor NVIDIA driver installation process in install.sh
- Removed redundant empty lines for cleaner formatting.
- Standardized URL formatting by removing unnecessary quotes around URLs.
- Reverted logic by removing Fedora-specific exclusions for cuda-toolkit and using `cuda-drivers` universally.
- Refined repository addition for `dnf` by explicitly setting `id` and `name` parameters for clarity and accuracy.
- Fixed minor formatting inconsistencies in parameter passing.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
* feat: Update NVIDIA module installation warning in install script
- Clarified that Akmod installation may inhibit the reboot command.
- Added a cautionary note to the warning to inform users of potential risks.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
* Update NVIDIA driver installation warning message
- Clarify prerequisites by noting the need for rpmfusion free/nonfree repos.
- Improve formatting of the warning box for better readability.
- Inform users that the script will install missing repos if necessary.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
---------
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Signed-off-by: Richard Palethorpe <io@richiejp.com >
Co-authored-by: LocalAI [bot] <139863280+localai-bot@users.noreply.github.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Richard Palethorpe <io@richiejp.com >
2025-04-26 09:44:40 +02:00
LocalAI [bot]
d66396201a
chore: ⬆️ Update ggml-org/llama.cpp to 295354ea6848a77bdee204ee1c971d9b92ffcca9 ( #5245 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-26 00:05:16 +02:00
Ettore Di Giacinto
9628860c0e
feat(llama.cpp/clip): inject gpu options if we detect GPUs ( #5243 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-26 00:04:47 +02:00
Ettore Di Giacinto
cae9bf1308
chore(deps): bump grpcio to 1.72.0 ( #5244 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-25 21:32:37 +02:00
Ettore Di Giacinto
5bb5da0760
fix(ci): add clang ( #5242 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-25 16:20:05 +02:00
Ettore Di Giacinto
867973a850
chore(model gallery): add soob3123_veritas-12b ( #5241 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-25 09:20:01 +02:00
LocalAI [bot]
701cd6b6d5
chore: ⬆️ Update ggml-org/llama.cpp to 226251ed56b85190e18a1cca963c45b888f4953c ( #5240 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-25 08:42:22 +02:00
Richard Palethorpe
7f61d397d5
fix(stablediffusion-ggml): Build with DSD CUDA, HIP and Metal flags ( #5236 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-04-24 10:27:17 +02:00
Alessandro Pirastru
1ae0b896fa
fix: installation script compatibility with fedora 41 and later, fedora headless unclear errors ( #5239 )
...
Update installation script for improved compatibility and clarity
- Renamed VERSION to LOCALAI_VERSION to avoid conflicts with system variables.
- Enhanced NVIDIA and CUDA repository installation for DNF5 compatibility.
- Adjusted default Fedora version handling for CUDA installation.
- Updated Docker image tag handling to use LOCALAI_VERSION consistently.
- Improved logging messages for repository and LocalAI binary downloads.
- Added a temporary bypass for nvidia-smi installation on Fedora Cloud Edition.
2025-04-24 09:34:25 +02:00
LocalAI [bot]
3937407cb3
chore: ⬆️ Update ggml-org/llama.cpp to ecda2ec4b347031a9b8a89ee2efc664ce63f599c ( #5238 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-24 09:32:08 +02:00
LocalAI [bot]
0e34ae4f3f
chore: ⬆️ Update ggml-org/llama.cpp to 658987cfc9d752dca7758987390d5fb1a7a0a54a ( #5234 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-23 09:13:49 +02:00
dependabot[bot]
a38b99ecb6
chore(deps): bump mxschmitt/action-tmate from 3.19 to 3.21 ( #5231 )
...
Bumps [mxschmitt/action-tmate](https://github.com/mxschmitt/action-tmate ) from 3.19 to 3.21.
- [Release notes](https://github.com/mxschmitt/action-tmate/releases )
- [Changelog](https://github.com/mxschmitt/action-tmate/blob/master/RELEASE.md )
- [Commits](https://github.com/mxschmitt/action-tmate/compare/v3.19...v3.21 )
---
updated-dependencies:
- dependency-name: mxschmitt/action-tmate
dependency-version: '3.21'
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-04-22 10:27:10 +02:00
LocalAI [bot]
a4a4358182
chore: ⬆️ Update ggml-org/llama.cpp to 1d735c0b4fa0551c51c2f4ac888dd9a01f447985 ( #5233 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-22 10:25:54 +02:00
Ettore Di Giacinto
4bc39c2db3
fix: typo on README link
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-04-21 22:13:14 +02:00
Ettore Di Giacinto
cc3df759f8
chore(docs): improve installer.sh docs ( #5232 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-21 22:11:43 +02:00
LocalAI [bot]
378161060c
chore: ⬆️ Update ggml-org/llama.cpp to 6602304814e679cc8c162bb760a034aceb4f8965 ( #5228 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-20 21:44:33 +00:00
Ettore Di Giacinto
f2f788fe60
chore(model gallery): add starrysky-12b-i1 ( #5224 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-20 10:26:30 +02:00
Ettore Di Giacinto
9fa8ed6b1e
chore(model gallery) add amoral-gemma3-1b-v2 ( #5223 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-20 10:23:24 +02:00
Ettore Di Giacinto
7fc37c5e29
chore(model gallery) add llama_3.3_70b_darkhorse-i1 ( #5222 )
...
chore(model gallery): add llama_3.3_70b_darkhorse-i1
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-20 10:20:58 +02:00
Ettore Di Giacinto
4bc4b1e8bc
chore(model gallery) update gemma3 qat models
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-20 10:11:12 +02:00
LocalAI [bot]
e495b89f18
chore: ⬆️ Update ggml-org/llama.cpp to 00137157fca3d17b90380762b4d7cc158d385bd3 ( #5218 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-19 23:50:35 +00:00
LocalAI [bot]
ba09eaea1b
feat(swagger): update swagger ( #5217 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-19 22:06:30 +02:00
Ettore Di Giacinto
61cc76c455
chore(autogptq): drop archived backend ( #5214 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-19 15:52:29 +02:00
Ettore Di Giacinto
8abecb4a18
chore: bump grpc limits to 50MB ( #5212 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-19 08:53:24 +02:00
LocalAI [bot]
8b3f76d8e6
chore: ⬆️ Update ggml-org/llama.cpp to 6408210082cc0a61b992b487be7e2ff2efbb9e36 ( #5211 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-18 21:45:48 +00:00
Ettore Di Giacinto
4e0497f1a6
chore(model gallery): add pictor-1338-qwenp-1.5b ( #5208 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-18 10:47:23 +02:00
Ettore Di Giacinto
ba88c9f451
chore(ci): use gemma-3-12b-it for models notifications (twitter)
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-04-18 10:38:36 +02:00
Ettore Di Giacinto
a598285825
chore(model gallery): add google-gemma-3-27b-it-qat-q4_0-small ( #5207 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-18 10:35:48 +02:00
Ettore Di Giacinto
cb7a172897
chore(ci): use gemma-3-12b-it for models notifications
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-04-18 10:20:33 +02:00
Ettore Di Giacinto
771be28dfb
ci: use gemma3 for notifications of releases
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-04-18 10:19:52 +02:00
Ettore Di Giacinto
7d6b3eb42d
chore(model gallery): add readyart_amoral-fallen-omega-gemma3-12b ( #5206 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-18 10:17:39 +02:00
Ettore Di Giacinto
0bb33fab55
chore(model gallery): add ibm-granite_granite-3.3-2b-instruct ( #5205 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-18 10:15:05 +02:00
Ettore Di Giacinto
e3bf7f77f7
chore(model gallery): add ibm-granite_granite-3.3-8b-instruct ( #5204 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-18 09:59:17 +02:00
LocalAI [bot]
bd1707d339
chore: ⬆️ Update ggml-org/llama.cpp to 2f74c354c0f752ed9aabf7d3a350e6edebd7e744 ( #5203 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-17 21:52:12 +00:00
Ettore Di Giacinto
0474804541
fix(ci): remove duplicate entry
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-17 19:51:21 +02:00
Ettore Di Giacinto
72693b3917
feat(install.sh): allow to uninstall with --uninstall ( #5202 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-17 16:32:23 +02:00
Florian Bachmann
a03b70010f
fix(talk): Talk interface sends content-type headers to chatgpt ( #5200 )
...
Talk interface sends content-type headers to chatgpt
Signed-off-by: baflo <834350+baflo@users.noreply.github.com >
2025-04-17 15:02:11 +02:00
Ettore Di Giacinto
e3717e5c1a
chore(model gallery): add qwen2.5-14b-instruct-1m ( #5201 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-17 10:42:22 +02:00
Ettore Di Giacinto
c8f6858218
chore(ci): add latest images for core ( #5198 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-17 10:00:18 +02:00
Ettore Di Giacinto
06d7cc43ae
chore(model gallery): add dreamgen_lucid-v1-nemo ( #5196 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-17 09:10:09 +02:00
Ettore Di Giacinto
f2147cb850
chore(model gallery): add thedrummer_rivermind-12b-v1 ( #5195 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-17 09:02:54 +02:00
Ettore Di Giacinto
75bb9f4c28
chore(model gallery): add menlo_rezero-v0.1-llama-3.2-3b-it-grpo-250404 ( #5194 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-17 09:00:11 +02:00
LocalAI [bot]
a2ef4b1e07
chore: ⬆️ Update ggml-org/llama.cpp to 015022bb53387baa8b23817ac03743705c7d472b ( #5192 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-04-17 08:04:37 +02:00
LocalAI [bot]
161c9fe2db
docs: ⬆️ update docs version mudler/LocalAI ( #5191 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-16 22:13:49 +02:00
Ettore Di Giacinto
7547463f81
Update quickstart.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-04-16 08:48:55 +02:00
Gianluca Boiano
32e4dfd47b
chore(model gallery): add suno-ai bark-cpp model ( #5187 )
...
Signed-off-by: Gianluca Boiano <morf3089@gmail.com >
2025-04-16 08:22:46 +02:00
Gianluca Boiano
f67e5dec68
fix: bark-cpp: assign FLAG_TTS to bark-cpp backend ( #5186 )
...
Signed-off-by: Gianluca Boiano <morf3089@gmail.com >
2025-04-16 08:21:30 +02:00
LocalAI [bot]
297d54acea
chore: ⬆️ Update ggml-org/llama.cpp to 80f19b41869728eeb6a26569957b92a773a2b2c6 ( #5183 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-15 22:50:32 +00:00
Ettore Di Giacinto
56f44d448c
chore(docs): decrease logo size, minor enhancements
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-15 22:00:51 +02:00
Richard Palethorpe
0f0fafacd9
fix(stablediffusion): Avoid overwriting SYCL specific flags from outer make call ( #5181 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-04-15 19:31:25 +02:00
Ettore Di Giacinto
4f239bac89
feat: rebrand - LocalAGI and LocalRecall joins the LocalAI stack family ( #5159 )
...
* wip
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* docs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Update lotusdocs and hugo
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* rephrasing
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Latest fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Adjust readme section
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-15 17:51:24 +02:00
Ettore Di Giacinto
04d74ac648
chore(model gallery): add m1-32b ( #5182 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-15 17:17:17 +02:00
Richard Palethorpe
18c3dc33ee
fix(stablediffusion): Pass ROCM LD CGO flags through to recursive make ( #5179 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-04-15 09:27:29 +02:00
LocalAI [bot]
508cfa7369
chore: ⬆️ Update ggml-org/llama.cpp to d6d2c2ab8c8865784ba9fef37f2b2de3f2134d33 ( #5178 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-14 23:10:16 +02:00
Ettore Di Giacinto
1f94cddbae
chore(model gallery): add nvidia_llama-3.1-8b-ultralong-4m-instruct ( #5177 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-14 12:30:55 +02:00
Ettore Di Giacinto
21ae7b4cd4
chore(model gallery): add nvidia_llama-3.1-8b-ultralong-1m-instruct ( #5176 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-14 12:28:09 +02:00
Ettore Di Giacinto
bef22ab547
chore(model gallery): add skywork_skywork-or1-32b-preview ( #5175 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-14 12:25:43 +02:00
Ettore Di Giacinto
eb04e8cdcf
chore(model gallery): add skywork_skywork-or1-math-7b ( #5174 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-14 12:23:44 +02:00
Ettore Di Giacinto
17e533a086
chore(model gallery): add skywork_skywork-or1-7b-preview ( #5173 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-14 12:20:20 +02:00
qwerty108109
4fc68409ff
Update README.md ( #5172 )
...
Modified the README.md to separate out the different docker run commands to make it easier to copy into the terminal.
Signed-off-by: qwerty108109 <97707491+qwerty108109@users.noreply.github.com >
2025-04-14 10:48:10 +02:00
Richard Palethorpe
e587044449
fix(stablediffusion): Avoid GGML commit which causes CUDA compile error ( #5170 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-04-14 09:29:09 +02:00
LocalAI [bot]
1f09db5161
chore: ⬆️ Update ggml-org/llama.cpp to 71e90e8813f90097701e62f7fce137d96ddf41e2 ( #5171 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-13 21:46:07 +00:00
Ettore Di Giacinto
05b744f086
chore(model gallery): add daichi-12b ( #5169 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-13 15:53:11 +02:00
Ettore Di Giacinto
89ca4bc02d
chore(model gallery): add hamanasu-magnum-4b-i1 ( #5168 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-13 14:37:59 +02:00
Ettore Di Giacinto
e626aa48a4
chore(model gallery): add hamanasu-adventure-4b-i1 ( #5167 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-13 14:35:57 +02:00
Ettore Di Giacinto
752b5e0339
chore(model gallery): add mag-picaro-72b ( #5166 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-13 14:34:14 +02:00
Ettore Di Giacinto
637d72d6e3
chore(model gallery): add lightthinker-qwen ( #5165 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-13 14:31:05 +02:00
LocalAI [bot]
f3bfec580a
chore: ⬆️ Update ggml-org/llama.cpp to bc091a4dc585af25c438c8473285a8cfec5c7695 ( #5158 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-13 08:23:41 +00:00
Ettore Di Giacinto
165c1ddff3
chore(model gallery): add tesslate_gradience-t1-3b-preview ( #5160 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-12 10:37:40 +02:00
Ettore Di Giacinto
fb83238e9e
chore(model gallery): add zyphra_zr1-1.5b ( #5157 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-11 10:06:05 +02:00
Ettore Di Giacinto
700bfa41c7
chore(model gallery): add agentica-org_deepcoder-1.5b-preview ( #5156 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-11 10:03:59 +02:00
LocalAI [bot]
25bdc350df
chore: ⬆️ Update ggml-org/llama.cpp to 64eda5deb9859e87a020e56bab5d2f9ca956f1de ( #5155 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-10 21:44:55 +00:00
Richard Palethorpe
1b899e1a68
feat(stablediffusion): Enable SYCL ( #5144 )
...
* feat(sycl): Enable SYCL for stable diffusion
This is a pain because we compile with CGO, but SD is compiled with
CMake. I don't think we can easily use CMake to set the linker flags
necessary. Also I could not find pkg-config calls that would fully set
the flags, so some of them are set manually.
See https://www.intel.com/content/www/us/en/developer/tools/oneapi/onemkl-link-line-advisor.html
for reference. I also resorted to searching the shared object files in
MKLROOT/lib for the symbols.
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(ci): Don't set nproc on cmake
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-04-10 15:20:53 +02:00
Ettore Di Giacinto
3bf13f8c69
chore(model gallery): add soob3123_amoral-cogito-v1-preview-qwen-14b ( #5154 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-10 10:07:56 +02:00
Ettore Di Giacinto
7a00729374
chore(model gallery): add trappu_magnum-picaro-0.7-v2-12b ( #5153 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-10 10:03:42 +02:00
Ettore Di Giacinto
d484028532
feat(diffusers): add support for Lumina2Text2ImgPipeline ( #4806 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-10 09:55:51 +02:00
LocalAI [bot]
0eb7fc2c41
chore: ⬆️ Update ggml-org/llama.cpp to d3bd7193ba66c15963fd1c59448f22019a8caf6e ( #5152 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-09 22:01:25 +00:00
Ettore Di Giacinto
a69e30e0c9
chore(model gallery): add agentica-org_deepcoder-14b-preview ( #5151 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-09 16:55:47 +02:00
Ettore Di Giacinto
9c018e6bff
chore(model gallery): add deepcogito_cogito-v1-preview-llama-70b ( #5150 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-09 16:54:59 +02:00
Ettore Di Giacinto
281e818047
chore(model gallery): add deepcogito_cogito-v1-preview-llama-70b ( #5150 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-09 16:53:28 +02:00
Ettore Di Giacinto
270f0e2157
chore(model gallery): add deepcogito_cogito-v1-preview-qwen-32b ( #5149 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-09 16:48:15 +02:00
Ettore Di Giacinto
673e59e76c
chore(model gallery): add deepcogito_cogito-v1-preview-llama-3b ( #5148 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-09 16:42:53 +02:00
LocalAI [bot]
5a8a2adb44
chore: ⬆️ Update ggml-org/llama.cpp to b32efad2bc42460637c3a364c9554ea8217b3d7f ( #5146 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-04-09 15:39:04 +02:00
Ettore Di Giacinto
a7317d23bf
chore(model gallery): add deepcogito_cogito-v1-preview-llama-8b ( #5147 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-09 10:02:09 +02:00
Ettore Di Giacinto
2bab9b5fe2
fix: fix gallery name for cogito-v1-preview-qwen-14B
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-04-08 22:15:32 +02:00
Ettore Di Giacinto
081be3ba7d
chore(model gallery): add cogito-v1-preview-qwen-14b ( #5145 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-08 22:04:14 +02:00
Ettore Di Giacinto
25e6f21322
chore(deps): bump llama.cpp to 4ccea213bc629c4eef7b520f7f6c59ce9bbdaca0 ( #5143 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-08 11:26:06 +02:00
Ettore Di Giacinto
b4df1c9cf3
fix(gemma): improve prompt for tool calls ( #5142 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-08 10:12:42 +02:00
Ettore Di Giacinto
4fbd6609f2
chore(model gallery): add meta-llama_llama-4-scout-17b-16e-instruct ( #5141 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-08 10:12:28 +02:00
Ettore Di Giacinto
7387932f89
chore(model gallery): add mensa-beta-14b-instruct-i1 ( #5140 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-08 10:01:24 +02:00
Ettore Di Giacinto
59c37e67b2
chore(model gallery): add eurydice-24b-v2-i1 ( #5139 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-08 09:56:29 +02:00
Ettore Di Giacinto
c09d227647
chore(model gallery): add watt-ai_watt-tool-70b ( #5138 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-08 09:42:49 +02:00
Ettore Di Giacinto
547d322b28
chore(model gallery): add arliai_qwq-32b-arliai-rpr-v ( #5137 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-08 09:40:26 +02:00
dependabot[bot]
a6f0bb410f
chore(deps): bump securego/gosec from 2.22.0 to 2.22.3 ( #5134 )
...
Bumps [securego/gosec](https://github.com/securego/gosec ) from 2.22.0 to 2.22.3.
- [Release notes](https://github.com/securego/gosec/releases )
- [Changelog](https://github.com/securego/gosec/blob/master/.goreleaser.yml )
- [Commits](https://github.com/securego/gosec/compare/v2.22.0...v2.22.3 )
---
updated-dependencies:
- dependency-name: securego/gosec
dependency-version: 2.22.3
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-04-07 21:09:45 +00:00
Ettore Di Giacinto
710f624ecd
fix(webui): improve model display, do not block view ( #5133 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-07 18:03:25 +02:00
LocalAI [bot]
5018452be7
chore: ⬆️ Update ggml-org/llama.cpp to 916c83bfe7f8b08ada609c3b8e583cf5301e594b ( #5130 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-06 21:51:51 +00:00
Ettore Di Giacinto
ece239966f
chore: ⬆️ Update ggml-org/llama.cpp to 6bf28f0111ff9f21b3c1b1eace20c590281e7ba6 ( #5127 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-06 14:01:51 +02:00
Ettore Di Giacinto
3b8bc7e64c
chore(model gallery): add open-thoughts_openthinker2-7b ( #5129 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-06 10:53:22 +02:00
Ettore Di Giacinto
fc73b2b430
chore(model gallery): add open-thoughts_openthinker2-32b ( #5128 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-06 10:48:21 +02:00
Ettore Di Giacinto
901dba6063
chore(model gallery): add gemma-3-27b-it-qat ( #5124 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-05 08:46:49 +02:00
LocalAI [bot]
b88a7a4550
chore: ⬆️ Update ggml-org/llama.cpp to 3e1d29348b5d77269f6931500dd1c1a729d429c8 ( #5123 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-04 21:49:53 +00:00
Ettore Di Giacinto
106e40845f
chore(model gallery): add katanemo_arch-function-chat-3b ( #5122 )
2025-04-04 10:45:44 +02:00
Ettore Di Giacinto
0064bec8f5
chore(model gallery): add katanemo_arch-function-chat-1.5b ( #5121 )
2025-04-04 10:31:44 +02:00
Ettore Di Giacinto
9e6dbb0b5a
chore(model gallery): add katanemo_arch-function-chat-7b ( #5120 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-04 10:29:47 +02:00
Ettore Di Giacinto
d26e61388b
chore(model gallery): add tesslate_synthia-s1-27b ( #5119 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-04 10:27:52 +02:00
Ettore Di Giacinto
31a7084c75
chore(model gallery): add gemma-3-4b-it-qat ( #5118 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-04 10:23:56 +02:00
Ettore Di Giacinto
128612a6fc
chore(model gallery): add gemma-3-12b-it-qat ( #5117 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-04 10:21:45 +02:00
LocalAI [bot]
6af3f46bc3
chore: ⬆️ Update ggml-org/llama.cpp to c262beddf29f3f3be5bbbf167b56029a19876956 ( #5116 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-03 22:59:49 +00:00
Richard Palethorpe
d2cf8ef070
fix(sycl): kernel not found error by forcing -fsycl ( #5115 )
...
* chore(sycl): Update oneapi to 2025:1
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(sycl): Pass -fsycl flag as workaround
-fsycl should be set by llama.cpp's cmake file, but something goes wrong
and it doesn't appear to get added
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(build): Speed up llama build by using all CPUs
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-04-03 16:22:59 +02:00
Ettore Di Giacinto
259ad3cfe6
chore(model gallery): add all-hands_openhands-lm-1.5b-v0.1 ( #5114 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-03 10:25:46 +02:00
Ettore Di Giacinto
18b320d577
chore(deps): bump llama.cpp to 'f01bd02376f919b05ee635f438311be8dfc91d7c ( #5110 )
...
chore(deps): bump llama.cpp to 'f01bd02376f919b05ee635f438311be8dfc91d7c'
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-03 10:23:14 +02:00
Ettore Di Giacinto
89e151f035
chore(model gallery): add all-hands_openhands-lm-7b-v0.1 ( #5113 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-03 10:20:20 +02:00
Ettore Di Giacinto
22060f6410
chore(model gallery): add burtenshaw_gemmacoder3-12b ( #5112 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-03 10:17:57 +02:00
Ettore Di Giacinto
7ee3288460
chore(model gallery): add all-hands_openhands-lm-32b-v0.1 ( #5111 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-03 10:15:57 +02:00
LocalAI [bot]
cbbc954a8c
chore: ⬆️ Update ggml-org/llama.cpp to f423981ac806bf031d83784bcb47d2721bc70f97 ( #5108 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-02 09:22:53 +02:00
Ettore Di Giacinto
2c425e9c69
feat(loader): enhance single active backend by treating as singleton ( #5107 )
...
feat(loader): enhance single active backend by treating at singleton
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-01 20:58:11 +02:00
LocalAI [bot]
c59975ab05
chore: ⬆️ Update ggml-org/llama.cpp to c80a7759dab10657b9b6c3e87eef988a133b9b6a ( #5105 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-01 00:01:34 +02:00
Ettore Di Giacinto
05f7004487
fix: race during stop of active backends ( #5106 )
...
* chore: drop double call to stop all backends, refactors
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: do lock when cycling to models to delete
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-01 00:01:10 +02:00
Ettore Di Giacinto
2f9203cd2a
chore: drop remoteLibraryURL from kong vars ( #5103 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-31 22:48:17 +02:00
LocalAI [bot]
f09b33f2ef
docs: ⬆️ update docs version mudler/LocalAI ( #5104 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-31 22:48:03 +02:00
Ettore Di Giacinto
65470b0ab1
Update README
2025-03-31 21:51:09 +02:00
Ettore Di Giacinto
9a23fe662b
Update README.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-03-31 19:35:34 +02:00
LocalAI [bot]
6d7ac09e96
chore: ⬆️ Update ggml-org/llama.cpp to 4663bd353c61c1136cd8a97b9908755e4ab30cec ( #5100 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-30 21:59:30 +00:00
Ettore Di Giacinto
c2a39e3639
fix(llama.cpp): properly handle sigterm ( #5099 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-30 18:08:29 +02:00
Ettore Di Giacinto
ae625a4d00
chore(model gallery): add hammer2.0-7b ( #5098 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-30 09:50:21 +02:00
Ettore Di Giacinto
7f3a029596
chore(model gallery): add forgotten-abomination-70b-v5.0 ( #5097 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-30 09:48:24 +02:00
Ettore Di Giacinto
b34cf00819
chore(model gallery): add galactic-qwen-14b-exp1 ( #5096 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-30 09:46:51 +02:00
LocalAI [bot]
d4a10b4300
chore: ⬆️ Update ggml-org/llama.cpp to 0bb2919335d00ff0bc79d5015da95c422de51f03 ( #5095 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-29 21:40:45 +00:00
Ettore Di Giacinto
9c74d74f7b
feat(gguf): guess default context size from file ( #5089 )
...
feat(gguf): guess default config file from files
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-29 14:42:14 +01:00
Ettore Di Giacinto
679ee7bea4
chore(model gallery): add chaoticneutrals_very_berry_qwen2_7b ( #5093 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-29 12:34:49 +01:00
Ettore Di Giacinto
77d7dc62c4
chore(model gallery): add tesslate_tessa-t1-3b ( #5092 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-29 12:15:28 +01:00
Ettore Di Giacinto
699519d1fe
chore(model gallery): add tesslate_tessa-t1-7b ( #5091 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-29 12:12:01 +01:00
Ettore Di Giacinto
8faf39d34e
chore(model gallery): add tesslate_tessa-t1-14b ( #5090 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-29 11:58:39 +01:00
Ettore Di Giacinto
5d261a6fcd
chore(model gallery): add tesslate_tessa-t1-32b ( #5088 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-29 11:53:47 +01:00
Ettore Di Giacinto
22d5727089
chore(model gallery): add tarek07_legion-v2.1-llama-70b ( #5087 )
2025-03-29 11:27:06 +01:00
LocalAI [bot]
c965197d6f
chore: ⬆️ Update ggml-org/llama.cpp to b4ae50810e4304d052e630784c14bde7e79e4132 ( #5085 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-28 21:37:18 +00:00
Ettore Di Giacinto
994a6c4939
chore(model gallery): fallen-safeword-70b-r1-v4.1 ( #5084 )
2025-03-28 15:20:38 +01:00
Ettore Di Giacinto
f926d2a72b
chore(model gallery): thoughtless-fallen-abomination-70b-r1-v4.1-i1 ( #5083 )
2025-03-28 15:11:54 +01:00
Ettore Di Giacinto
ddeb9ed93e
chore(model gallery): qwen2.5-14b-instruct-1m-unalign-i1 ( #5082 )
2025-03-28 15:08:33 +01:00
Ettore Di Giacinto
c7e99c7b59
chore(model gallery): gemma-3-starshine-12b-i1 ( #5081 )
2025-03-28 14:50:39 +01:00
Ettore Di Giacinto
6fabc92e56
chore(model gallery): add soob3123_amoral-gemma3-12b-v2 ( #5080 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-28 14:45:02 +01:00
LocalAI [bot]
4645b3c919
chore: ⬆️ Update ggml-org/llama.cpp to 5dec47dcd411fdf815a3708fd6194e2b13d19006 ( #5079 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-27 23:32:33 +00:00
Dave
134fe2705c
fix: ensure git-lfs is present ( #5078 )
...
devcontainer clean builds had issue with git-lfs -- should this be installed for _all_ images for safety?
Signed-off-by: Dave Lee <dave@gray101.com >
2025-03-27 22:23:28 +01:00
LocalAI [bot]
3cca32ba7e
chore: ⬆️ Update ggml-org/llama.cpp to b3298fa47a2d56ae892127ea038942ab1cada190 ( #5077 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-27 10:47:07 +01:00
Ettore Di Giacinto
c069e61b26
chore(model gallery): add textsynth-8b-i1 ( #5076 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-26 14:40:19 +01:00
Ettore Di Giacinto
7fa159e164
chore(model gallery): add blacksheep-24b-i1 ( #5075 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-26 14:37:30 +01:00
Ettore Di Giacinto
5f92025617
chore(model gallery): add gemma-3-glitter-12b-i1 ( #5074 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-26 10:19:00 +01:00
LocalAI [bot]
333e1bc732
chore: ⬆️ Update ggml-org/llama.cpp to ef19c71769681a0b3dde6bc90911728376e5d236 ( #5073 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-26 09:51:20 +01:00
Ettore Di Giacinto
e90b97c144
chore(model gallery): add alamios_mistral-small-3.1-draft-0.5b ( #5071 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-25 10:10:45 +01:00
Ettore Di Giacinto
747eeb1d46
chore(model gallery): add helpingai_helpingai3-raw ( #5070 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-25 10:09:00 +01:00
Ettore Di Giacinto
5d2c53abc0
chore(model gallery): add jdineen_llama-3.1-8b-think ( #5069 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-25 10:06:24 +01:00
LocalAI [bot]
0b1e721242
chore: ⬆️ Update ggml-org/llama.cpp to c95fa362b3587d1822558f7e28414521075f254f ( #5068 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-24 21:37:16 +00:00
Ettore Di Giacinto
8c76a9ce99
chore(model gallery): add dusk_rainbow ( #5066 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-24 09:49:32 +01:00
Ettore Di Giacinto
338321af5b
chore(model gallery): add eximius_persona_5b ( #5065 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-24 09:30:20 +01:00
Ettore Di Giacinto
2774a92484
chore(model gallery): add impish_llama_3b ( #5064 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-24 09:27:04 +01:00
LocalAI [bot]
1a6bfb41a1
chore: ⬆️ Update ggml-org/llama.cpp to 77f9c6bbe55fccd9ea567794024cb80943947901 ( #5062 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-23 21:37:14 +00:00
Ettore Di Giacinto
314981eaf8
chore(model gallery): add fiendish_llama_3b ( #5061 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-23 10:00:19 +01:00
Ettore Di Giacinto
d7266c633d
chore(model gallery): add sicariussicariistuff_x-ray_alpha ( #5060 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-23 09:56:35 +01:00
Ettore Di Giacinto
eb4d5f2b95
chore(model gallery): add mawdistical_mawdistic-nightlife-24b ( #5059 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-23 09:52:50 +01:00
Ettore Di Giacinto
c63b449ad6
chore(model gallery): add huihui-ai_gemma-3-1b-it-abliterated ( #5058 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-23 09:35:05 +01:00
Ettore Di Giacinto
dd4a778c2c
chore(model gallery): add thedrummer_fallen-gemma3-27b-v1 ( #5057 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-23 09:32:58 +01:00
Ettore Di Giacinto
a0896d21d6
chore(model gallery): add thedrummer_fallen-gemma3-12b-v1 ( #5056 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-23 09:31:37 +01:00
Ettore Di Giacinto
0e697f951a
chore(model gallery): add thedrummer_fallen-gemma3-4b-v1 ( #5055 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-23 09:30:17 +01:00
Ettore Di Giacinto
fa4bb9082d
chore(model gallery): add knoveleng_open-rs3 ( #5054 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-23 09:27:27 +01:00
LocalAI [bot]
8ff7b15441
chore: ⬆️ Update ggml-org/llama.cpp to ba932dfb50cc694645b1a148c72f8c06ee080b17 ( #5053 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-22 22:18:55 +00:00
LocalAI [bot]
dd45f85a20
chore: ⬆️ Update ggml-org/llama.cpp to 4375415b4abf94fb36a5fd15f233ac0ee23c0bd1 ( #5052 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-21 21:36:25 +00:00
Ettore Di Giacinto
decdd9e522
chore(model gallery): add luvgpt_phi3-uncensored-chat ( #5051 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-21 09:11:07 +01:00
Ettore Di Giacinto
31a21d4a2c
chore(model gallery): add sao10k_llama-3.3-70b-vulpecula-r1 ( #5050 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-21 09:08:55 +01:00
Ettore Di Giacinto
2c129843a7
chore(model gallery): add qwen-writerdemo-7b-s500-i1 ( #5049 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-21 09:03:39 +01:00
LocalAI [bot]
ce71a0bcfb
chore: ⬆️ Update ggml-org/llama.cpp to e04643063b3d240b8c0fdba98677dff6ba346784 ( #5047 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-20 21:34:51 +00:00
Ettore Di Giacinto
0a32c38317
chore(model gallery): add basic function template for gemma
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-20 09:32:21 +01:00
Ettore Di Giacinto
36f596f260
chore(model gallery): add soob3123_amoral-gemma3-4b ( #5046 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-20 09:30:04 +01:00
Ettore Di Giacinto
953552545b
chore(model gallery): add samsungsailmontreal_bytecraft ( #5045 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-20 09:27:33 +01:00
Ettore Di Giacinto
835e55b1de
chore(model gallery): add rootxhacker_apollo-v3-32b ( #5044 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-20 09:20:42 +01:00
Ettore Di Giacinto
dcd2921eaa
chore(model gallery): add gemma-3-4b-it-uncensored-dbl-x-i1 ( #5043 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-20 09:17:20 +01:00
LocalAI [bot]
5e6459fd18
chore: ⬆️ Update ggml-org/llama.cpp to 568013d0cd3d5add37c376b3d5e959809b711fc7 ( #5042 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-19 21:47:18 +00:00
Ettore Di Giacinto
50ddb3eb59
chore(model gallery): add nvidia_llama-3_3-nemotron-super-49b-v1 ( #5041 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-19 09:37:27 +01:00
Ettore Di Giacinto
5eebfee4b5
chore(model gallery): add gryphe_pantheon-rp-1.8-24b-small-3.1 ( #5040 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-19 09:32:47 +01:00
Ettore Di Giacinto
567919ea90
chore(model gallery): add mistralai_mistral-small-3.1-24b-instruct-2503 ( #5039 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-19 09:29:23 +01:00
LocalAI [bot]
27a3997530
chore(model-gallery): ⬆️ update checksum ( #5036 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-19 09:18:40 +01:00
LocalAI [bot]
192ba2c657
chore: ⬆️ Update ggml-org/llama.cpp to d84635b1b085d54d6a21924e6171688d6e3dfb46 ( #5035 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-18 22:23:39 +00:00
Ettore Di Giacinto
92abac9ca8
chore(model gallery): add soob3123_amoral-gemma3-12b ( #5034 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-18 09:38:05 +01:00
Ettore Di Giacinto
04ebbbd73a
chore(model gallery): add mlabonne_gemma-3-4b-it-abliterated ( #5033 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-18 09:36:14 +01:00
Ettore Di Giacinto
55305e0d95
chore(model gallery): add mlabonne_gemma-3-12b-it-abliterated ( #5032 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-18 09:32:41 +01:00
Ettore Di Giacinto
67623639e4
chore(model gallery): add mlabonne_gemma-3-27b-it-abliterated ( #5031 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-18 09:30:25 +01:00
LocalAI [bot]
cc76def342
chore: ⬆️ Update ggml-org/llama.cpp to b1b132efcba216c873715c483809730bb253f4a1 ( #5029 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-17 21:43:15 +00:00
Ettore Di Giacinto
4967fa5928
chore(model gallery): disable gemma3 mmproj
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-17 12:34:21 +01:00
Ettore Di Giacinto
2b98e4ec56
chore(model gallery): update gemma3 URLs
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-17 12:22:35 +01:00
Ettore Di Giacinto
fa1d058ee2
chore(model gallery): add mproj files for gemma3 models ( #5028 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-17 12:11:46 +01:00
Ettore Di Giacinto
a49a588bfa
chore(model gallery): add readyart_forgotten-safeword-70b-3.6 ( #5027 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-17 11:50:34 +01:00
LocalAI [bot]
ca7dda61c6
chore: ⬆️ Update ggml-org/llama.cpp to 8ba95dca2065c0073698afdfcda4c8a8f08bf0d9 ( #5026 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-16 21:42:17 +00:00
Ettore Di Giacinto
ffedddd76d
chore(model gallery): add beaverai_mn-2407-dsk-qwqify-v0.1-12b ( #5024 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-16 09:33:19 +01:00
Ettore Di Giacinto
766c76ae8e
chore(model gallery): add pocketdoc_dans-sakurakaze-v1.0.0-12b ( #5023 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-16 09:29:48 +01:00
LocalAI [bot]
3096ff33e9
chore: ⬆️ Update ggml-org/llama.cpp to f4c3dd5daa3a79f713813cf1aabdc5886071061d ( #5022 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-15 21:43:48 +00:00
Ettore Di Giacinto
90a7451da4
chore(model gallery): add allura-org_bigger-body-70b ( #5021 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-15 14:43:51 +01:00
LocalAI [bot]
529a4b9ee8
chore: ⬆️ Update ggml-org/llama.cpp to 9f2250ba722738ec0e6ab684636268a79160c854 ( #5019 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-14 21:45:54 +00:00
Ettore Di Giacinto
0567e104eb
chore(model gallery): add eurollm-9b-instruct ( #5017 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-14 09:25:44 +01:00
Ettore Di Giacinto
ecbeacd022
chore(model gallery): add prithivmlmods_viper-coder-32b-elite13 ( #5016 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-14 09:20:27 +01:00
Ettore Di Giacinto
2772960e41
chore(model gallery): add nousresearch_deephermes-3-llama-3-3b-preview ( #5015 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-14 09:16:17 +01:00
Ettore Di Giacinto
1b694191e2
chore(model gallery): add nousresearch_deephermes-3-mistral-24b-preview ( #5014 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-14 09:13:27 +01:00
Ettore Di Giacinto
69578a5f8f
chore(model gallery): add models/qgallouedec_gemma-3-27b-it-codeforces-sft ( #5013 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-14 09:11:13 +01:00
LocalAI [bot]
7d96cfe72b
chore: ⬆️ Update ggml-org/llama.cpp to 84d547554123a62e9ac77107cb20e4f6cc503af4 ( #5011 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-13 22:30:17 +00:00
Ettore Di Giacinto
423514a5a5
fix(clip): do not imply GPU offload by default ( #5010 )
...
* fix(clip): do not imply GPUs by default
Until a better solution is found upstream, be conservative and default
to GPU.
https://github.com/ggml-org/llama.cpp/pull/12322
https://github.com/ggml-org/llama.cpp/pull/12322#issuecomment-2720970695
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* allow to override gpu via backend options
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-13 15:14:11 +01:00
Ettore Di Giacinto
12568c7d6d
chore(model gallery): add gemma-3-1b-it ( #5009 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-13 09:48:40 +01:00
Ettore Di Giacinto
8d16a0a536
chore(model gallery): add gemma-3-4b-it ( #5008 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-13 09:47:01 +01:00
Ettore Di Giacinto
87ca801f00
chore(model gallery): add gemma-3-12b-it ( #5007 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-13 09:44:49 +01:00
Ettore Di Giacinto
e4ecbb6c30
chore(model gallery): add gemma-3-27b-it ( #5003 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-13 08:28:28 +01:00
LocalAI [bot]
b1a67de2b9
chore: ⬆️ Update ggml-org/llama.cpp to f08f4b3187b691bb08a8884ed39ebaa94e956707 ( #5006 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-13 01:01:30 +00:00
LocalAI [bot]
71a23910fe
chore: ⬆️ Update ggml-org/llama.cpp to 80a02aa8588ef167d616f76f1781b104c245ace0 ( #5004 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-12 16:26:09 +00:00
LocalAI [bot]
0ede31f9cf
chore: ⬆️ Update ggml-org/llama.cpp to 10f2e81809bbb69ecfe64fc8b4686285f84b0c07 ( #4996 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-12 14:13:04 +00:00
Ettore Di Giacinto
9f5dcf2d1e
feat(aio): update AIO image defaults ( #5002 )
...
* feat(aio): update AIO image defaults
cpu:
- text-to-text: llama3.1
- embeddings: granite-embeddings
- vision: moonream2
gpu/intel:
- text-to-text: localai-functioncall-qwen2.5-7b-v0.5
- embeddings: granite-embeddings
- vision: minicpm
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat(aio): use minicpm as moondream2 stopped working
https://github.com/ggml-org/llama.cpp/pull/12322#issuecomment-2717483759
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-12 12:55:06 +01:00