Ettore Di Giacinto
47ccfccf7a
fix(ci): add nvidia-l4t capability to l4t images ( #5914 )
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-07-25 22:45:09 +02:00
LocalAI [bot]
a760f7ff39
docs: ⬆️ update docs version mudler/LocalAI ( #5912 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-25 22:15:16 +02:00
Ettore Di Giacinto
facf7625f3
fix(vulkan): use correct image suffix ( #5911 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-25 19:20:20 +02:00
Ettore Di Giacinto
b3600b3c50
feat(backend gallery): add mirrors ( #5910 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-25 19:20:08 +02:00
Ettore Di Giacinto
f0b47cfe6a
fix(backends gallery): trim string when reading cap from file ( #5909 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-25 18:10:02 +02:00
Ettore Di Giacinto
ee625fc34e
fix(backends gallery): pass-by backend galleries to the model service ( #5906 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-25 16:38:09 +02:00
Ettore Di Giacinto
693aa0b5de
Update README.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-07-25 11:51:23 +02:00
Ettore Di Giacinto
3973e6e5da
fix(install.sh): update to use the new binary naming ( #5903 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-25 10:43:22 +02:00
LocalAI [bot]
fb6ec68090
chore: ⬆️ Update ggml-org/whisper.cpp to 7de8dd783f7b2eab56bff6bbc5d3369e34f0e77f ( #5902 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-25 08:40:24 +02:00
LocalAI [bot]
0301fc7c46
chore: ⬆️ Update leejet/stable-diffusion.cpp to eed97a5e1d054f9c1e7ac01982ae480411d4157e ( #5901 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-25 08:40:06 +02:00
LocalAI [bot]
813cb4296d
chore: ⬆️ Update ggml-org/llama.cpp to 3f4fc97f1d745f1d5d3c853949503136d419e6de ( #5900 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-25 08:39:44 +02:00
Ettore Di Giacinto
deda3a4972
Update build documentation
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-24 22:53:08 +02:00
Ettore Di Giacinto
a28f27604a
Update backends.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-07-24 16:18:25 +02:00
Richard Palethorpe
8fe9fa98f2
fix(stablediffusion-cpp): Switch back to upstream and update ( #5880 )
...
* sync(stablediffusion-cpp): Switch back to upstream and update
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(stablediffusion-ggml): NULL terminate options array to prevent segfault
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(build): Add BUILD_TYPE and BASE_IMAGE to all backends
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-07-24 16:03:18 +02:00
Nathaniel Hyson
4db1b80278
Update quickstart.md ( #5898 )
...
Fixed spelling mistake
Signed-off-by: Nathaniel Hyson <Shinrai@users.noreply.github.com >
2025-07-24 15:04:02 +02:00
Dave
b3c2a3c257
fix: untangle pkg and core ( #5896 )
...
* migrate core/system to pkg/system - it has no dependencies FROM core, and IS USED in pkg
Signed-off-by: Dave Lee <dave@gray101.com >
* move pkg/templates up to core/templates -- nothing in pkg references it, but it does reference core.
Signed-off-by: Dave Lee <dave@gray101.com >
* remove extra check, len of nil is 0
Signed-off-by: Dave Lee <dave@gray101.com >
* move pkg/startup to core/startup -- it does have important and unfixable dependencies on core
Signed-off-by: Dave Lee <dave@gray101.com >
---------
Signed-off-by: Dave Lee <dave@gray101.com >
2025-07-24 15:03:41 +02:00
LocalAI [bot]
61c2304638
chore: ⬆️ Update ggml-org/llama.cpp to a86f52b2859dae4db5a7a0bbc0f1ad9de6b43ec6 ( #5894 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-24 15:02:37 +02:00
Ettore Di Giacinto
92c5ab97e2
chore(Makefile): drop unused targets ( #5893 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-24 14:49:50 +02:00
LocalAI [bot]
76e471441c
chore: ⬆️ Update richiejp/stable-diffusion.cpp to 10c6501bd05a697e014f1bee3a84e5664290c489 ( #5732 )
...
⬆️ Update richiejp/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-23 21:09:02 +00:00
Dave
9cecf5e7ac
fix: rename Dockerfile.go --> Dockerfile.golang to avoid IDE errors ( #5892 )
...
extract up and out Dockerfile.go --> Dockerfile.golang rename. Prevents syntax highlighting and IDE errors
Signed-off-by: Dave Lee <dave@gray101.com >
2025-07-23 21:33:26 +02:00
Ettore Di Giacinto
b7b3164736
chore: try to speedup build
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 21:21:23 +02:00
Ettore Di Giacinto
5f7ece3e94
fix(p2p): adapt to backend changes, general improvements ( #5889 )
...
The binary is now named "llama-cpp-rpc-server" for p2p workers.
We also decrease the default token rotation interval, in this way
peer discovery is much more responsive.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 12:40:32 +02:00
Ettore Di Giacinto
c717b8d800
chore(model gallery): add qwen3-coder-480b-a35b-instruct ( #5888 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 11:59:58 +02:00
Ettore Di Giacinto
f1d35c4149
chore(model gallery): add qwen3-235b-a22b-instruct-2507 ( #5887 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 11:54:58 +02:00
Ettore Di Giacinto
ee7e77b6c1
chore(model gallery): add menlo_lucy ( #5886 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 11:51:51 +02:00
Ettore Di Giacinto
324fecbb75
chore(model gallery): add entfane_math-genius-7b ( #5885 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 11:45:23 +02:00
Ettore Di Giacinto
a79bfcf0a7
chore(model gallery): add dream-org_dream-v0-instruct-7b ( #5884 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 11:40:53 +02:00
Ettore Di Giacinto
82495e7fb6
chore(model gallery): add omega-qwen3-atom-8b ( #5883 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 11:33:43 +02:00
Ettore Di Giacinto
6030b12283
chore(backend gallery): add name to 'diffusers' meta
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 09:21:04 +02:00
LocalAI [bot]
b5be867e28
chore: ⬆️ Update ggml-org/llama.cpp to acd6cb1c41676f6bbb25c2a76fa5abeb1719301e ( #5882 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-22 21:12:06 +00:00
Ettore Di Giacinto
9b806250d4
chore: drop vllm for cuda 11 ( #5881 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-22 18:47:31 +02:00
Ettore Di Giacinto
5f066e702f
fix(darwin): add dashes on image suffix
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-22 17:08:19 +02:00
dependabot[bot]
47bb3a3db2
chore(deps): bump securego/gosec from 2.22.5 to 2.22.7 ( #5878 )
...
Bumps [securego/gosec](https://github.com/securego/gosec ) from 2.22.5 to 2.22.7.
- [Release notes](https://github.com/securego/gosec/releases )
- [Changelog](https://github.com/securego/gosec/blob/master/.goreleaser.yml )
- [Commits](https://github.com/securego/gosec/compare/v2.22.5...v2.22.7 )
---
updated-dependencies:
- dependency-name: securego/gosec
dependency-version: 2.22.7
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-07-22 16:42:11 +02:00
Richard Palethorpe
51230a801e
fix(build): Add and update ONEAPI_VERSION ( #5874 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-07-22 16:41:49 +02:00
Richard Palethorpe
754bedc3ea
fix(realtime): Reset speech started flag on commit ( #5879 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-07-22 16:41:12 +02:00
Ettore Di Giacinto
98e5291afc
feat: refactor build process, drop embedded backends ( #5875 )
...
* feat: split remaining backends and drop embedded backends
- Drop silero-vad, huggingface, and stores backend from embedded
binaries
- Refactor Makefile and Dockerfile to avoid building grpc backends
- Drop golang code that was used to embed backends
- Simplify building by using goreleaser
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(gallery): be specific with llama-cpp backend templates
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(docs): update
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(ci): minor fixes
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore: drop all ffmpeg references
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: run protogen-go
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Always enable p2p mode
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Update gorelease file
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(stores): do not always load
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fix linting issues
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Simplify
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Mac OS fixup
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-22 16:31:04 +02:00
LocalAI [bot]
e29b2c3aff
chore: ⬆️ Update ggml-org/llama.cpp to 6c9ee3b17e19dcc82ab93d52ae46fdd0226d4777 ( #5877 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-22 08:25:43 +02:00
LocalAI [bot]
8dc574f3c4
chore: ⬆️ Update ggml-org/whisper.cpp to 1f5cf0b2888402d57bb17b2029b2caa97e5f3baf ( #5876 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-22 08:25:13 +02:00
Ettore Di Giacinto
05bf2493a5
fix: do not pass by environ to ffmpeg ( #5871 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-21 14:35:33 +02:00
Max Goltzsche
eae4ca08da
feat(openai): support input_audio chat api field ( #5870 )
...
Improving the chat completion endpoint OpenAI API compatibility by supporting messages of type `input_audio`, e.g.:
```
{
...
"messages": [
{
"role": "user",
"content": [{
"type": "input_audio",
"input_audio": {
"data": "<base64-encoded audio data>",
"format": "wav"
}
}]
}
]
}
```
Closes #5869
Signed-off-by: Max Goltzsche <max.goltzsche@gmail.com >
2025-07-21 09:15:55 +02:00
LocalAI [bot]
fa284f7445
chore: ⬆️ Update ggml-org/llama.cpp to 2be60cbc2707359241c2784f9d2e30d8fc7cdabb ( #5867 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-21 09:14:09 +02:00
Ettore Di Giacinto
8f69b80520
Update index.yaml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-07-20 22:54:12 +02:00
Ettore Di Giacinto
b1fc5acd4a
feat: split whisper from main binary ( #5863 )
...
* feat: split whisper from main binary
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Cleanup makefile
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add backend builds (missing only darwin)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Test CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add whisper backend to test runs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Make sure we have runtime libs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Less grpc on the main Dockerfile
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fix hipblas build
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add whisper to index
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Re-enable CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Adapt auto-bumper
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-20 22:52:45 +02:00
LocalAI [bot]
fab41c29dd
chore(model-gallery): ⬆️ update checksum ( #5865 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-20 20:37:43 +02:00
Ettore Di Giacinto
fb0ec96396
ci: do not upgrade pip
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-20 12:30:12 +02:00
LocalAI [bot]
7659461036
chore: ⬆️ Update ggml-org/llama.cpp to a979ca22db0d737af1e548a73291193655c6be99 ( #5862 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-20 08:43:36 +02:00
Ettore Di Giacinto
580687da46
feat: remove stablediffusion-ggml from main binary ( #5861 )
...
* feat: split stablediffusion-ggml from main binary
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Test CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Adapt ci tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Try to support nvidial4t
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Latest fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-19 21:58:53 +02:00
LocalAI [bot]
1929eb2894
chore: ⬆️ Update ggml-org/llama.cpp to bf9087f59aab940cf312b85a67067ce33d9e365a ( #5860 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-19 08:52:07 +02:00
Ettore Di Giacinto
b29544d747
feat: split piper from main binary ( #5858 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-19 08:31:33 +02:00
Ettore Di Giacinto
7c30e82647
fix: autoload backends when installing models from YAML files ( #5859 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-18 21:31:02 +02:00
Dedy F. Setyawan
a1d061c835
fix(docs): Resolve logo overlap on tablet view ( #5853 )
...
* fix(docs): Resolve logo overlap on tablet view
Signed-off-by: Dedy F. Setyawan <dedyfajars@gmail.com >
* fix(docs): Adjust header logo size
Signed-off-by: Dedy F. Setyawan <dedyfajars@gmail.com >
* refactor(docs): Rework header logo sizing implementation
Signed-off-by: Dedy F. Setyawan <dedyfajars@gmail.com >
---------
Signed-off-by: Dedy F. Setyawan <dedyfajars@gmail.com >
2025-07-18 15:55:44 +02:00
Sijia Lu
851c67019c
fix: dockerfile typo ( #5823 )
...
fix dockerfile typo
Signed-off-by: LeonSijiaLu <leonsijialu1@gmail.com >
2025-07-18 14:59:33 +02:00
Ettore Di Giacinto
53ed5ef189
Makefile fixup
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-18 14:57:14 +02:00
Ettore Di Giacinto
294f7022f3
feat: do not bundle llama-cpp anymore ( #5790 )
...
* Build llama.cpp separately
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* WIP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* WIP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* WIP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Start to try to attach some tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add git and small fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: correctly autoload external backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Try to run AIO tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Slightly update the Makefile helps
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Adapt auto-bumper
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Try to run linux test
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add llama-cpp into build pipelines
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add default capability (for cpu)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Drop llama-cpp specific logic from the backend loader
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* drop grpc install in ci for tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Pass by backends path for tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Build protogen at start
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(tests): set backends path consistently
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Correctly configure the backends path
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Try to build for darwin
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* WIP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Compile for metal on arm64/darwin
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Try to run build off from cross-arch
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add to the backend index nvidia-l4t and cpu's llama-cpp backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Build also darwin-x86 for llama-cpp
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Disable arm64 builds temporary
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Test backend build on PR
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixup build backend reusable workflow
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* pass by skip drivers
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Use crane
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Skip drivers
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* x86 darwin
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add packaging step for llama.cpp
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fix leftover from bark-cpp extraction
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Try to fix hipblas build
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-18 13:24:12 +02:00
Richard Palethorpe
932f6b01a6
feat(realtime): Add speech started and stopped events ( #5856 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-07-18 09:22:23 +02:00
LocalAI [bot]
e96452c5d4
chore: ⬆️ Update ggml-org/llama.cpp to d6fb3f6b49b27ef1c0f4cf5128e041f7e7dc03af ( #5857 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-17 22:56:40 +00:00
Ettore Di Giacinto
5fc8d5bb78
fix: explorer page should not have login ( #5855 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-17 10:54:03 +02:00
LocalAI [bot]
121937ed6f
chore: ⬆️ Update ggml-org/llama.cpp to 496957e1cbcb522abc63aa18521036e40efce985 ( #5854 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-16 22:52:10 +00:00
LocalAI [bot]
2e38f2a054
chore: ⬆️ Update ggml-org/llama.cpp to 4a4f426944e79b79e389f9ed7b34831cb9b637ad ( #5852 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-15 22:56:04 +00:00
LocalAI [bot]
2a6187bc01
chore: ⬆️ Update ggml-org/llama.cpp to bdca38376f7e8dd928defe01ce6a16218a64b040 ( #5850 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-15 08:16:01 +02:00
LocalAI [bot]
584c48df5a
chore: ⬆️ Update ggml-org/whisper.cpp to 032697b9a850dc2615555e2a93a683cc3dd58559 ( #5849 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-14 22:47:30 +00:00
Ettore Di Giacinto
8dd67748a1
chore(model gallery): add sophosympatheia_strawberrylemonade-70b-v1.1 ( #5848 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-14 15:38:41 +02:00
Ettore Di Giacinto
3fd0bf3c88
chore(model gallery): add zhi-create-qwen3-32b-i1 ( #5847 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-14 15:33:40 +02:00
LocalAI [bot]
4062a6c404
chore: ⬆️ Update ggml-org/llama.cpp to 982e347255723fe6d02e60ee30cfdd0559c884c5 ( #5845 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-14 08:21:54 +02:00
Ettore Di Giacinto
354c0b763e
feat(cli): add command to create custom OCI images from directories ( #5844 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-14 08:21:29 +02:00
LocalAI [bot]
40f9065367
chore: ⬆️ Update ggml-org/whisper.cpp to a16da91365700f396da916d16a7f5a2ec99364b9 ( #5846 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-13 22:46:03 +00:00
Ettore Di Giacinto
fc02bc0aba
chore(model gallery): add google_medgemma-27b-it ( #5843 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-13 18:20:21 +02:00
Ettore Di Giacinto
45badb75e8
chore(model gallery): add google_medgemma-4b-it ( #5842 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-13 17:56:44 +02:00
LocalAI [bot]
d7e1922582
chore: ⬆️ Update ggml-org/whisper.cpp to 3775c503d5133d3d8b99d7d062e87a54064b0eb8 ( #5841 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-13 08:49:15 +02:00
LocalAI [bot]
642a39afa0
chore: ⬆️ Update ggml-org/llama.cpp to c31e60647def83d671bac5ab5b35579bf25d9aa1 ( #5840 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-12 22:44:45 +00:00
Ettore Di Giacinto
34d9deaf39
chore(model gallery): add impish_magic_24b-i1 ( #5839 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-12 19:37:10 +02:00
Ettore Di Giacinto
ef37a73e1b
chore(model gallery): add mistral-2x24b-moe-power-coder-magistral-devstral-reasoning-ultimate-neo-max-44b ( #5838 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-12 19:27:46 +02:00
Ettore Di Giacinto
37de945ae8
chore(model gallery): add nvidia_llama-3_3-nemotron-super-49b-genrm-multilingual ( #5837 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-12 19:21:40 +02:00
LocalAI [bot]
468f1f4539
chore: ⬆️ Update ggml-org/llama.cpp to f5e96b368f1acc7f53c390001b936517c4d18999 ( #5835 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-11 22:46:25 +00:00
Ettore Di Giacinto
0640451368
chore(model gallery): add mistralai_devstral-small-2507 ( #5834 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-11 11:51:11 +02:00
Ettore Di Giacinto
99058511cc
chore(model gallery): add huihui-ai_huihui-gemma-3n-e4b-it-abliterated ( #5833 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-11 11:42:01 +02:00
Ettore Di Giacinto
ec293b3b59
chore(model gallery): add microsoft_nextcoder-32b ( #5832 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-11 11:36:11 +02:00
LocalAI [bot]
9b1b6df8e9
chore: ⬆️ Update ggml-org/llama.cpp to 0b8855775c6b873931d40b77a5e42558aacbde52 ( #5830 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-10 22:48:03 +00:00
Ettore Di Giacinto
cd7fbafcd2
chore(model gallery): add thedrummer_tiger-gemma-12b-v3 ( #5827 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-10 14:19:41 +02:00
Ettore Di Giacinto
e5125216cf
chore(model gallery): add thedrummer_big-tiger-gemma-27b-v3 ( #5826 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-10 14:17:34 +02:00
Ettore Di Giacinto
2105f82433
chore(model gallery): add delta-vector_plesio-70b ( #5825 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-10 14:13:14 +02:00
Ettore Di Giacinto
49c0c7881a
chore(model gallery): add huggingfacetb_smollm3-3b ( #5820 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-09 18:27:52 +02:00
Ettore Di Giacinto
f8829376d8
chore(model gallery): add zerofata_l3.3-geneticlemonade-opus-70b ( #5819 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-09 18:23:01 +02:00
Ettore Di Giacinto
0475f63675
chore(model gallery): add lyranovaheart_starfallen-snow-fantasy-24b-ms3.2-v0.0 ( #5818 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-09 18:20:16 +02:00
Ettore Di Giacinto
ec206cc67c
feat(cli): allow to install backends from OCI tar files ( #5816 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-09 18:19:51 +02:00
LocalAI [bot]
34171fcf94
chore: ⬆️ Update ggml-org/llama.cpp to 6efcd65945a98cf6883cdd9de4c8ccd8c79d219a ( #5817 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-08 22:46:10 +00:00
LocalAI [bot]
238c334aa7
chore: ⬆️ Update ggml-org/whisper.cpp to 869335f2d58d04010535be9ae23a69a9da12a169 ( #5809 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-08 17:46:08 +02:00
Ettore Di Giacinto
d2df0a1769
chore(model gallery): add qwen3-8b-shiningvaliant3 ( #5815 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-08 13:17:43 +02:00
Ettore Di Giacinto
d58647ac31
chore(model gallery): add ockerman0_anubislemonade-70b-v1.1 ( #5814 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-08 13:14:46 +02:00
Ettore Di Giacinto
c1d3ce9a93
chore(model gallery): add cognitivecomputations_dolphin-mistral-24b-venice-edition ( #5813 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-08 13:09:29 +02:00
Richard Palethorpe
c1dd4ff5d5
feat(whisper): Enable SYCL ( #5802 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-07-08 12:54:20 +02:00
LocalAI [bot]
48118b9582
chore: ⬆️ Update ggml-org/llama.cpp to 12f55c302b35cfe900b84c5fe67c262026af9c44 ( #5808 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-07 22:50:16 +00:00
Ettore Di Giacinto
ceda2e69db
chore(model gallery): add huihui-jan-nano-abliterated ( #5806 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-07 11:35:39 +02:00
Ettore Di Giacinto
cea1703acc
chore(model gallery): add zonui-3b-i1 ( #5805 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-07 11:32:58 +02:00
Ettore Di Giacinto
33fc9b9922
chore(model gallery): add mini-hydra ( #5804 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-07 11:27:42 +02:00
Ettore Di Giacinto
b783997c52
chore(model gallery): add compumacy-experimental-32b ( #5803 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-07 11:21:44 +02:00
LocalAI [bot]
f6ec06d21c
chore: ⬆️ Update ggml-org/llama.cpp to 6491d6e4f1caf0ad2221865b4249ae6938a6308c ( #5801 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-06 22:45:50 +00:00
Ettore Di Giacinto
7e1f2657d5
Update GPU-acceleration.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-07-06 19:03:34 +02:00
Ettore Di Giacinto
9589097252
chore(model gallery): add nano_imp_1b-q8_0 ( #5800 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-06 18:58:56 +02:00
Ettore Di Giacinto
cb87d331a9
chore(model gallery): add sicariussicariistuff_impish_llama_4b ( #5799 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-06 18:57:18 +02:00
Ettore Di Giacinto
6dfc96249a
Update README.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-07-06 18:07:36 +02:00
LocalAI [bot]
a2564ed654
chore: ⬆️ Update ggml-org/llama.cpp to a0374a67e2924f2e845cdc59dd67d9a44065a89c ( #5798 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-05 22:48:28 +00:00
LocalAI [bot]
6c747caa34
chore: ⬆️ Update ggml-org/llama.cpp to ef797db357e44ecb7437fa9d22f4e1614104b342 ( #5795 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-04 22:46:51 +00:00
Ettore Di Giacinto
8ae5e0feb9
chore(model gallery): add ockerman0_anubislemonade-70b-v1 ( #5794 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-04 18:43:35 +02:00
Ettore Di Giacinto
c35dd0a7b8
chore(model gallery): add zerofata_ms3.2-paintedfantasy-visage-33b ( #5793 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-04 18:42:01 +02:00
Ettore Di Giacinto
2f5af6b246
chore(model gallery): add agentica-org_deepswe-preview ( #5792 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-04 18:39:36 +02:00
Ettore Di Giacinto
00cf2e0e0a
chore(model gallery): add helpingai_dhanishtha-2.0-preview ( #5791 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-04 18:38:18 +02:00
LocalAI [bot]
c7a1d9c089
chore: ⬆️ Update ggml-org/llama.cpp to bee28421be25fd447f61cb6db64d556cbfce32ec ( #5788 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-04 08:46:56 +02:00
LocalAI [bot]
ad7ba52166
chore: ⬆️ Update PABannier/bark.cpp to 5d5be84f089ab9ea53b7a793f088d3fbf7247495 ( #4786 )
...
⬆️ Update PABannier/bark.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-03 22:44:53 +00:00
Ettore Di Giacinto
c5b9f45166
chore(cli): add backends CLI to manipulate and install backends ( #5787 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-03 19:31:27 +02:00
Ettore Di Giacinto
61b64a65ab
chore(bark-cpp): generalize and move to bark-cpp ( #5786 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-03 19:31:10 +02:00
Ettore Di Giacinto
8276952920
feat(system): detect and allow to override capabilities ( #5785 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-03 19:30:52 +02:00
Ettore Di Giacinto
b7cd5bfaec
feat(backends): add metas in the gallery ( #5784 )
...
* chore(backends): add metas in the gallery
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore: correctly handle aliases and metas with same names
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-03 18:01:55 +02:00
LocalAI [bot]
da4312e4d3
chore: ⬆️ Update ggml-org/llama.cpp to e75ba4c0434eb759eb7ff74e034ebe729053e575 ( #5783 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-03 10:01:17 +02:00
LocalAI [bot]
7d507c54ed
chore: ⬆️ Update ggml-org/whisper.cpp to d9999d54c868b8bfcd376aa26067e787d53e679e ( #5782 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-03 09:57:36 +02:00
LocalAI [bot]
df7ed49889
docs: ⬆️ update docs version mudler/LocalAI ( #5781 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-02 22:45:21 +00:00
Ettore Di Giacinto
bfdc29d316
fix(gallery): correctly show status for downloading OCI images ( #5774 )
...
We can't use the mutate.Extract written bytes as current status as that
will be bigger than the compressed image size. Image manifest don't have
any guarantee of the type of artifact (can be compressed or not) when
showing the layer size.
Split the extraction process in two parts: Downloading and extracting as
a flattened system, in this way we can display the status of downloading
and extracting accordingly.
This change also fixes a small nuance in detecting installed backends,
now it's more consistent and looks if a metadata.json and/or a path with
a `run.sh` file is present.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-02 08:25:48 +02:00
LocalAI [bot]
7fdc006071
chore: ⬆️ Update ggml-org/llama.cpp to de569441470332ff922c23fb0413cc957be75b25 ( #5777 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-02 08:25:29 +02:00
LocalAI [bot]
615830245b
chore: ⬆️ Update ggml-org/whisper.cpp to bca021c9740b267c2973fba56555be052006023a ( #5776 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-02 08:24:58 +02:00
LocalAI [bot]
61376c0fa7
docs: ⬆️ update docs version mudler/LocalAI ( #5775 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-01 22:45:24 +00:00
Ettore Di Giacinto
d0fb23514f
Revert "fix(gallery): correctly show status for downloading OCI images"
...
This reverts commit 780d034ac9 .
2025-07-01 21:32:04 +02:00
Ettore Di Giacinto
780d034ac9
fix(gallery): correctly show status for downloading OCI images
...
We can't use the mutate.Extract written bytes as current status as that
will be bigger than the compressed image size. Image manifest don't have
any guarantee of the type of artifact (can be compressed or not) when
showing the layer size.
Split the extraction process in two parts: Downloading and extracting as
a flattened system, in this way we can display the status of downloading
and extracting accordingly.
This change also fixes a small nuance in detecting installed backends,
now it's more consistent and looks if a metadata.json and/or a path with
a `run.sh` file is present.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-01 19:56:28 +02:00
Ettore Di Giacinto
ec2a044c7e
chore(model gallery): add pinkpixel_crystal-think-v2 ( #5773 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-01 16:18:19 +02:00
Ettore Di Giacinto
ad6fdd21fd
chore(model gallery): add steelskull_l3.3-shakudo-70b ( #5772 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-01 16:15:22 +02:00
Ettore Di Giacinto
cd94e6b352
chore(model gallery): add thedrummer_anubis-70b-v1.1 ( #5771 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-01 13:54:29 +02:00
Richard Palethorpe
b37cef3718
fix: Diffusers and XPU fixes ( #5737 )
...
* fix(README): Add device flags for Intel/XPU
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(diffusers/xpu): Set device to XPU and ignore CUDA request when on Intel
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-07-01 12:36:17 +02:00
Dedy F. Setyawan
9f957d547d
fix(docs): Improve Header Responsiveness - Hide "Star us on GitHub!" on Mobile ( #5770 )
2025-07-01 12:15:16 +02:00
LocalAI [bot]
f0d9f0c5d8
chore: ⬆️ Update ggml-org/llama.cpp to 0a5a3b5cdfd887cf0f8e09d9ff89dee130cfcdde ( #5759 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-30 22:46:55 +00:00
LocalAI [bot]
d33e1c72a3
chore: ⬆️ Update ggml-org/llama.cpp to caf5681fcb47dfe9bafee94ef9aa8f669ac986c7 ( #5758 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-29 22:49:47 +00:00
Ettore Di Giacinto
33f9ee06c9
fix(gallery): automatically install model from name ( #5757 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-29 17:42:58 +02:00
Ettore Di Giacinto
c54677402d
chore(model gallery): add qwen3-33b-a3b-stranger-thoughts-abliterated-uncensored ( #5755 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-29 10:04:33 +02:00
LocalAI [bot]
3fe3a7b23d
chore: ⬆️ Update ggml-org/llama.cpp to 27208bf657cfe7262791df473927225e48efe482 ( #5753 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-29 09:06:37 +02:00
LocalAI [bot]
f8ff6fa1fd
docs: ⬆️ update docs version mudler/LocalAI ( #5752 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-28 22:17:49 +02:00
Ettore Di Giacinto
dfadc3696e
feat(llama.cpp): allow to set kv-overrides ( #5745 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-28 21:26:07 +02:00
Ettore Di Giacinto
dbcf5fb4fc
chore(model gallery): add gemma-3-4b-it-max-horror-uncensored-dbl-x-imatrix ( #5751 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-28 18:18:14 +02:00
Ettore Di Giacinto
2633137a17
chore(model gallery): add qwen3-22b-a3b-the-harley-quinn ( #5750 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-28 18:17:10 +02:00
Ettore Di Giacinto
d9c17dd23b
chore(model gallery): add mistral-small-3.2-46b-the-brilliant-raconteur-ii-instruct-2506 ( #5749 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-28 18:15:25 +02:00
Ettore Di Giacinto
d8b7bd4860
chore(model gallery): add qwen3-42b-a3b-stranger-thoughts-deep20x-abliterated-uncensored-i1 ( #5748 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-28 18:12:17 +02:00
Ettore Di Giacinto
a611cbc0f4
chore(model gallery): add qwen3-55b-a3b-total-recall-deep-40x ( #5747 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-28 17:54:32 +02:00
Ettore Di Giacinto
850b525159
chore(model gallery): add qwen3-55b-a3b-total-recall-v1.3-i1 ( #5746 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-28 17:47:46 +02:00
Ettore Di Giacinto
35b3426a2a
Update README.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-28 09:16:25 +02:00
LocalAI [bot]
cd2b0c0e7c
chore: ⬆️ Update ggml-org/llama.cpp to 72babea5dea56c8a8e8420ccf731b12a5cf37854 ( #5743 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-27 23:46:27 +02:00
LocalAI [bot]
73d80c43a8
chore: ⬆️ Update ggml-org/whisper.cpp to c88ffbf9baeaae8c2cc0a4f496618314bb2ee9e0 ( #5742 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-27 23:45:57 +02:00
LocalAI [bot]
665562b850
docs: ⬆️ update docs version mudler/LocalAI ( #5741 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-27 22:23:43 +02:00
Ettore Di Giacinto
7a78e4f482
fix(backends gallery): meta packages do not have URIs ( #5740 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-27 22:23:14 +02:00
Ettore Di Giacinto
6f41a6f934
fix(backends gallery): correctly identify gpu vendor ( #5739 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-27 22:22:58 +02:00
Ettore Di Giacinto
bb54f2da2b
feat(gallery): automatically install missing backends along models ( #5736 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-27 18:25:44 +02:00
Ettore Di Giacinto
e1cc7ee107
fix(ci): enable tag-latest to auto ( #5738 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-27 18:17:01 +02:00
Ettore Di Giacinto
cfc9dfa3d5
fix(ci): better handling of latest images for backends ( #5735 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-27 10:32:58 +02:00
LocalAI [bot]
6a650e68cb
chore: ⬆️ Update ggml-org/whisper.cpp to 32cf4e2aba799aff069011f37ca025401433cf9f ( #5733 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-26 22:09:26 +02:00
LocalAI [bot]
5e1373877a
chore: ⬆️ Update ggml-org/llama.cpp to 8846aace4934ad29651ea61b8c7e3f6b0556e3d2 ( #5734 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-26 22:09:03 +02:00
Ettore Di Giacinto
b5b0ab26e7
fix(ci): remove non-existant input from build matrix
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-26 21:42:27 +02:00
Ettore Di Giacinto
9725bb4bbd
chore(model gallery): add gemma-3n-e4b-it ( #5731 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-26 19:36:50 +02:00
Ettore Di Giacinto
33b4275bbc
chore(model gallery): add gemma-3n-e2b-it ( #5730 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-26 19:35:49 +02:00
Ettore Di Giacinto
6644af10c6
feat: ⚠️ reduce images size and stop bundling sources ( #5721 )
...
feat: reduce images size and stop bundling sources
Do not copy sources anymore, and reduce packages of the base images by
not using builder images.
If needed to rebuild, just build the container image from scratch by
following the docs. We will slowly try to migrate all backends to the
gallery to keep the core small.
This PR is a breaking change, it also sets the base folders to /models
and /backends instead of /build/models and /build/backends.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-26 18:41:38 +02:00
Ettore Di Giacinto
7c4a2e9b85
chore(ci): ⚠️ fix latest tag by using docker meta action ( #5722 )
...
chore(ci): fix latest tag by using docker meta action
Also uniform tagging names
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-26 18:40:25 +02:00
Ettore Di Giacinto
bcccee3909
fix(backends gallery): delete dangling dirs if installation failed ( #5729 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-26 17:38:03 +02:00
Ettore Di Giacinto
c6f50ddd0c
Revert "chore: ⬆️ Update leejet/stable-diffusion.cpp to 10c6501bd05a697e014f1bee3a84e5664290c489" ( #5727 )
...
Revert "chore: ⬆️ Update leejet/stable-diffusion.cpp to `10c6501bd05a…"
This reverts commit 30600dd5cb .
2025-06-26 13:25:25 +02:00
LocalAI [bot]
6613373b1b
chore: ⬆️ Update ggml-org/whisper.cpp to 4daf7050ca2bf17f5166f45ac6da651c4e33f293 ( #5725 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-26 13:00:20 +02:00
LocalAI [bot]
1659b3f795
chore: ⬆️ Update ggml-org/llama.cpp to 2bf9d539dd158345e3a3b096e16474af535265b4 ( #5724 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-26 12:59:57 +02:00
LocalAI [bot]
30600dd5cb
chore: ⬆️ Update leejet/stable-diffusion.cpp to 10c6501bd05a697e014f1bee3a84e5664290c489 ( #4925 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-25 22:45:15 +00:00
Ettore Di Giacinto
179fcf5541
chore(model gallery): add menlo_jan-nano-128k ( #5723 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-25 12:14:11 +02:00
LocalAI [bot]
9cb75086bb
chore: ⬆️ Update ggml-org/whisper.cpp to 0083335ba0e9d6becbe0958903b0a27fc2ebaeed ( #5718 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-25 09:47:33 +02:00
LocalAI [bot]
594bb462ab
chore: ⬆️ Update ggml-org/llama.cpp to 73e53dc834c0a2336cd104473af6897197b96277 ( #5719 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-24 22:47:48 +00:00
Ettore Di Giacinto
aa730a7b96
chore(model gallery): add delta-vector_austral-24b-winton ( #5717 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-24 18:37:28 +02:00
Ettore Di Giacinto
0a454c527a
chore(model gallery): add astrosage-70b ( #5716 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-24 18:34:37 +02:00
Ettore Di Giacinto
cf86bcb984
chore(model gallery): add skywork_skywork-swe-32b ( #5715 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-24 18:29:36 +02:00
Ettore Di Giacinto
a6d9988e84
feat(backend gallery): add meta packages ( #5696 )
...
* feat(backend gallery): add meta packages
So we can have meta packages such as "vllm" that automatically installs
the corresponding package depending on the GPU that is being currently
detected in the system.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat: use a metadata file
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-24 17:08:27 +02:00
Ettore Di Giacinto
f3a114342e
chore(model gallery): add mistralai_mistral-small-3.2-24b-instruct-2506 ( #5714 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-24 13:59:14 +02:00
LocalAI [bot]
0d275ccc03
chore: ⬆️ Update ggml-org/llama.cpp to ce82bd0117bd3598300b3a089d13d401b90279c7 ( #5712 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-24 08:37:32 +02:00
LocalAI [bot]
58dba3f01c
chore: ⬆️ Update ggml-org/whisper.cpp to a422176937c5bb20eb58d969995765f90d3c1a9b ( #5713 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-23 22:45:31 +00:00
kilavvy
b68d6e8088
Docs: Fix typos ( #5709 )
...
* Update GPU-acceleration.md
Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com >
* Update image-generation.md
Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com >
---------
Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com >
2025-06-23 18:15:06 +02:00
LocalAI [bot]
2352cec7e6
chore: ⬆️ Update ggml-org/llama.cpp to 238005c2dc67426cf678baa2d54c881701693288 ( #5710 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-22 22:43:41 +00:00
Ettore Di Giacinto
de72ae79b5
chore(model gallery): add ds-r1-qwen3-8b-arliai-rpr-v4-small-iq-imatrix ( #5708 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-22 09:05:55 +02:00
Ettore Di Giacinto
884c07d5f9
chore(model gallery): add allura-org_q3-8b-kintsugi ( #5707 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-22 09:04:49 +02:00
Ettore Di Giacinto
cca7cbef1e
chore(model gallery): add qwen3-the-xiaolong-omega-directive-22b-uncensored-abliterated-i1 ( #5706 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-22 09:01:08 +02:00
Ettore Di Giacinto
32cd0d03d4
chore(model gallery): add menlo_jan-nano ( #5705 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-22 08:57:33 +02:00
Ettore Di Giacinto
ee4d9e83d0
Update stalebot.yml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-22 08:51:13 +02:00
LocalAI [bot]
5547e08a30
chore: ⬆️ Update ggml-org/llama.cpp to aa0ef5c578eef4c2adc7be1282f21bab5f3e8d26 ( #5703 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-21 23:54:53 +02:00
LocalAI [bot]
ca7385c303
chore: ⬆️ Update ggml-org/whisper.cpp to e6c10cf3d5d60dc647eb6cd5e73d3c347149f746 ( #5702 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-21 23:54:28 +02:00
Ettore Di Giacinto
28759e79d3
chore(model gallery): add qwen3-the-josiefied-omega-directive-22b-uncensored-abliterated-i1 ( #5704 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-21 23:54:05 +02:00
Ettore Di Giacinto
40249b6b84
Update stalebot.yml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-21 22:38:23 +02:00
Ettore Di Giacinto
e09e47bada
chore(ci): add stale bot ( #5700 )
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-21 20:12:08 +02:00
Ettore Di Giacinto
3796558aeb
Update quickstart.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-21 20:11:57 +02:00
LocalAI [bot]
cca4f010f8
chore: ⬆️ Update ggml-org/llama.cpp to 06cbedfca1587473df9b537f1dd4d6bfa2e3de13 ( #5697 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-20 22:44:39 +00:00
Ettore Di Giacinto
be3ff482d0
chore(ci): try to optimize disk space when tagging latest ( #5695 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-20 15:54:14 +02:00
LocalAI [bot]
af255cd0be
chore: ⬆️ Update ggml-org/llama.cpp to 8f71d0f3e86ccbba059350058af8758cafed73e6 ( #5692 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-20 15:53:55 +02:00
LocalAI [bot]
8000228d1b
chore: ⬆️ Update ggml-org/whisper.cpp to 3e65f518ddf840b13b74794158aa95a2c8aa30cc ( #5691 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-20 15:53:38 +02:00
Ettore Di Giacinto
79abe0ad77
Drop latest references to extras images
2025-06-20 15:51:16 +02:00
Ettore Di Giacinto
8131d11d1f
Update quickstart.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-19 22:42:38 +02:00
LocalAI [bot]
beb01c91f3
docs: ⬆️ update docs version mudler/LocalAI ( #5690 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-19 22:13:16 +02:00
Ettore Di Giacinto
1ccd64ff6a
chore: drop extras references from docs
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-19 22:04:28 +02:00
Ettore Di Giacinto
fc7681c68c
Update README.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-19 21:46:09 +02:00
Ettore Di Giacinto
49d026a229
Update backends.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-19 19:47:09 +02:00
leopardracer
f9b968e19d
Fix Typos and Improve Clarity in GPU Acceleration Documentation ( #5688 )
...
Update GPU-acceleration.md
Signed-off-by: leopardracer <136604165+leopardracer@users.noreply.github.com >
2025-06-19 15:41:13 +02:00
LocalAI [bot]
022d4a5ecb
chore: ⬆️ Update ggml-org/whisper.cpp to ecb8f3c2b4e282d5ef416516bcbfb92821f06bf6 ( #5686 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-19 08:23:42 +02:00
LocalAI [bot]
0e917eb01d
chore: ⬆️ Update ggml-org/llama.cpp to 8d947136546773f6410756f37fcc5d3e65b8135d ( #5685 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-19 08:23:23 +02:00
Ettore Di Giacinto
efde0eaf83
feat(backend gallery): display download progress ( #5687 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-18 23:49:44 +02:00
Maxim Evtush
add8fc35a2
Fix Typos in Documentation and Python Comments ( #5658 )
...
* Update istftnet.py
Signed-off-by: Maxim Evtush <154841002+maximevtush@users.noreply.github.com >
* Update GPU-acceleration.md
Signed-off-by: Maxim Evtush <154841002+maximevtush@users.noreply.github.com >
---------
Signed-off-by: Maxim Evtush <154841002+maximevtush@users.noreply.github.com >
2025-06-18 22:11:13 +02:00
Ettore Di Giacinto
9bcf4c56f1
fix(backends gallery): propagate p2p settings to correctly draw menu ( #5684 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-18 22:06:12 +02:00
Ettore Di Giacinto
3fcfaec7c8
chore(ci): move also other jobs to public runner ( #5683 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-18 22:00:12 +02:00
Ettore Di Giacinto
a463d40a3e
chore(ci): try to use public runners also for release builds ( #5681 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-18 21:51:54 +02:00
Ettore Di Giacinto
1e1f0ee321
chore(backends): move bark-cpp to the backend gallery ( #5682 )
...
chore(bark-cpp): move outside from binary
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-18 19:48:50 +02:00
Ettore Di Giacinto
80b3139fa0
Update landing.yaml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-18 19:48:17 +02:00
LocalAI [bot]
5173d37acb
chore: ⬆️ Update ggml-org/llama.cpp to 860a9e4eeff3eb2e7bd1cc38f65787cc6c8177af ( #5678 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-18 10:01:46 +02:00
LocalAI [bot]
470e48a900
chore: ⬆️ Update ggml-org/whisper.cpp to f3ff80ea8da044e5b8833e7ba54ee174504c518d ( #5677 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-18 10:01:08 +02:00
Ettore Di Giacinto
b706dddc93
chore(ci): switch to public runners for base images ( #5680 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-17 22:38:50 +02:00
Ettore Di Giacinto
867db3f888
chore(docs): add backend url
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-17 22:35:21 +02:00
Ettore Di Giacinto
b79aa31398
chore: move backends docs
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-17 22:26:40 +02:00
Ettore Di Giacinto
fb9a09d49c
chore(backend gallery): add description for remaining backends ( #5679 )
...
* chore(backend gallery): add description for remaining backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(backend gallery): add linter
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-17 22:21:44 +02:00
Ettore Di Giacinto
0a78f0ad2d
chore(backend gallery): re-order and add description for vLLM ( #5676 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-17 17:31:53 +02:00
Ettore Di Giacinto
d68660bd5a
chore(deps): bump llama.cpp to 'e434e69183fd9e1031f4445002083178c331a28b ( #5665 )
...
chore(deps): bump llama.cpp to 'e434e69183fd9e1031f4445002083178c331a28b'
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-17 17:00:10 +02:00
LocalAI [bot]
30ceee2dec
chore: ⬆️ Update ggml-org/whisper.cpp to 2a4d6db7d90899aff3d58d70996916968e4e0d27 ( #5661 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-17 09:21:05 +02:00
dependabot[bot]
18c38335fc
chore(deps): bump securego/gosec from 2.22.4 to 2.22.5 ( #5663 )
...
Bumps [securego/gosec](https://github.com/securego/gosec ) from 2.22.4 to 2.22.5.
- [Release notes](https://github.com/securego/gosec/releases )
- [Changelog](https://github.com/securego/gosec/blob/master/.goreleaser.yml )
- [Commits](https://github.com/securego/gosec/compare/v2.22.4...v2.22.5 )
---
updated-dependencies:
- dependency-name: securego/gosec
dependency-version: 2.22.5
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-06-16 23:12:27 +00:00
Ettore Di Giacinto
89040ff6f7
fix: add python symlink, use absolute python env path when running backends ( #5664 )
...
* fix: add python symlink, use absolute python env path when running backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(ci): do not push images when building PRs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-16 23:00:53 +02:00
Ettore Di Giacinto
de343700fd
Don't run python_backend workflow on PR
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-16 11:06:56 +02:00
Ettore Di Giacinto
87d18ad951
chore: Add python3 to images ( #5660 )
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-16 11:05:44 +02:00
Ettore Di Giacinto
912c8eff04
chore(ci): use public runner for extra backends ( #5657 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-16 08:21:18 +02:00
LocalAI [bot]
481f30bde8
chore: ⬆️ Update ggml-org/llama.cpp to 30e5b01de2a0bcddc7c063c8ef0802703a958417 ( #5659 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-15 23:03:40 +00:00
Ettore Di Giacinto
236ac30252
chore(ci): do not specify image-type anymore
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-15 17:28:40 +02:00
Ettore Di Giacinto
6f761e62e4
update README
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-15 16:06:43 +02:00
FT
1f29b5f38e
Fix Typos and Improve Documentation Clarity ( #5648 )
...
* Update p2p.go
Signed-off-by: FT <140458077+zeevick10@users.noreply.github.com >
* Update GPU-acceleration.md
Signed-off-by: FT <140458077+zeevick10@users.noreply.github.com >
---------
Signed-off-by: FT <140458077+zeevick10@users.noreply.github.com >
2025-06-15 16:04:44 +02:00
LocalAI [bot]
33d702c5e0
chore: ⬆️ Update ggml-org/llama.cpp to 3cb203c89f60483e349f841684173446ed23c28f ( #5644 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-15 16:03:13 +02:00
Ettore Di Giacinto
95ff236127
ci: do not fire python_backend on PRs
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-15 16:02:30 +02:00
Ettore Di Giacinto
2d64269763
feat: Add backend gallery ( #5607 )
...
* feat: Add backend gallery
This PR add support to manage backends as similar to models. There is
now available a backend gallery which can be used to install and remove
extra backends.
The backend gallery can be configured similarly as a model gallery, and
API calls allows to install and remove new backends in runtime, and as
well during the startup phase of LocalAI.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add backends docs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* wip: Backend Dockerfile for python backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat: drop extras images, build python backends separately
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixup on all backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* test CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Tweaks
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Drop old backends leftovers
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixup CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Move dockerfile upper
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fix proto
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Feature dropped for consistency - we prefer model galleries
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add missing packages in the build image
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* exllama is ponly available on cublas
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* pin torch on chatterbox
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups to index
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Debug CI
* Install accellerators deps
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add target arch
* Add cuda minor version
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Use self-hosted runners
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* ci: use quay for test images
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixups for vllm and chatterbox
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Small fixups on CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chatterbox is only available for nvidia
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Simplify CI builds
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Adapt test, use qwen3
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(model gallery): add jina-reranker-v1-tiny-en-gguf
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(gguf-parser): recover from potential panics that can happen while reading ggufs with gguf-parser
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Use reranker from llama.cpp in AIO images
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Limit concurrent jobs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-15 14:56:52 +02:00
LocalAI [bot]
a7a6020328
chore: ⬆️ Update ggml-org/whisper.cpp to 705db0f728310c32bc96f4e355e2b18076932f75 ( #5643 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-15 08:39:00 +02:00
Ettore Di Giacinto
40618164b2
chore: improve tests ( #5646 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-14 10:07:05 +02:00
fuder.eth
eb8c29f90a
Minor Documentation Updates: Clarified Comments in Python and Go Files ( #5641 )
...
* Update ui.go
Signed-off-by: fuder.eth <139509124+vtjl10@users.noreply.github.com >
* Update backend.py
Signed-off-by: fuder.eth <139509124+vtjl10@users.noreply.github.com >
---------
Signed-off-by: fuder.eth <139509124+vtjl10@users.noreply.github.com >
2025-06-13 19:55:25 +02:00
Gavin Mogan
63116a2c6a
docs: Update docs metadata headers so when mentioned on slack it doesn't say hugo ( #5642 )
...
Update docs metadata headers so when mentioned on slack it doesn't say hugo
Signed-off-by: Gavin Mogan <github@gavinmogan.com >
2025-06-13 19:54:57 +02:00
LocalAI [bot]
311c2cf539
chore: ⬆️ Update ggml-org/llama.cpp to ed52f3668e633423054a4eab61bb7efee47025ab ( #5636 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-12 23:33:33 +02:00
Ettore Di Giacinto
a6fcbd991d
chore(model gallery): add yanfei-v2-qwen3-32b ( #5639 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-12 22:24:13 +02:00
kilavvy
2e1dc8deef
Fix Typos in Comments and Error Messages ( #5637 )
...
* Update initializers.go
Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com >
* Update base.go
Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com >
---------
Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com >
2025-06-12 18:34:32 +02:00
LocalAI [bot]
282e017b22
chore: ⬆️ Update ggml-org/whisper.cpp to ebbc874e85b518f963a87612f6d79f5c71a55e84 ( #5635 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-11 23:47:00 +02:00
Ettore Di Giacinto
f86cb8be2d
chore(model gallery): add qwen3-embedding-0.6b ( #5634 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:40:41 +02:00
Ettore Di Giacinto
5c56ec4f87
chore(model gallery): add qwen3-embedding-8b ( #5633 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:38:44 +02:00
Ettore Di Giacinto
dd2845a034
chore(model gallery): add qwen3-embedding-4b ( #5632 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:31:43 +02:00
Ettore Di Giacinto
2e7db014b6
chore(model gallery): add openbuddy_openbuddy-r1-0528-distill-qwen3-32b-preview0-qat ( #5631 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:27:30 +02:00
Ettore Di Giacinto
6faeee1d92
chore(model gallery): add baai_robobrain2.0-7b ( #5630 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:17:32 +02:00
Ettore Di Giacinto
31d73eb934
chore(model gallery): add mistralai_magistral-small-2506 ( #5629 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:11:44 +02:00
Ettore Di Giacinto
60863b9e52
chore(model gallery): add sophosympatheia_strawberrylemonade-l3-70b-v1.0 ( #5628 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:08:17 +02:00
Ettore Di Giacinto
a9fc71e2f3
chore(model gallery): add kwaipilot_kwaicoder-autothink-preview ( #5627 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:06:38 +02:00
leopardracer
ce9a9a30e0
Improve Comments and Documentation for MixedMode and ParseJSON Functions ( #5626 )
...
Update parse.go
Signed-off-by: leopardracer <136604165+leopardracer@users.noreply.github.com >
2025-06-11 09:46:53 +02:00
LocalAI [bot]
2693a21da5
chore: ⬆️ Update ggml-org/whisper.cpp to 2679bec6e09231c6fd59715fcba3eebc9e2f6076 ( #5625 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-11 09:35:28 +02:00
LocalAI [bot]
d460eab18e
chore: ⬆️ Update ggml-org/llama.cpp to 3678b838bb71eaccbaeb479ff38c2e12bfd2f960 ( #5620 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-11 09:00:39 +02:00
LocalAI [bot]
c61e5fe266
chore: ⬆️ Update ggml-org/whisper.cpp to d78f08142381c1460604713e2f2ddf3331c7d816 ( #5619 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-10 17:29:58 +02:00
Ettore Di Giacinto
88e570b5de
fix(deps): pin grpcio ( #5621 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-10 14:21:51 +02:00
Ettore Di Giacinto
6efa97ce0b
chore(model gallery): add qwen2.5-omni-3b ( #5606 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-09 10:54:42 +02:00
LocalAI [bot]
41cde5468a
chore: ⬆️ Update ggml-org/llama.cpp to 247e5c6e447707bb4539bdf1913d206088a8fc69 ( #5605 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-09 00:11:46 +02:00
Richard Palethorpe
d650647db9
fix(realtime): Use updated model on session update ( #5604 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-06-09 00:11:05 +02:00
LocalAI [bot]
5bc7ef37a2
chore: ⬆️ Update ggml-org/llama.cpp to 5787b5da57e54dba760c2deeac1edf892e8fc450 ( #5601 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-08 08:44:24 +02:00
Ettore Di Giacinto
e0a52807c8
chore(model gallery): add akhil-theerthala_kuvera-8b-v0.1.0 ( #5600 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-07 08:59:20 +02:00
LocalAI [bot]
1a95a19f87
chore: ⬆️ Update ggml-org/llama.cpp to 745aa5319b9930068aff5e87cf5e9eef7227339b ( #5598 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-07 08:59:05 +02:00
LocalAI [bot]
bcfc08e5bf
chore: ⬆️ Update ggml-org/whisper.cpp to b175baa665bc35f97a2ca774174f07dfffb84e19 ( #5597 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-07 08:57:52 +02:00
Ettore Di Giacinto
4d282ca963
chore(model gallery): add nbeerbower_qwen3-gutenberg-encore-14b ( #5596 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-06 10:20:48 +02:00
Ettore Di Giacinto
525f49b69d
chore(model gallery): add open-thoughts_openthinker3-7b ( #5595 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-06 10:14:00 +02:00
LocalAI [bot]
786aa1de05
chore: ⬆️ Update ggml-org/llama.cpp to 1caae7fc6c77551cb1066515e0f414713eebb367 ( #5593 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-06 00:10:02 +02:00
Ettore Di Giacinto
ea82deb16b
chore(model gallery): add ultravox-v0_5-llama-3_1-8b ( #5592 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-05 19:23:51 +02:00
Ettore Di Giacinto
b0891309ba
chore(model gallery): add ultravox-v0_5-llama-3_2-1b ( #5591 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-05 19:22:01 +02:00
Ettore Di Giacinto
b034cff149
feat: improve RAM estimation by using values from summary ( #5525 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-05 19:16:26 +02:00
Ettore Di Giacinto
432f34f001
chore(model gallery): add goekdeniz-guelmez_josiefied-qwen3-14b-abliterated-v3 ( #5590 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-05 19:16:04 +02:00
Gavin Mogan
cbd61dccd4
fix(install.sh): vulkan docker tag ( #5589 )
...
vulkan docker tag is not prefixed with gpu
```
regctl tag ls localai/localai | grep 2.29 | grep vulkan
v2.29.0-vulkan
```
Signed-off-by: Gavin Mogan <github@gavinmogan.com >
2025-06-05 08:12:16 +02:00
LocalAI [bot]
0de0817d71
chore: ⬆️ Update ggml-org/whisper.cpp to 799eacdde40b3c562cfce1508da1354b90567f8f ( #5586 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-05 08:11:38 +02:00
LocalAI [bot]
bf57d6e5ac
chore: ⬆️ Update ggml-org/llama.cpp to 0d3984424f2973c49c4bcabe4cc0153b4f90c601 ( #5585 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-05 08:11:12 +02:00
Ettore Di Giacinto
0b9603e010
chore(model gallery): add deepseek-ai_deepseek-r1-0528-qwen3-8b ( #5580 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-04 15:28:45 +02:00
Ettore Di Giacinto
8d925217f6
chore(model gallery): add e-n-v-y_legion-v2.1-llama-70b-elarablated-v0.8-hf ( #5579 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-04 11:12:37 +02:00
Ettore Di Giacinto
669a1ccae6
chore(model gallery): add nvidia_nemotron-research-reasoning-qwen-1.5b ( #5578 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-04 11:07:10 +02:00
Ettore Di Giacinto
7a7d36ad63
chore(model gallery): add arcee-ai_homunculus ( #5577 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-04 10:02:15 +02:00
Ettore Di Giacinto
8b889955b4
chore(deps): bump pytorch to 2.7 in vllm ( #5576 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-04 08:56:45 +02:00
dependabot[bot]
a226555949
chore(deps): bump GrantBirki/git-diff-action from 2.8.0 to 2.8.1 ( #5564 )
...
Bumps [GrantBirki/git-diff-action](https://github.com/grantbirki/git-diff-action ) from 2.8.0 to 2.8.1.
- [Release notes](https://github.com/grantbirki/git-diff-action/releases )
- [Commits](https://github.com/grantbirki/git-diff-action/compare/v2.8.0...v2.8.1 )
---
updated-dependencies:
- dependency-name: GrantBirki/git-diff-action
dependency-version: 2.8.1
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-06-04 08:41:47 +02:00
LocalAI [bot]
f38f17865a
chore: ⬆️ Update ggml-org/whisper.cpp to 82f461eaa4e6a1ba29fc0dbdaa415a9934ee8a1d ( #5575 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-04 08:41:26 +02:00
LocalAI [bot]
03f380701b
chore: ⬆️ Update ggml-org/llama.cpp to 7e00e60ef86645a01fda738fef85b74afa016a34 ( #5574 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-04 08:26:36 +02:00
Ettore Di Giacinto
65e2866c97
fix(chatterbox): install only with cuda 12 ( #5573 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-03 14:57:47 +02:00
Ettore Di Giacinto
cd3cd899ad
chore(deps): bump llama.cpp to '363757628848a27a435bbf22ff9476e9aeda5f40' ( #5571 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-03 12:19:16 +02:00
LocalAI [bot]
c2ae3100e7
chore: ⬆️ Update ggml-org/whisper.cpp to e05af2457b7b4134ee626dc044294a19b096e62f ( #5569 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-03 11:29:18 +02:00
Ettore Di Giacinto
ec0868e691
chore(deps): bump grpcio from 1.72.0 to 1.72.1 ( #5570 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-03 09:59:43 +02:00
Ettore Di Giacinto
489c289916
Revert "fix(ci): try to add different mirrors to avoid 403 issues" ( #5555 )
...
Revert "fix(ci): try to add different mirrors to avoid 403 issues (#5554 )"
This reverts commit 7c9f011d91 .
2025-06-02 08:46:29 +02:00
LocalAI [bot]
ac5fb50bcc
chore: ⬆️ Update ggml-org/whisper.cpp to 7fd6fa809749078aa00edf945e959c898f2bd1af ( #5556 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-02 08:45:47 +02:00
Ettore Di Giacinto
7c9f011d91
fix(ci): try to add different mirrors to avoid 403 issues ( #5554 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-01 08:48:53 +02:00
Ettore Di Giacinto
80f7f17843
chore(deps): bump llama.cpp to 'e562eece7cb476276bfc4cbb18deb7c0369b2233' ( #5552 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-31 12:46:32 +02:00
LocalAI [bot]
f0c41d6405
chore: ⬆️ Update ggml-org/whisper.cpp to 98dfe8dc264b7d0d1daccfff9a9c043bcc2ece4b ( #5542 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-31 08:51:15 +02:00
Ettore Di Giacinto
8472321a81
feat(ui): display thinking tags appropriately ( #5540 )
...
* fix(streaming): stream complete runes
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat(ui): display thinking tags separately
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-05-31 08:50:46 +02:00
Ettore Di Giacinto
3bac4724ac
fix(streaming): stream complete runes ( #5539 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-31 08:48:05 +02:00
Ettore Di Giacinto
59db154cbc
feat(ui): allow to upload PDF and text files, also add support to multiple input files ( #5538 )
...
* Support file inputs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat: support multiple files
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* show preview of files
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-31 08:47:48 +02:00
Ettore Di Giacinto
1cc4525f15
fix: adapt test to error changes
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-30 17:43:59 +02:00
Ettore Di Giacinto
45c58752e5
feat(ui): add audio upload button in chat view ( #5526 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-30 16:47:31 +02:00
Ettore Di Giacinto
d5c9c717b5
feat(chatterbox): add new backend ( #5524 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-30 10:52:55 +02:00
Ettore Di Giacinto
dd7fa6b9f7
chore(deps): bump llama.cpp to 'e83ba3e460651b20a594e9f2f0f0bffb998d3ce1 ( #5527 )
...
chore(deps): bump llama.cpp to 'e83ba3e460651b20a594e9f2f0f0bffb998d3ce1'
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-30 10:29:01 +02:00
LocalAI [bot]
039c318607
chore: ⬆️ Update ggml-org/whisper.cpp to e5e900dd00747f747143ad30a697c8f21ddcd59e ( #5522 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-30 08:34:52 +02:00
Ettore Di Giacinto
0870bf5af6
fix(input): handle correctly case where we pass by string list as inputs ( #5521 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-29 22:06:42 +02:00
Ettore Di Giacinto
6073b9944e
chore(model gallery): add moondream2-20250414 ( #5518 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-29 10:47:11 +02:00
LocalAI [bot]
ef0e0f3777
chore: ⬆️ Update ggml-org/whisper.cpp to 1f5fdbecb411a61b8576242e5170c5ecef24b05a ( #5515 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-29 09:45:23 +02:00
LocalAI [bot]
b7de9e0aa0
chore: ⬆️ Update ggml-org/llama.cpp to d98f2a35fcf4a8d3e660ad48cd19e2a1f3d5b2ef ( #5514 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-29 09:44:36 +02:00
Ettore Di Giacinto
39292407a1
chore(model gallery): add pku-ds-lab_fairyr1-32b ( #5517 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-29 09:43:45 +02:00
Ettore Di Giacinto
f257bf8d14
chore(model gallery): add pku-ds-lab_fairyr1-14b-preview ( #5516 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-29 09:37:08 +02:00
Ettore Di Giacinto
8ca2fb5ef1
chore(model gallery): add qwen2.5-omni-7b ( #5513 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-28 18:15:09 +02:00
LocalAI [bot]
3a790fed13
chore: ⬆️ Update ggml-org/whisper.cpp to 0ed00d9d30e8c984936ff9ed9a4fcd475d6d82e5 ( #5510 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-28 09:00:22 +02:00
LocalAI [bot]
a334f28a07
chore: ⬆️ Update ggml-org/llama.cpp to a3c30846e410c91c11d7bf80978795a03bb03dee ( #5509 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-28 01:39:38 +00:00
Ettore Di Giacinto
dc6663d121
fix(template): we do not always have .Name ( #5508 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-27 18:44:24 +02:00
LocalAI [bot]
103caf9823
chore: ⬆️ Update ggml-org/llama.cpp to a26c4cc11ec7c6574e3691e90ecdbd67deeea35b ( #5500 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-27 17:13:55 +02:00
Ettore Di Giacinto
4226d2d837
Update index.yaml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-05-27 10:24:37 +02:00
Ettore Di Giacinto
7434256fc9
chore(model gallery): add ms-24b-mullein-v0 ( #5506 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-27 10:14:52 +02:00
Ettore Di Giacinto
86a0563ae1
chore(model gallery): add llama3-24b-mullein-v1 ( #5505 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-27 10:13:40 +02:00
Ettore Di Giacinto
c68951cbfe
chore(model gallery): add mrm8488_qwen3-14b-ft-limo ( #5504 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-27 10:04:16 +02:00
Ettore Di Giacinto
8408084120
chore(model gallery): add luckyrp-24b ( #5503 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-27 10:02:25 +02:00
Ettore Di Giacinto
0f2f4c7e23
chore(model gallery): add allura-org_q3-30b-a3b-designant ( #5502 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-27 09:59:56 +02:00
Ettore Di Giacinto
5ffad3b004
chore(deps): remove pin on transformers ( #5501 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-27 09:24:27 +02:00
Ettore Di Giacinto
e5ccd97b8c
Update README.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-05-26 20:00:31 +02:00
LocalAI [bot]
a3b08d46ec
chore: ⬆️ Update ggml-org/whisper.cpp to ea9f206f18d86c4eb357db9fdc52e4d9dc24435e ( #5464 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-05-26 19:56:44 +02:00
Ettore Di Giacinto
090f5065fc
chore(deps): bump llama.cpp to 'fef693dc6b959a8e8ba11558fbeaad0b264dd457' ( #5467 )
...
Also try to use a smaller model for integration tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-26 17:19:46 +02:00
Ettore Di Giacinto
88de2ea01a
feat(llama.cpp): add support for audio input ( #5466 )
...
* feat(llama.cpp): add support for audio input
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Adapt tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-26 16:06:03 +02:00
Ettore Di Giacinto
9650d490d4
chore(model gallery): add nvidia_acereason-nemotron-14b ( #5463 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-26 10:08:10 +02:00
Ettore Di Giacinto
4de1c83764
chore(model gallery): add allura-org_q3-30b-a3b-pentiment ( #5462 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-26 09:46:44 +02:00
Ettore Di Giacinto
e5978dc714
chore(model gallery): add medgemma-27b-text-it ( #5461 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-26 09:44:13 +02:00
Ettore Di Giacinto
f784986e19
chore(model gallery): add medgemma-4b-it ( #5460 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-26 09:41:09 +02:00
Richard Palethorpe
bf6426aef2
feat: Realtime API support reboot ( #5392 )
...
* feat(realtime): Initial Realtime API implementation
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore: go mod tidy
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* feat: Implement transcription only mode for realtime API
Reduce the scope of the real time API for the initial realease and make
transcription only mode functional.
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* chore(build): Build backends on a separate layer to speed up core only changes
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Signed-off-by: Richard Palethorpe <io@richiejp.com >
Co-authored-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-25 22:25:05 +02:00
LocalAI [bot]
4a91950848
chore: ⬆️ Update ggml-org/llama.cpp to d13d0f6135803822ec1cd7e3efb49360b88a1bdf ( #5448 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-24 08:50:41 +02:00
LocalAI [bot]
4614ea1685
chore: ⬆️ Update ggml-org/whisper.cpp to 13d92d08ae26031545921243256aaaf0ee057943 ( #5449 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-23 23:44:06 +00:00
Ettore Di Giacinto
f0bf59d1d9
chore(model gallery): add vulpecula-4b ( #5445 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-23 09:51:21 +02:00
Ettore Di Giacinto
83dd678959
chore(model gallery): add whiterabbitneo_whiterabbitneo-v3-7b ( #5444 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-23 09:46:28 +02:00
Ettore Di Giacinto
9d6c9f874a
chore(model gallery): add arliai_qwq-32b-arliai-rpr-v4 ( #5443 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-23 09:34:33 +02:00
LocalAI [bot]
c62f2bb336
chore: ⬆️ Update ggml-org/llama.cpp to 8a1d206f1d2b4e45918b589f3165b4be232f7ba8 ( #5440 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-23 09:22:38 +02:00
LocalAI [bot]
38aeca6f9c
chore: ⬆️ Update ggml-org/whisper.cpp to 78b31ca7824500e429ba026c1a9b48e0b41c50cb ( #5439 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-23 06:22:41 +00:00
Ettore Di Giacinto
3b0cf52f6a
feat(llama.cpp): add reranking ( #5396 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-22 21:49:30 +02:00
LocalAI [bot]
bac3022044
chore: ⬆️ Update ggml-org/whisper.cpp to bd1cb0c8e3a04baa411dc12c1325b6a9f12ee7f4 ( #5424 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-05-22 21:49:06 +02:00
LocalAI [bot]
cd41701524
chore: ⬆️ Update ggml-org/llama.cpp to 8e186ef0e764c7a620e402d1f76ebad60bf31c49 ( #5423 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-05-22 21:48:51 +02:00
Ettore Di Giacinto
6a382a1afe
fix(transformers): try to pin to working release ( #5426 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-22 12:50:51 +02:00
Ettore Di Giacinto
8dcab2f9c7
chore(scripts): allow to specify quants ( #5430 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-22 11:53:30 +02:00
Ettore Di Giacinto
1d1d5627f0
chore(model gallery): add delta-vector_archaeo-12b-v2 ( #5429 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-22 11:38:48 +02:00
Ettore Di Giacinto
233b3369ad
chore(model gallery): add mistralai_devstral-small-2505 ( #5428 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-22 11:37:17 +02:00
Ettore Di Giacinto
c587ac0aef
chore(model gallery): add nvidia_llama-3.1-nemotron-nano-4b-v1.1 ( #5427 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-22 11:33:33 +02:00
David Thole
38c5d16b57
feat(docs): updating the documentation on fine tuning and advanced guide. ( #5420 )
...
updating the documentation on fine tuning and advanced guide. This mirrors how modern version of llama.cpp operate
2025-05-21 19:11:00 +02:00
LocalAI [bot]
ef6fc052eb
chore: ⬆️ Update ggml-org/llama.cpp to b7a17463ec190aeee7b9077c606c910fb4688b84 ( #5399 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-21 09:06:09 +02:00
LocalAI [bot]
7ff35c08ac
chore: ⬆️ Update ggml-org/whisper.cpp to 62dc8f7d7b72ca8e75c57cd6a100712c631fa5d5 ( #5398 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-21 09:00:42 +02:00
LocalAI [bot]
43f75ee7f3
chore(model-gallery): ⬆️ update checksum ( #5422 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-21 03:52:39 +00:00
Ettore Di Giacinto
82811a9630
fix(transformers): pin protobuf ( #5421 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 20:28:31 +02:00
Ettore Di Giacinto
04a3d8e5ac
feat(ui): add error page to display errors ( #5418 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 12:17:27 +02:00
Ettore Di Giacinto
9af09b3f8c
chore(model gallery): fixup
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 12:17:21 +02:00
Ettore Di Giacinto
0d590a4044
chore(model gallery): add smolvlm2-256m-video-instruct ( #5417 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 12:03:02 +02:00
Ettore Di Giacinto
e0a54de4f5
chore(model gallery): add smolvlm2-500m-video-instruct ( #5416 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 11:42:30 +02:00
Ettore Di Giacinto
6bc2ae5467
chore(model gallery): add smolvlm2-2.2b-instruct ( #5415 )
...
chore(model gallery): add smolvlm-instruct
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 11:36:22 +02:00
Ettore Di Giacinto
8caaf49f5d
chore(model gallery): add smolvlm-instruct ( #5414 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 11:35:01 +02:00
Ettore Di Giacinto
1db51044bb
chore(model gallery): add smolvlm-500m-instruct ( #5413 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 11:25:32 +02:00
Ettore Di Giacinto
ec21b58008
chore(model gallery): add smolvlm-256m-instruct ( #5412 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 11:15:09 +02:00
Ettore Di Giacinto
996259b529
chore(model gallery): add facebook_kernelllm ( #5411 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 10:31:09 +02:00
Ettore Di Giacinto
f2942cc0e1
chore(model gallery): add thedrummer_valkyrie-49b-v1 ( #5410 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-20 10:28:27 +02:00
Ettore Di Giacinto
f8fbfd4fa3
chore(model gallery): add a-m-team_am-thinking-v1 ( #5395 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-19 17:31:38 +02:00
Ettore Di Giacinto
41e239c67e
chore(model gallery): add soob3123_grayline-qwen3-8b ( #5394 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-19 17:02:43 +02:00
Ettore Di Giacinto
587827e779
chore(model gallery): add soob3123_grayline-qwen3-14b ( #5393 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-19 15:59:07 +02:00
LocalAI [bot]
456b4982ef
chore: ⬆️ Update ggml-org/llama.cpp to 6a2bc8bfb7cd502e5ebc72e36c97a6f848c21c2c ( #5390 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-19 01:25:22 +00:00
Ettore Di Giacinto
159388cce8
chore: memoize detected GPUs ( #5385 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-18 08:55:44 +02:00
LocalAI [bot]
cfc73c7773
chore: ⬆️ Update ggml-org/llama.cpp to e3a7cf6c5bf6a0a24217f88607b06e4405a2b5d9 ( #5384 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-18 01:21:13 +00:00
Ettore Di Giacinto
6d5bde860b
feat(llama.cpp): upgrade and use libmtmd ( #5379 )
...
* WIP
* wip
* wip
* Make it compile
* Update json.hpp
* this shouldn't be private for now
* Add logs
* Reset auto detected template
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Re-enable grammars
* This seems to be broken - 360a9c98e1 (diff-a18a8e64e12a01167d8e98fc) […]cccf0d4eed09d76d879L2998-L3207
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Placeholder
* Simplify image loading
* use completion type
* disable streaming
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* correctly return timings
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Remove some debug logging
* Adapt tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Keep header
* embedding: do not use oai type
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Sync from server.cpp
* Use utils and json directly from llama.cpp
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Sync with upstream
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: copy json.hpp from the correct location
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: add httplib
* sync llama.cpp
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Embeddiongs: set OAICOMPAT_TYPE_EMBEDDING
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat: sync with server.cpp by including it
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* make it darwin-compatible
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-17 16:02:53 +02:00
LocalAI [bot]
6ef383033b
chore: ⬆️ Update ggml-org/whisper.cpp to d1f114da61b1ae1e70b03104fad42c9dd666feeb ( #5381 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-17 00:35:17 +00:00
Richard Palethorpe
cd494089d9
fix(flux): Set CFG=1 so that prompts are followed ( #5378 )
...
The recommendation with Flux is to set CFG to 1 as shown in the
stablediffusion-cpp README.
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-05-16 17:53:54 +02:00
LocalAI [bot]
3033845f94
chore: ⬆️ Update ggml-org/whisper.cpp to 20a20decd94badfd519a07ea91f0bba8b8fc4dea ( #5374 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-16 12:46:16 +02:00
omahs
0f365ac204
fix: typos ( #5376 )
...
Signed-off-by: omahs <73983677+omahs@users.noreply.github.com >
2025-05-16 12:45:48 +02:00
Ettore Di Giacinto
525cf198be
chore(model gallery): add primeintellect_intellect-2 ( #5373 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-15 10:53:52 +02:00
Ettore Di Giacinto
658c2a4f55
chore(model gallery): add thedrummer_rivermind-lux-12b-v1 ( #5372 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-15 10:51:55 +02:00
Ettore Di Giacinto
c987de090d
chore(model gallery): add thedrummer_snowpiercer-15b-v1 ( #5371 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-15 10:04:44 +02:00
Ettore Di Giacinto
04365843e6
chore(model gallery): add skywork_skywork-or1-7b ( #5370 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-15 10:02:07 +02:00
Ettore Di Giacinto
1dc5781679
chore(model gallery): add skywork_skywork-or1-32b ( #5369 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-15 09:58:51 +02:00
LocalAI [bot]
30704292de
chore: ⬆️ Update ggml-org/whisper.cpp to f389d7e3e56bbbfec49fd333551927a0fcbb7213 ( #5367 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-15 00:34:16 +00:00
Ettore Di Giacinto
e52c66c76e
chore(docs/install.sh): image changes ( #5354 )
...
chore(docs): image changes
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-14 19:28:30 +02:00
LocalAI [bot]
cb28aef93b
chore: ⬆️ Update ggml-org/whisper.cpp to f89056057511a1657af90bb28ef3f21e5b1f33cd ( #5364 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-14 09:24:16 +02:00
LocalAI [bot]
029f97c2a2
docs: ⬆️ update docs version mudler/LocalAI ( #5363 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-14 01:54:34 +00:00
Ettore Di Giacinto
3be71be696
fix(ci): tag latest against cpu-only image ( #5362 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-13 22:00:41 +02:00
LocalAI [bot]
6adb019f8f
chore: ⬆️ Update ggml-org/llama.cpp to de4c07f93783a1a96456a44dc16b9db538ee1618 ( #5358 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-13 22:00:19 +02:00
LocalAI [bot]
fcaa0a2f01
chore: ⬆️ Update ggml-org/whisper.cpp to e41bc5c61ae66af6be2bd7011769bb821a83e8ae ( #5357 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-13 21:59:50 +02:00
dependabot[bot]
fd17a3312c
chore(deps): bump securego/gosec from 2.22.3 to 2.22.4 ( #5356 )
...
Bumps [securego/gosec](https://github.com/securego/gosec ) from 2.22.3 to 2.22.4.
- [Release notes](https://github.com/securego/gosec/releases )
- [Changelog](https://github.com/securego/gosec/blob/master/.goreleaser.yml )
- [Commits](https://github.com/securego/gosec/compare/v2.22.3...v2.22.4 )
---
updated-dependencies:
- dependency-name: securego/gosec
dependency-version: 2.22.4
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-05-12 22:01:43 +02:00
dependabot[bot]
12d0fe610b
chore(deps): bump dependabot/fetch-metadata from 2.3.0 to 2.4.0 ( #5355 )
...
Bumps [dependabot/fetch-metadata](https://github.com/dependabot/fetch-metadata ) from 2.3.0 to 2.4.0.
- [Release notes](https://github.com/dependabot/fetch-metadata/releases )
- [Commits](https://github.com/dependabot/fetch-metadata/compare/v2.3.0...v2.4.0 )
---
updated-dependencies:
- dependency-name: dependabot/fetch-metadata
dependency-version: 2.4.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-05-12 22:01:19 +02:00
Ettore Di Giacinto
11c67d16b8
chore(ci): strip 'core' in the image suffix, identify python-based images with 'extras' ( #5353 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-12 09:36:59 +02:00
LocalAI [bot]
63f7c86c4d
chore: ⬆️ Update ggml-org/llama.cpp to 9a390c4829cd3058d26a2e2c09d16e3fd12bf1b1 ( #5351 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-12 09:24:54 +02:00
LocalAI [bot]
ac89bf77bf
chore: ⬆️ Update ggml-org/whisper.cpp to 2e310b841e0b4e7cf00890b53411dd9f8578f243 ( #4785 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-12 01:30:35 +00:00
Ettore Di Giacinto
0395cc02fb
chore(model gallery): add qwen_qwen2.5-vl-72b-instruct ( #5349 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-11 09:46:32 +02:00
Ettore Di Giacinto
616972fca0
chore(model gallery): add qwen_qwen2.5-vl-7b-instruct ( #5348 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-11 09:44:58 +02:00
Ettore Di Giacinto
942fbff62d
chore(model gallery): add gryphe_pantheon-proto-rp-1.8-30b-a3b ( #5347 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-11 09:39:28 +02:00
LocalAI [bot]
2612a0c910
chore: ⬆️ Update ggml-org/llama.cpp to 15e6125a397f6086c1dfdf7584acdb7c730313dc ( #5345 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-11 09:21:46 +02:00
LocalAI [bot]
2dcb6d7247
chore(model-gallery): ⬆️ update checksum ( #5346 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-10 22:24:04 +02:00
Ettore Di Giacinto
6978eec69f
feat(whisper.cpp): gpu support ( #5344 )
...
* fix(whisper.cpp): gpu support
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Try to fix apple tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-10 22:02:40 +02:00
LocalAI [bot]
2fcfe54466
chore: ⬆️ Update ggml-org/llama.cpp to 33eff4024084d1f0c8441b79f7208a52fad79858 ( #5343 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-10 10:07:39 +02:00
Ettore Di Giacinto
4e7506a3be
fix(whisper): add vulkan flag
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-05-10 08:46:21 +02:00
Ettore Di Giacinto
2a46217f90
Update Makefile
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-05-09 23:17:18 +02:00
Ettore Di Giacinto
31ff9dbd52
chore(Makefile): small cleanups, disable openmp on whisper
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-09 22:37:18 +02:00
Ettore Di Giacinto
9483abef03
fix(whisper/sycl): disable
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-09 22:36:09 +02:00
Ettore Di Giacinto
ce3e8b3e31
fix(whisper/sycl): use icx when running go build
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-09 21:48:09 +02:00
Ettore Di Giacinto
f3bb84c9a7
feat(whisper): link vulkan, hipblas and sycl
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-09 19:25:26 +02:00
Ettore Di Giacinto
ecb1297582
fix: specify icx and icpx only on whisper.cpp
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-09 10:58:30 +02:00
Ettore Di Giacinto
73fc702b3c
fix: this is not needed
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-09 10:28:53 +02:00
Ettore Di Giacinto
e3af62ae1a
feat: Add sycl support for whisper.cpp ( #5341 )
2025-05-09 09:31:02 +02:00
Ettore Di Giacinto
dc21604741
chore(deps): bump whisper.cpp ( #5338 )
...
* chore(deps): bump whisper.cpp
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* add libggml-metal
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups macOS arm64
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* adjust cublas for whisper.cpp
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-09 08:17:45 +02:00
LocalAI [bot]
5433f1a70e
chore: ⬆️ Update ggml-org/llama.cpp to f05a6d71a0f3dbf0730b56a1abbad41c0f42e63d ( #5340 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-08 23:13:28 +00:00
Ettore Di Giacinto
d5e032bdcd
chore(model gallery): add gemma-3-12b-fornaxv.2-qat-cot ( #5337 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-08 12:07:25 +02:00
Ettore Di Giacinto
de786f6586
chore(model gallery): add symiotic-14b-i1 ( #5336 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-08 12:03:35 +02:00
Ettore Di Giacinto
8b9bc4aa6e
chore(model gallery): add qwen3-14b-uncensored ( #5335 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-08 11:59:26 +02:00
Ettore Di Giacinto
e6cea7d28e
chore(model gallery): add cognition-ai_kevin-32b ( #5334 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-08 11:57:12 +02:00
Ettore Di Giacinto
7d7d56f2ce
chore(model gallery): add servicenow-ai_apriel-nemotron-15b-thinker ( #5333 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-08 11:55:35 +02:00
Ettore Di Giacinto
1caae91ab6
chore(model gallery): add qwen3-4b-esper3-i1 ( #5332 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-08 11:52:02 +02:00
LocalAI [bot]
e90f2cb0ca
chore: ⬆️ Update ggml-org/llama.cpp to 814f795e063c257f33b921eab4073484238a151a ( #5331 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-08 09:25:13 +02:00
Ettore Di Giacinto
5a4291fadd
docs: update README badges
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-05-07 22:20:06 +02:00
Ettore Di Giacinto
91ef58ee5a
chore(model gallery): add qwen3-14b-griffon-i1 ( #5330 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-07 11:07:38 +02:00
LocalAI [bot]
a86e8c78f1
chore: ⬆️ Update ggml-org/llama.cpp to 91a86a6f354aa73a7aab7bc3d283be410fdc93a5 ( #5329 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-06 23:39:10 +00:00
Ettore Di Giacinto
adb24214c6
chore(deps): bump llama.cpp to b34c859146630dff136943abc9852ca173a7c9d6 ( #5323 )
...
chore(deps): bump llama.cpp to 'b34c859146630dff136943abc9852ca173a7c9d6'
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-06 11:21:25 +02:00
Ettore Di Giacinto
f03a0430aa
chore(model gallery): add claria-14b ( #5326 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-06 10:48:03 +02:00
Ettore Di Giacinto
73bc12abc0
chore(model gallery): add goekdeniz-guelmez_josiefied-qwen3-8b-abliterated-v1 ( #5325 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-06 10:38:20 +02:00
Ettore Di Giacinto
7fa437bbcc
chore(model gallery): add huihui-ai_qwen3-14b-abliterated ( #5324 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-06 10:35:55 +02:00
LocalAI [bot]
4a27c99928
chore(model-gallery): ⬆️ update checksum ( #5321 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-06 10:01:28 +02:00
Ettore Di Giacinto
6ce94834b6
fix(hipblas): do not build all cpu-specific flags ( #5322 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-06 10:00:50 +02:00
dependabot[bot]
84a26458dc
chore(deps): bump mxschmitt/action-tmate from 3.21 to 3.22 ( #5319 )
...
Bumps [mxschmitt/action-tmate](https://github.com/mxschmitt/action-tmate ) from 3.21 to 3.22.
- [Release notes](https://github.com/mxschmitt/action-tmate/releases )
- [Changelog](https://github.com/mxschmitt/action-tmate/blob/master/RELEASE.md )
- [Commits](https://github.com/mxschmitt/action-tmate/compare/v3.21...v3.22 )
---
updated-dependencies:
- dependency-name: mxschmitt/action-tmate
dependency-version: '3.22'
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-05-05 22:17:59 +00:00
Ettore Di Giacinto
7aa377b6a9
fix(arm64): do not build instructions which are not available ( #5318 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-05 17:30:00 +02:00
Ettore Di Giacinto
64e66dda4a
chore(model gallery): add allura-org_remnant-qwen3-8b ( #5317 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-05 11:09:07 +02:00
LocalAI [bot]
a085f61fdc
chore: ⬆️ Update ggml-org/llama.cpp to 9fdfcdaeddd1ef57c6d041b89cd8fb7048a0f028 ( #5316 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-04 23:00:25 +00:00
Ettore Di Giacinto
21bdfe5fa4
fix: use rice when embedding large binaries ( #5309 )
...
* fix(embed): use go-rice for large backend assets
Golang embed FS has a hard limit that we might exceed when providing
many binary alternatives.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* simplify golang deps
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(tests): switch to testcontainers and print logs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(tests): do not build a test binary
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* small fixup
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-04 16:42:42 +02:00
Ettore Di Giacinto
7ebd7b2454
chore(model gallery): add rei-v3-kto-12b ( #5313 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-04 09:41:35 +02:00
Ettore Di Giacinto
6984749ea1
chore(model gallery): add kalomaze_qwen3-16b-a3b ( #5312 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-04 09:39:38 +02:00
Ettore Di Giacinto
c0a206bc7a
chore(model gallery): add qwen3-30b-a1.5b-high-speed ( #5311 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-04 09:38:01 +02:00
LocalAI [bot]
01bbb31fb3
chore: ⬆️ Update ggml-org/llama.cpp to 36667c8edcded08063ed51c7d57e9e086bbfc903 ( #5300 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-04 09:23:01 +02:00
Ettore Di Giacinto
72111c597d
fix(gpu): do not assume gpu being returned has node and mem ( #5310 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-03 19:00:24 +02:00
Ettore Di Giacinto
b2f9fc870b
chore(defaults): enlarge defaults, drop gpu layers which is infered ( #5308 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-03 18:44:51 +02:00
Ettore Di Giacinto
1fc6d469ac
chore(deps): bump llama.cpp to '1d36b3670b285e69e58b9d687c770a2a0a192194 ( #5307 )
...
chore(deps): bump llama.cpp to '1d36b3670b285e69e58b9d687c770a2a0a192194'
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-03 18:44:40 +02:00
Ettore Di Giacinto
05848b2027
chore(model gallery): add smoothie-qwen3-8b ( #5306 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-03 10:35:20 +02:00
Ettore Di Giacinto
1da0644aa3
chore(model gallery): add qwen-3-32b-medical-reasoning-i1 ( #5305 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-03 10:24:07 +02:00
Ettore Di Giacinto
c087cd1377
chore(model gallery): add amoral-qwen3-14b ( #5304 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-03 10:21:48 +02:00
Ettore Di Giacinto
c621412f6a
chore(model gallery): add comet_12b_v.5-i1 ( #5303 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-03 10:20:03 +02:00
Ettore Di Giacinto
5a8b1892cd
chore(model gallery): add genericrpv3-4b ( #5302 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-03 10:18:31 +02:00
Ettore Di Giacinto
5b20426863
chore(model gallery): add planetoid_27b_v.2 ( #5301 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-03 10:14:33 +02:00
Ettore Di Giacinto
5c6cd50ed6
feat(llama.cpp): estimate vram usage ( #5299 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-02 17:40:26 +02:00
Ettore Di Giacinto
bace6516f1
chore(model gallery): add webthinker-qwq-32b-i1 ( #5298 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-02 09:57:49 +02:00
Ettore Di Giacinto
3baadf6f27
chore(model gallery): add shuttleai_shuttle-3.5 ( #5297 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-02 09:48:11 +02:00
Ettore Di Giacinto
8804c701b8
chore(model gallery): add microsoft_phi-4-reasoning ( #5296 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-02 09:46:20 +02:00
Ettore Di Giacinto
7b3ceb19bb
chore(model gallery): add microsoft_phi-4-reasoning-plus ( #5295 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-02 09:43:38 +02:00
Ettore Di Giacinto
e7f3effea1
chore(model gallery): add furina-8b ( #5294 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-02 09:39:22 +02:00
Ettore Di Giacinto
61694a2ffb
chore(model gallery): add josiefied-qwen3-8b-abliterated-v1 ( #5293 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-02 09:36:35 +02:00
LocalAI [bot]
573a3f104c
chore: ⬆️ Update ggml-org/llama.cpp to d7a14c42a1883a34a6553cbfe30da1e1b84dfd6a ( #5292 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-02 09:21:38 +02:00
Ettore Di Giacinto
0e8af53a5b
chore: update quickstart
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-01 22:36:33 +02:00
Ettore Di Giacinto
960ffa808c
chore(model gallery): add microsoft_phi-4-mini-reasoning ( #5288 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-01 10:17:58 +02:00
Ettore Di Giacinto
92719568e5
chore(model gallery): add fast-math-qwen3-14b ( #5287 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-01 10:14:51 +02:00
Ettore Di Giacinto
163939af71
chore(model gallery): add qwen3-8b-jailbroken ( #5286 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-01 10:13:01 +02:00
Ettore Di Giacinto
399f1241dc
chore(model gallery): add qwen3-30b-a3b-abliterated ( #5285 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-05-01 10:07:42 +02:00
LocalAI [bot]
58c9ade2e8
chore: ⬆️ Update ggml-org/llama.cpp to 3e168bede4d27b35656ab8026015b87659ecbec2 ( #5284 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-05-01 10:01:39 +02:00
Ettore Di Giacinto
6e1c93d84f
fix(ci): comment out vllm tests
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-05-01 10:01:22 +02:00
Wyatt Neal
4076ea0494
fix: vllm missing logprobs ( #5279 )
...
* working to address missing items
referencing #3436 , #2930 - if i could test it, this might show that the
output from the vllm backend is processed and returned to the user
Signed-off-by: Wyatt Neal <wyatt.neal+git@gmail.com >
* adding in vllm tests to test-extras
Signed-off-by: Wyatt Neal <wyatt.neal+git@gmail.com >
* adding in tests to pipeline for execution
Signed-off-by: Wyatt Neal <wyatt.neal+git@gmail.com >
* removing todo block, test via pipeline
Signed-off-by: Wyatt Neal <wyatt.neal+git@gmail.com >
---------
Signed-off-by: Wyatt Neal <wyatt.neal+git@gmail.com >
2025-04-30 12:55:07 +00:00
Ettore Di Giacinto
26cbf77c0d
chore(model gallery): add mlabonne_qwen3-4b-abliterated ( #5283 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-30 11:09:58 +02:00
Ettore Di Giacinto
640790d628
chore(model gallery): add mlabonne_qwen3-8b-abliterated ( #5282 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-30 11:08:26 +02:00
Ettore Di Giacinto
4132adea2f
chore(model gallery): add mlabonne_qwen3-14b-abliterated ( #5281 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-30 11:04:49 +02:00
LocalAI [bot]
2b2d907a3a
chore: ⬆️ Update ggml-org/llama.cpp to e2e1ddb93a01ce282e304431b37e60b3cddb6114 ( #5278 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-29 21:46:08 +00:00
Ettore Di Giacinto
6e8f4f584b
fix(diffusers): consider options only in form of key/value ( #5277 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-29 17:08:55 +02:00
Richard Palethorpe
662cfc2b48
fix(aio): Fix copypasta in download files for gpt-4 model ( #5276 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-04-29 17:08:16 +02:00
Ettore Di Giacinto
a25d355d66
chore(model gallery): add qwen3-0.6b ( #5275 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-29 10:10:16 +02:00
Ettore Di Giacinto
6d1cfdbefc
chore(model gallery): add qwen3-1.7b ( #5274 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-29 10:06:03 +02:00
Ettore Di Giacinto
5ecc478968
chore(model gallery): add qwen3-4b ( #5273 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-29 10:01:22 +02:00
Ettore Di Giacinto
aef5c4291b
chore(model gallery): add qwen3-8b ( #5272 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-29 09:59:17 +02:00
Ettore Di Giacinto
c059f912b9
chore(model gallery): add qwen3-14b ( #5271 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-29 09:56:50 +02:00
LocalAI [bot]
bc1e059259
chore: ⬆️ Update ggml-org/llama.cpp to 5f5e39e1ba5dbea814e41f2a15e035d749a520bc ( #5267 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-29 09:49:42 +02:00
LocalAI [bot]
38dc07793a
chore(model-gallery): ⬆️ update checksum ( #5268 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-29 09:49:23 +02:00
Ettore Di Giacinto
da6ef0967d
chore(model gallery): add qwen3-32b ( #5270 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-29 09:48:28 +02:00
Ettore Di Giacinto
7a011e60bd
chore(model gallery): add qwen3-30b-a3b ( #5269 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-29 09:44:44 +02:00
dependabot[bot]
e13dd5b09f
chore(deps): bump appleboy/scp-action from 0.1.7 to 1.0.0 ( #5265 )
...
Bumps [appleboy/scp-action](https://github.com/appleboy/scp-action ) from 0.1.7 to 1.0.0.
- [Release notes](https://github.com/appleboy/scp-action/releases )
- [Changelog](https://github.com/appleboy/scp-action/blob/master/.goreleaser.yaml )
- [Commits](https://github.com/appleboy/scp-action/compare/v0.1.7...v1.0.0 )
---
updated-dependencies:
- dependency-name: appleboy/scp-action
dependency-version: 1.0.0
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-04-28 22:36:30 +00:00
Ettore Di Giacinto
86ee303bd6
chore(model gallery): add nvidia_openmath-nemotron-14b-kaggle ( #5264 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-28 19:52:36 +02:00
Ettore Di Giacinto
978ee96fd3
chore(model gallery): add nvidia_openmath-nemotron-14b ( #5263 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-28 19:43:49 +02:00
Ettore Di Giacinto
3ad5691db6
chore(model gallery): add nvidia_openmath-nemotron-7b ( #5262 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-28 19:41:59 +02:00
Ettore Di Giacinto
0027681090
chore(model gallery): add nvidia_openmath-nemotron-1.5b ( #5261 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-28 19:40:09 +02:00
Ettore Di Giacinto
8cba990edc
chore(model gallery): add nvidia_openmath-nemotron-32b ( #5260 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-28 19:36:57 +02:00
Simon Redman
88857696d4
fix(CUDA): Add note for how to run CUDA with SELinux ( #5259 )
...
* Add note to help run nvidia containers with SELinux
* Use correct CUDA container references as noted in the dockerhub overview
* Clean trailing whitespaces
2025-04-28 09:00:52 +02:00
LocalAI [bot]
23f347e687
chore: ⬆️ Update ggml-org/llama.cpp to ced44be34290fab450f8344efa047d8a08e723b4 ( #5258 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-27 21:59:35 +00:00
Mohit Gaur
b6e3dc5f02
docs: update docs for DisableWebUI flag ( #5256 )
...
Signed-off-by: Mohit Gaur <56885276+Mohit-Gaur@users.noreply.github.com >
2025-04-27 16:02:02 +02:00
Alessandro Pirastru
69667521e2
fix(install/gpu):Fix docker not being able to leverage the GPU on systems that have SELinux Enforced ( #5252 )
...
* Update installation script for improved compatibility and clarity
- Renamed VERSION to LOCALAI_VERSION to avoid conflicts with system variables.
- Enhanced NVIDIA and CUDA repository installation for DNF5 compatibility.
- Adjusted default Fedora version handling for CUDA installation.
- Updated Docker image tag handling to use LOCALAI_VERSION consistently.
- Improved logging messages for repository and LocalAI binary downloads.
- Added a temporary bypass for nvidia-smi installation on Fedora Cloud Edition.
* feat: Add SELinux configuration for NVIDIA GPU support in containers
- Introduced `enable_selinux_container_booleans` function to handle SELinux configuration changes for GPU access.
- Included user confirmation prompt to enable SELinux `container_use_devices` boolean due to security implications.
- Added NVIDIA Container Runtime to Docker runtimes and restarted Docker to ensure proper GPU support.
- Applied SELinux adjustments conditionally for Fedora, RHEL, CentOS, Rocky, and openSUSE distributions.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
* fix: Correct SELinux boolean parsing and add loop break
- Fixed incorrect parsing of `container_use_devices` boolean by changing the awk field from `$2` to `$3` to retrieve the correct value.
- Added a `break` statement after enabling the SELinux boolean to prevent unnecessary loop iterations after user prompt.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
* fix: typo in install.sh
Signed-off-by: Alessandro Pirastru <57262788+Bloodis94@users.noreply.github.com >
---------
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
Signed-off-by: Alessandro Pirastru <57262788+Bloodis94@users.noreply.github.com >
2025-04-27 16:01:29 +02:00
LocalAI [bot]
2a92effc5d
chore: ⬆️ Update ggml-org/llama.cpp to 77d5e9a76a7b4a8a7c5bf9cf6ebef91860123cba ( #5254 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-27 09:21:02 +02:00
Simon Redman
a65e012aa2
docs(Vulkan): Add GPU docker documentation for Vulkan ( #5255 )
...
Add GPU docker documentation for Vulkan
2025-04-27 09:20:26 +02:00
Ettore Di Giacinto
8e9b41d05f
chore(ci): build only images with ffmpeg included, simplify tags ( #5251 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-27 08:23:25 +02:00
LocalAI [bot]
078da5c2f0
feat(swagger): update swagger ( #5253 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-26 22:40:35 +00:00
Ettore Di Giacinto
c5af5d139c
Update index.yaml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-04-26 18:42:22 +02:00
Ettore Di Giacinto
2c9279a542
feat(video-gen): add endpoint for video generation ( #5247 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-26 18:05:01 +02:00
Ettore Di Giacinto
a67d22f5f2
chore(model gallery): add mmproj to gemma3 models (now working)
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-26 17:31:40 +02:00
Ettore Di Giacinto
dc7c51dcc7
chore(model gallery): fix correct filename for gemma-3-27b-it-qat
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-26 17:27:50 +02:00
Ettore Di Giacinto
98df65c7aa
chore(model gallery): add l3.3-genetic-lemonade-sunset-70b ( #5250 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-26 17:19:20 +02:00
Ettore Di Giacinto
1559b6b522
chore(model gallery): add l3.3-geneticlemonade-unleashed-v2-70b ( #5249 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-26 17:17:18 +02:00
Alessandro Pirastru
a0244e3fb4
feat(install): added complete process for installing nvidia drivers on fedora without pulling X11 ( #5246 )
...
* Update installation script for improved compatibility and clarity
- Renamed VERSION to LOCALAI_VERSION to avoid conflicts with system variables.
- Enhanced NVIDIA and CUDA repository installation for DNF5 compatibility.
- Adjusted default Fedora version handling for CUDA installation.
- Updated Docker image tag handling to use LOCALAI_VERSION consistently.
- Improved logging messages for repository and LocalAI binary downloads.
- Added a temporary bypass for nvidia-smi installation on Fedora Cloud Edition.
* Enhance log functions with ANSI color formatting
- Added ANSI escape codes for improved log styling: light blue for info, orange for warnings, and red for errors.
- Updated all log functions (`info`, `warn`, `fatal`) to include bold and colored output.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
* feat: Enhance log functions with ANSI color formatting
- Added ANSI escape codes for improved log styling: light blue for info, orange for warnings, and red for errors.
- Updated all log functions (`info`, `warn`, `fatal`) to include bold and colored output.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
* chore: ⬆️ Update ggml-org/llama.cpp to `ecda2ec4b347031a9b8a89ee2efc664ce63f599c` (#5238 )
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
* fix(stablediffusion-ggml): Build with DSD CUDA, HIP and Metal flags (#5236 )
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* feat(install): enhance script with choice functions and logs
- Added custom `choice_info`, `choice_warn`, and `choice_fatal` functions for interactive input logging.
- Adjusted Docker volume creation message for better clarity.
- Included NVIDIA driver check log for improved feedback to users.
- Added consistent logging before starting LocalAI Docker containers across configurations.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
* feat(install): add Fedora NVIDIA driver installation option
- Introduced a new function to install NVIDIA kernel drivers on Fedora using akmod packages.
- Added user prompt to choose between installing drivers automatically or exiting for manual setup.
- Integrated the new function into the existing Fedora-specific CUDA toolkit installation workflow.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
* fix(install): correct repository ID for DNF5 configuration
- Update repository ID from 'nome-repo' to 'nvidia-cuda' for DNF5.
- Ensures the correct repository name matches expected configuration.
- Fix prevents potential misconfiguration during installation process.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
* feat(install): enhance NVIDIA driver handling on Fedora
- fixed `install_cuda_driver_yum` function call in `install_fedora_nvidia_kernel_drivers`
- Added `cuda-toolkit` for Fedora installations, as recommended by RPM Fusion.
- Adjusted driver repository commands for compatibility with DNF5.
- Improved URL and version handling for package manager installations.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
* Refactor NVIDIA driver installation process in install.sh
- Removed redundant empty lines for cleaner formatting.
- Standardized URL formatting by removing unnecessary quotes around URLs.
- Reverted logic by removing Fedora-specific exclusions for cuda-toolkit and using `cuda-drivers` universally.
- Refined repository addition for `dnf` by explicitly setting `id` and `name` parameters for clarity and accuracy.
- Fixed minor formatting inconsistencies in parameter passing.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
* feat: Update NVIDIA module installation warning in install script
- Clarified that Akmod installation may inhibit the reboot command.
- Added a cautionary note to the warning to inform users of potential risks.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
* Update NVIDIA driver installation warning message
- Clarify prerequisites by noting the need for rpmfusion free/nonfree repos.
- Improve formatting of the warning box for better readability.
- Inform users that the script will install missing repos if necessary.
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
---------
Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com >
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Signed-off-by: Richard Palethorpe <io@richiejp.com >
Co-authored-by: LocalAI [bot] <139863280+localai-bot@users.noreply.github.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Richard Palethorpe <io@richiejp.com >
2025-04-26 09:44:40 +02:00
LocalAI [bot]
d66396201a
chore: ⬆️ Update ggml-org/llama.cpp to 295354ea6848a77bdee204ee1c971d9b92ffcca9 ( #5245 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-26 00:05:16 +02:00
Ettore Di Giacinto
9628860c0e
feat(llama.cpp/clip): inject gpu options if we detect GPUs ( #5243 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-26 00:04:47 +02:00
Ettore Di Giacinto
cae9bf1308
chore(deps): bump grpcio to 1.72.0 ( #5244 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-25 21:32:37 +02:00
Ettore Di Giacinto
5bb5da0760
fix(ci): add clang ( #5242 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-25 16:20:05 +02:00
Ettore Di Giacinto
867973a850
chore(model gallery): add soob3123_veritas-12b ( #5241 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-25 09:20:01 +02:00
LocalAI [bot]
701cd6b6d5
chore: ⬆️ Update ggml-org/llama.cpp to 226251ed56b85190e18a1cca963c45b888f4953c ( #5240 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-25 08:42:22 +02:00
Richard Palethorpe
7f61d397d5
fix(stablediffusion-ggml): Build with DSD CUDA, HIP and Metal flags ( #5236 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-04-24 10:27:17 +02:00
Alessandro Pirastru
1ae0b896fa
fix: installation script compatibility with fedora 41 and later, fedora headless unclear errors ( #5239 )
...
Update installation script for improved compatibility and clarity
- Renamed VERSION to LOCALAI_VERSION to avoid conflicts with system variables.
- Enhanced NVIDIA and CUDA repository installation for DNF5 compatibility.
- Adjusted default Fedora version handling for CUDA installation.
- Updated Docker image tag handling to use LOCALAI_VERSION consistently.
- Improved logging messages for repository and LocalAI binary downloads.
- Added a temporary bypass for nvidia-smi installation on Fedora Cloud Edition.
2025-04-24 09:34:25 +02:00
LocalAI [bot]
3937407cb3
chore: ⬆️ Update ggml-org/llama.cpp to ecda2ec4b347031a9b8a89ee2efc664ce63f599c ( #5238 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-24 09:32:08 +02:00
LocalAI [bot]
0e34ae4f3f
chore: ⬆️ Update ggml-org/llama.cpp to 658987cfc9d752dca7758987390d5fb1a7a0a54a ( #5234 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-23 09:13:49 +02:00
dependabot[bot]
a38b99ecb6
chore(deps): bump mxschmitt/action-tmate from 3.19 to 3.21 ( #5231 )
...
Bumps [mxschmitt/action-tmate](https://github.com/mxschmitt/action-tmate ) from 3.19 to 3.21.
- [Release notes](https://github.com/mxschmitt/action-tmate/releases )
- [Changelog](https://github.com/mxschmitt/action-tmate/blob/master/RELEASE.md )
- [Commits](https://github.com/mxschmitt/action-tmate/compare/v3.19...v3.21 )
---
updated-dependencies:
- dependency-name: mxschmitt/action-tmate
dependency-version: '3.21'
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-04-22 10:27:10 +02:00
LocalAI [bot]
a4a4358182
chore: ⬆️ Update ggml-org/llama.cpp to 1d735c0b4fa0551c51c2f4ac888dd9a01f447985 ( #5233 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-22 10:25:54 +02:00
Ettore Di Giacinto
4bc39c2db3
fix: typo on README link
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-04-21 22:13:14 +02:00
Ettore Di Giacinto
cc3df759f8
chore(docs): improve installer.sh docs ( #5232 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-21 22:11:43 +02:00
LocalAI [bot]
378161060c
chore: ⬆️ Update ggml-org/llama.cpp to 6602304814e679cc8c162bb760a034aceb4f8965 ( #5228 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-20 21:44:33 +00:00
Ettore Di Giacinto
f2f788fe60
chore(model gallery): add starrysky-12b-i1 ( #5224 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-20 10:26:30 +02:00
Ettore Di Giacinto
9fa8ed6b1e
chore(model gallery) add amoral-gemma3-1b-v2 ( #5223 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-20 10:23:24 +02:00
Ettore Di Giacinto
7fc37c5e29
chore(model gallery) add llama_3.3_70b_darkhorse-i1 ( #5222 )
...
chore(model gallery): add llama_3.3_70b_darkhorse-i1
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-20 10:20:58 +02:00
Ettore Di Giacinto
4bc4b1e8bc
chore(model gallery) update gemma3 qat models
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-20 10:11:12 +02:00
LocalAI [bot]
e495b89f18
chore: ⬆️ Update ggml-org/llama.cpp to 00137157fca3d17b90380762b4d7cc158d385bd3 ( #5218 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-19 23:50:35 +00:00
LocalAI [bot]
ba09eaea1b
feat(swagger): update swagger ( #5217 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-19 22:06:30 +02:00
Ettore Di Giacinto
61cc76c455
chore(autogptq): drop archived backend ( #5214 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-19 15:52:29 +02:00
Ettore Di Giacinto
8abecb4a18
chore: bump grpc limits to 50MB ( #5212 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-19 08:53:24 +02:00
LocalAI [bot]
8b3f76d8e6
chore: ⬆️ Update ggml-org/llama.cpp to 6408210082cc0a61b992b487be7e2ff2efbb9e36 ( #5211 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-18 21:45:48 +00:00
Ettore Di Giacinto
4e0497f1a6
chore(model gallery): add pictor-1338-qwenp-1.5b ( #5208 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-18 10:47:23 +02:00
Ettore Di Giacinto
ba88c9f451
chore(ci): use gemma-3-12b-it for models notifications (twitter)
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-04-18 10:38:36 +02:00
Ettore Di Giacinto
a598285825
chore(model gallery): add google-gemma-3-27b-it-qat-q4_0-small ( #5207 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-18 10:35:48 +02:00
Ettore Di Giacinto
cb7a172897
chore(ci): use gemma-3-12b-it for models notifications
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-04-18 10:20:33 +02:00
Ettore Di Giacinto
771be28dfb
ci: use gemma3 for notifications of releases
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-04-18 10:19:52 +02:00
Ettore Di Giacinto
7d6b3eb42d
chore(model gallery): add readyart_amoral-fallen-omega-gemma3-12b ( #5206 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-18 10:17:39 +02:00
Ettore Di Giacinto
0bb33fab55
chore(model gallery): add ibm-granite_granite-3.3-2b-instruct ( #5205 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-18 10:15:05 +02:00
Ettore Di Giacinto
e3bf7f77f7
chore(model gallery): add ibm-granite_granite-3.3-8b-instruct ( #5204 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-18 09:59:17 +02:00
LocalAI [bot]
bd1707d339
chore: ⬆️ Update ggml-org/llama.cpp to 2f74c354c0f752ed9aabf7d3a350e6edebd7e744 ( #5203 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-17 21:52:12 +00:00
Ettore Di Giacinto
0474804541
fix(ci): remove duplicate entry
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-17 19:51:21 +02:00
Ettore Di Giacinto
72693b3917
feat(install.sh): allow to uninstall with --uninstall ( #5202 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-17 16:32:23 +02:00
Florian Bachmann
a03b70010f
fix(talk): Talk interface sends content-type headers to chatgpt ( #5200 )
...
Talk interface sends content-type headers to chatgpt
Signed-off-by: baflo <834350+baflo@users.noreply.github.com >
2025-04-17 15:02:11 +02:00
Ettore Di Giacinto
e3717e5c1a
chore(model gallery): add qwen2.5-14b-instruct-1m ( #5201 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-17 10:42:22 +02:00
Ettore Di Giacinto
c8f6858218
chore(ci): add latest images for core ( #5198 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-17 10:00:18 +02:00
Ettore Di Giacinto
06d7cc43ae
chore(model gallery): add dreamgen_lucid-v1-nemo ( #5196 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-17 09:10:09 +02:00
Ettore Di Giacinto
f2147cb850
chore(model gallery): add thedrummer_rivermind-12b-v1 ( #5195 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-17 09:02:54 +02:00
Ettore Di Giacinto
75bb9f4c28
chore(model gallery): add menlo_rezero-v0.1-llama-3.2-3b-it-grpo-250404 ( #5194 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-17 09:00:11 +02:00
LocalAI [bot]
a2ef4b1e07
chore: ⬆️ Update ggml-org/llama.cpp to 015022bb53387baa8b23817ac03743705c7d472b ( #5192 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-04-17 08:04:37 +02:00
LocalAI [bot]
161c9fe2db
docs: ⬆️ update docs version mudler/LocalAI ( #5191 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-16 22:13:49 +02:00
Ettore Di Giacinto
7547463f81
Update quickstart.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-04-16 08:48:55 +02:00
Gianluca Boiano
32e4dfd47b
chore(model gallery): add suno-ai bark-cpp model ( #5187 )
...
Signed-off-by: Gianluca Boiano <morf3089@gmail.com >
2025-04-16 08:22:46 +02:00
Gianluca Boiano
f67e5dec68
fix: bark-cpp: assign FLAG_TTS to bark-cpp backend ( #5186 )
...
Signed-off-by: Gianluca Boiano <morf3089@gmail.com >
2025-04-16 08:21:30 +02:00
LocalAI [bot]
297d54acea
chore: ⬆️ Update ggml-org/llama.cpp to 80f19b41869728eeb6a26569957b92a773a2b2c6 ( #5183 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-15 22:50:32 +00:00
Ettore Di Giacinto
56f44d448c
chore(docs): decrease logo size, minor enhancements
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-15 22:00:51 +02:00
Richard Palethorpe
0f0fafacd9
fix(stablediffusion): Avoid overwriting SYCL specific flags from outer make call ( #5181 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-04-15 19:31:25 +02:00
Ettore Di Giacinto
4f239bac89
feat: rebrand - LocalAGI and LocalRecall joins the LocalAI stack family ( #5159 )
...
* wip
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* docs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Update lotusdocs and hugo
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* rephrasing
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Latest fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Adjust readme section
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-15 17:51:24 +02:00
Ettore Di Giacinto
04d74ac648
chore(model gallery): add m1-32b ( #5182 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-15 17:17:17 +02:00
Richard Palethorpe
18c3dc33ee
fix(stablediffusion): Pass ROCM LD CGO flags through to recursive make ( #5179 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-04-15 09:27:29 +02:00
LocalAI [bot]
508cfa7369
chore: ⬆️ Update ggml-org/llama.cpp to d6d2c2ab8c8865784ba9fef37f2b2de3f2134d33 ( #5178 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-14 23:10:16 +02:00
Ettore Di Giacinto
1f94cddbae
chore(model gallery): add nvidia_llama-3.1-8b-ultralong-4m-instruct ( #5177 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-14 12:30:55 +02:00
Ettore Di Giacinto
21ae7b4cd4
chore(model gallery): add nvidia_llama-3.1-8b-ultralong-1m-instruct ( #5176 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-14 12:28:09 +02:00
Ettore Di Giacinto
bef22ab547
chore(model gallery): add skywork_skywork-or1-32b-preview ( #5175 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-14 12:25:43 +02:00
Ettore Di Giacinto
eb04e8cdcf
chore(model gallery): add skywork_skywork-or1-math-7b ( #5174 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-14 12:23:44 +02:00
Ettore Di Giacinto
17e533a086
chore(model gallery): add skywork_skywork-or1-7b-preview ( #5173 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-14 12:20:20 +02:00
qwerty108109
4fc68409ff
Update README.md ( #5172 )
...
Modified the README.md to separate out the different docker run commands to make it easier to copy into the terminal.
Signed-off-by: qwerty108109 <97707491+qwerty108109@users.noreply.github.com >
2025-04-14 10:48:10 +02:00
Richard Palethorpe
e587044449
fix(stablediffusion): Avoid GGML commit which causes CUDA compile error ( #5170 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-04-14 09:29:09 +02:00
LocalAI [bot]
1f09db5161
chore: ⬆️ Update ggml-org/llama.cpp to 71e90e8813f90097701e62f7fce137d96ddf41e2 ( #5171 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-13 21:46:07 +00:00
Ettore Di Giacinto
05b744f086
chore(model gallery): add daichi-12b ( #5169 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-13 15:53:11 +02:00
Ettore Di Giacinto
89ca4bc02d
chore(model gallery): add hamanasu-magnum-4b-i1 ( #5168 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-13 14:37:59 +02:00
Ettore Di Giacinto
e626aa48a4
chore(model gallery): add hamanasu-adventure-4b-i1 ( #5167 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-13 14:35:57 +02:00
Ettore Di Giacinto
752b5e0339
chore(model gallery): add mag-picaro-72b ( #5166 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-13 14:34:14 +02:00
Ettore Di Giacinto
637d72d6e3
chore(model gallery): add lightthinker-qwen ( #5165 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-13 14:31:05 +02:00
LocalAI [bot]
f3bfec580a
chore: ⬆️ Update ggml-org/llama.cpp to bc091a4dc585af25c438c8473285a8cfec5c7695 ( #5158 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-13 08:23:41 +00:00
Ettore Di Giacinto
165c1ddff3
chore(model gallery): add tesslate_gradience-t1-3b-preview ( #5160 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-12 10:37:40 +02:00
Ettore Di Giacinto
fb83238e9e
chore(model gallery): add zyphra_zr1-1.5b ( #5157 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-11 10:06:05 +02:00
Ettore Di Giacinto
700bfa41c7
chore(model gallery): add agentica-org_deepcoder-1.5b-preview ( #5156 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-11 10:03:59 +02:00
LocalAI [bot]
25bdc350df
chore: ⬆️ Update ggml-org/llama.cpp to 64eda5deb9859e87a020e56bab5d2f9ca956f1de ( #5155 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-10 21:44:55 +00:00
Richard Palethorpe
1b899e1a68
feat(stablediffusion): Enable SYCL ( #5144 )
...
* feat(sycl): Enable SYCL for stable diffusion
This is a pain because we compile with CGO, but SD is compiled with
CMake. I don't think we can easily use CMake to set the linker flags
necessary. Also I could not find pkg-config calls that would fully set
the flags, so some of them are set manually.
See https://www.intel.com/content/www/us/en/developer/tools/oneapi/onemkl-link-line-advisor.html
for reference. I also resorted to searching the shared object files in
MKLROOT/lib for the symbols.
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(ci): Don't set nproc on cmake
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-04-10 15:20:53 +02:00
Ettore Di Giacinto
3bf13f8c69
chore(model gallery): add soob3123_amoral-cogito-v1-preview-qwen-14b ( #5154 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-10 10:07:56 +02:00
Ettore Di Giacinto
7a00729374
chore(model gallery): add trappu_magnum-picaro-0.7-v2-12b ( #5153 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-10 10:03:42 +02:00
Ettore Di Giacinto
d484028532
feat(diffusers): add support for Lumina2Text2ImgPipeline ( #4806 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-10 09:55:51 +02:00
LocalAI [bot]
0eb7fc2c41
chore: ⬆️ Update ggml-org/llama.cpp to d3bd7193ba66c15963fd1c59448f22019a8caf6e ( #5152 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-09 22:01:25 +00:00
Ettore Di Giacinto
a69e30e0c9
chore(model gallery): add agentica-org_deepcoder-14b-preview ( #5151 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-09 16:55:47 +02:00
Ettore Di Giacinto
9c018e6bff
chore(model gallery): add deepcogito_cogito-v1-preview-llama-70b ( #5150 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-09 16:54:59 +02:00
Ettore Di Giacinto
281e818047
chore(model gallery): add deepcogito_cogito-v1-preview-llama-70b ( #5150 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-09 16:53:28 +02:00
Ettore Di Giacinto
270f0e2157
chore(model gallery): add deepcogito_cogito-v1-preview-qwen-32b ( #5149 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-09 16:48:15 +02:00
Ettore Di Giacinto
673e59e76c
chore(model gallery): add deepcogito_cogito-v1-preview-llama-3b ( #5148 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-09 16:42:53 +02:00
LocalAI [bot]
5a8a2adb44
chore: ⬆️ Update ggml-org/llama.cpp to b32efad2bc42460637c3a364c9554ea8217b3d7f ( #5146 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-04-09 15:39:04 +02:00
Ettore Di Giacinto
a7317d23bf
chore(model gallery): add deepcogito_cogito-v1-preview-llama-8b ( #5147 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-09 10:02:09 +02:00
Ettore Di Giacinto
2bab9b5fe2
fix: fix gallery name for cogito-v1-preview-qwen-14B
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-04-08 22:15:32 +02:00
Ettore Di Giacinto
081be3ba7d
chore(model gallery): add cogito-v1-preview-qwen-14b ( #5145 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-08 22:04:14 +02:00
Ettore Di Giacinto
25e6f21322
chore(deps): bump llama.cpp to 4ccea213bc629c4eef7b520f7f6c59ce9bbdaca0 ( #5143 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-08 11:26:06 +02:00
Ettore Di Giacinto
b4df1c9cf3
fix(gemma): improve prompt for tool calls ( #5142 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-08 10:12:42 +02:00
Ettore Di Giacinto
4fbd6609f2
chore(model gallery): add meta-llama_llama-4-scout-17b-16e-instruct ( #5141 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-08 10:12:28 +02:00
Ettore Di Giacinto
7387932f89
chore(model gallery): add mensa-beta-14b-instruct-i1 ( #5140 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-08 10:01:24 +02:00
Ettore Di Giacinto
59c37e67b2
chore(model gallery): add eurydice-24b-v2-i1 ( #5139 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-08 09:56:29 +02:00
Ettore Di Giacinto
c09d227647
chore(model gallery): add watt-ai_watt-tool-70b ( #5138 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-08 09:42:49 +02:00
Ettore Di Giacinto
547d322b28
chore(model gallery): add arliai_qwq-32b-arliai-rpr-v ( #5137 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-08 09:40:26 +02:00
dependabot[bot]
a6f0bb410f
chore(deps): bump securego/gosec from 2.22.0 to 2.22.3 ( #5134 )
...
Bumps [securego/gosec](https://github.com/securego/gosec ) from 2.22.0 to 2.22.3.
- [Release notes](https://github.com/securego/gosec/releases )
- [Changelog](https://github.com/securego/gosec/blob/master/.goreleaser.yml )
- [Commits](https://github.com/securego/gosec/compare/v2.22.0...v2.22.3 )
---
updated-dependencies:
- dependency-name: securego/gosec
dependency-version: 2.22.3
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-04-07 21:09:45 +00:00
Ettore Di Giacinto
710f624ecd
fix(webui): improve model display, do not block view ( #5133 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-07 18:03:25 +02:00
LocalAI [bot]
5018452be7
chore: ⬆️ Update ggml-org/llama.cpp to 916c83bfe7f8b08ada609c3b8e583cf5301e594b ( #5130 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-06 21:51:51 +00:00
Ettore Di Giacinto
ece239966f
chore: ⬆️ Update ggml-org/llama.cpp to 6bf28f0111ff9f21b3c1b1eace20c590281e7ba6 ( #5127 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-06 14:01:51 +02:00
Ettore Di Giacinto
3b8bc7e64c
chore(model gallery): add open-thoughts_openthinker2-7b ( #5129 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-06 10:53:22 +02:00
Ettore Di Giacinto
fc73b2b430
chore(model gallery): add open-thoughts_openthinker2-32b ( #5128 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-06 10:48:21 +02:00
Ettore Di Giacinto
901dba6063
chore(model gallery): add gemma-3-27b-it-qat ( #5124 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-05 08:46:49 +02:00
LocalAI [bot]
b88a7a4550
chore: ⬆️ Update ggml-org/llama.cpp to 3e1d29348b5d77269f6931500dd1c1a729d429c8 ( #5123 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-04 21:49:53 +00:00
Ettore Di Giacinto
106e40845f
chore(model gallery): add katanemo_arch-function-chat-3b ( #5122 )
2025-04-04 10:45:44 +02:00
Ettore Di Giacinto
0064bec8f5
chore(model gallery): add katanemo_arch-function-chat-1.5b ( #5121 )
2025-04-04 10:31:44 +02:00
Ettore Di Giacinto
9e6dbb0b5a
chore(model gallery): add katanemo_arch-function-chat-7b ( #5120 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-04 10:29:47 +02:00
Ettore Di Giacinto
d26e61388b
chore(model gallery): add tesslate_synthia-s1-27b ( #5119 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-04 10:27:52 +02:00
Ettore Di Giacinto
31a7084c75
chore(model gallery): add gemma-3-4b-it-qat ( #5118 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-04 10:23:56 +02:00
Ettore Di Giacinto
128612a6fc
chore(model gallery): add gemma-3-12b-it-qat ( #5117 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-04 10:21:45 +02:00
LocalAI [bot]
6af3f46bc3
chore: ⬆️ Update ggml-org/llama.cpp to c262beddf29f3f3be5bbbf167b56029a19876956 ( #5116 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-03 22:59:49 +00:00
Richard Palethorpe
d2cf8ef070
fix(sycl): kernel not found error by forcing -fsycl ( #5115 )
...
* chore(sycl): Update oneapi to 2025:1
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(sycl): Pass -fsycl flag as workaround
-fsycl should be set by llama.cpp's cmake file, but something goes wrong
and it doesn't appear to get added
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(build): Speed up llama build by using all CPUs
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-04-03 16:22:59 +02:00
Ettore Di Giacinto
259ad3cfe6
chore(model gallery): add all-hands_openhands-lm-1.5b-v0.1 ( #5114 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-03 10:25:46 +02:00
Ettore Di Giacinto
18b320d577
chore(deps): bump llama.cpp to 'f01bd02376f919b05ee635f438311be8dfc91d7c ( #5110 )
...
chore(deps): bump llama.cpp to 'f01bd02376f919b05ee635f438311be8dfc91d7c'
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-03 10:23:14 +02:00
Ettore Di Giacinto
89e151f035
chore(model gallery): add all-hands_openhands-lm-7b-v0.1 ( #5113 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-03 10:20:20 +02:00
Ettore Di Giacinto
22060f6410
chore(model gallery): add burtenshaw_gemmacoder3-12b ( #5112 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-03 10:17:57 +02:00
Ettore Di Giacinto
7ee3288460
chore(model gallery): add all-hands_openhands-lm-32b-v0.1 ( #5111 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-03 10:15:57 +02:00
LocalAI [bot]
cbbc954a8c
chore: ⬆️ Update ggml-org/llama.cpp to f423981ac806bf031d83784bcb47d2721bc70f97 ( #5108 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-02 09:22:53 +02:00
Ettore Di Giacinto
2c425e9c69
feat(loader): enhance single active backend by treating as singleton ( #5107 )
...
feat(loader): enhance single active backend by treating at singleton
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-01 20:58:11 +02:00
LocalAI [bot]
c59975ab05
chore: ⬆️ Update ggml-org/llama.cpp to c80a7759dab10657b9b6c3e87eef988a133b9b6a ( #5105 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-04-01 00:01:34 +02:00
Ettore Di Giacinto
05f7004487
fix: race during stop of active backends ( #5106 )
...
* chore: drop double call to stop all backends, refactors
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: do lock when cycling to models to delete
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-04-01 00:01:10 +02:00
Ettore Di Giacinto
2f9203cd2a
chore: drop remoteLibraryURL from kong vars ( #5103 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-31 22:48:17 +02:00
LocalAI [bot]
f09b33f2ef
docs: ⬆️ update docs version mudler/LocalAI ( #5104 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-31 22:48:03 +02:00
Ettore Di Giacinto
65470b0ab1
Update README
2025-03-31 21:51:09 +02:00
Ettore Di Giacinto
9a23fe662b
Update README.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-03-31 19:35:34 +02:00
LocalAI [bot]
6d7ac09e96
chore: ⬆️ Update ggml-org/llama.cpp to 4663bd353c61c1136cd8a97b9908755e4ab30cec ( #5100 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-30 21:59:30 +00:00
Ettore Di Giacinto
c2a39e3639
fix(llama.cpp): properly handle sigterm ( #5099 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-30 18:08:29 +02:00
Ettore Di Giacinto
ae625a4d00
chore(model gallery): add hammer2.0-7b ( #5098 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-30 09:50:21 +02:00
Ettore Di Giacinto
7f3a029596
chore(model gallery): add forgotten-abomination-70b-v5.0 ( #5097 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-30 09:48:24 +02:00
Ettore Di Giacinto
b34cf00819
chore(model gallery): add galactic-qwen-14b-exp1 ( #5096 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-30 09:46:51 +02:00
LocalAI [bot]
d4a10b4300
chore: ⬆️ Update ggml-org/llama.cpp to 0bb2919335d00ff0bc79d5015da95c422de51f03 ( #5095 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-29 21:40:45 +00:00
Ettore Di Giacinto
9c74d74f7b
feat(gguf): guess default context size from file ( #5089 )
...
feat(gguf): guess default config file from files
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-29 14:42:14 +01:00
Ettore Di Giacinto
679ee7bea4
chore(model gallery): add chaoticneutrals_very_berry_qwen2_7b ( #5093 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-29 12:34:49 +01:00
Ettore Di Giacinto
77d7dc62c4
chore(model gallery): add tesslate_tessa-t1-3b ( #5092 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-29 12:15:28 +01:00
Ettore Di Giacinto
699519d1fe
chore(model gallery): add tesslate_tessa-t1-7b ( #5091 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-29 12:12:01 +01:00
Ettore Di Giacinto
8faf39d34e
chore(model gallery): add tesslate_tessa-t1-14b ( #5090 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-29 11:58:39 +01:00
Ettore Di Giacinto
5d261a6fcd
chore(model gallery): add tesslate_tessa-t1-32b ( #5088 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-29 11:53:47 +01:00
Ettore Di Giacinto
22d5727089
chore(model gallery): add tarek07_legion-v2.1-llama-70b ( #5087 )
2025-03-29 11:27:06 +01:00
LocalAI [bot]
c965197d6f
chore: ⬆️ Update ggml-org/llama.cpp to b4ae50810e4304d052e630784c14bde7e79e4132 ( #5085 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-28 21:37:18 +00:00
Ettore Di Giacinto
994a6c4939
chore(model gallery): fallen-safeword-70b-r1-v4.1 ( #5084 )
2025-03-28 15:20:38 +01:00
Ettore Di Giacinto
f926d2a72b
chore(model gallery): thoughtless-fallen-abomination-70b-r1-v4.1-i1 ( #5083 )
2025-03-28 15:11:54 +01:00
Ettore Di Giacinto
ddeb9ed93e
chore(model gallery): qwen2.5-14b-instruct-1m-unalign-i1 ( #5082 )
2025-03-28 15:08:33 +01:00
Ettore Di Giacinto
c7e99c7b59
chore(model gallery): gemma-3-starshine-12b-i1 ( #5081 )
2025-03-28 14:50:39 +01:00
Ettore Di Giacinto
6fabc92e56
chore(model gallery): add soob3123_amoral-gemma3-12b-v2 ( #5080 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-28 14:45:02 +01:00
LocalAI [bot]
4645b3c919
chore: ⬆️ Update ggml-org/llama.cpp to 5dec47dcd411fdf815a3708fd6194e2b13d19006 ( #5079 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-27 23:32:33 +00:00
Dave
134fe2705c
fix: ensure git-lfs is present ( #5078 )
...
devcontainer clean builds had issue with git-lfs -- should this be installed for _all_ images for safety?
Signed-off-by: Dave Lee <dave@gray101.com >
2025-03-27 22:23:28 +01:00
LocalAI [bot]
3cca32ba7e
chore: ⬆️ Update ggml-org/llama.cpp to b3298fa47a2d56ae892127ea038942ab1cada190 ( #5077 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-27 10:47:07 +01:00
Ettore Di Giacinto
c069e61b26
chore(model gallery): add textsynth-8b-i1 ( #5076 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-26 14:40:19 +01:00
Ettore Di Giacinto
7fa159e164
chore(model gallery): add blacksheep-24b-i1 ( #5075 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-26 14:37:30 +01:00
Ettore Di Giacinto
5f92025617
chore(model gallery): add gemma-3-glitter-12b-i1 ( #5074 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-26 10:19:00 +01:00
LocalAI [bot]
333e1bc732
chore: ⬆️ Update ggml-org/llama.cpp to ef19c71769681a0b3dde6bc90911728376e5d236 ( #5073 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-26 09:51:20 +01:00
Ettore Di Giacinto
e90b97c144
chore(model gallery): add alamios_mistral-small-3.1-draft-0.5b ( #5071 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-25 10:10:45 +01:00
Ettore Di Giacinto
747eeb1d46
chore(model gallery): add helpingai_helpingai3-raw ( #5070 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-25 10:09:00 +01:00
Ettore Di Giacinto
5d2c53abc0
chore(model gallery): add jdineen_llama-3.1-8b-think ( #5069 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-25 10:06:24 +01:00
LocalAI [bot]
0b1e721242
chore: ⬆️ Update ggml-org/llama.cpp to c95fa362b3587d1822558f7e28414521075f254f ( #5068 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-24 21:37:16 +00:00
Ettore Di Giacinto
8c76a9ce99
chore(model gallery): add dusk_rainbow ( #5066 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-24 09:49:32 +01:00
Ettore Di Giacinto
338321af5b
chore(model gallery): add eximius_persona_5b ( #5065 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-24 09:30:20 +01:00
Ettore Di Giacinto
2774a92484
chore(model gallery): add impish_llama_3b ( #5064 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-24 09:27:04 +01:00
LocalAI [bot]
1a6bfb41a1
chore: ⬆️ Update ggml-org/llama.cpp to 77f9c6bbe55fccd9ea567794024cb80943947901 ( #5062 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-23 21:37:14 +00:00
Ettore Di Giacinto
314981eaf8
chore(model gallery): add fiendish_llama_3b ( #5061 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-23 10:00:19 +01:00
Ettore Di Giacinto
d7266c633d
chore(model gallery): add sicariussicariistuff_x-ray_alpha ( #5060 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-23 09:56:35 +01:00
Ettore Di Giacinto
eb4d5f2b95
chore(model gallery): add mawdistical_mawdistic-nightlife-24b ( #5059 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-23 09:52:50 +01:00
Ettore Di Giacinto
c63b449ad6
chore(model gallery): add huihui-ai_gemma-3-1b-it-abliterated ( #5058 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-23 09:35:05 +01:00
Ettore Di Giacinto
dd4a778c2c
chore(model gallery): add thedrummer_fallen-gemma3-27b-v1 ( #5057 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-23 09:32:58 +01:00
Ettore Di Giacinto
a0896d21d6
chore(model gallery): add thedrummer_fallen-gemma3-12b-v1 ( #5056 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-23 09:31:37 +01:00
Ettore Di Giacinto
0e697f951a
chore(model gallery): add thedrummer_fallen-gemma3-4b-v1 ( #5055 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-23 09:30:17 +01:00
Ettore Di Giacinto
fa4bb9082d
chore(model gallery): add knoveleng_open-rs3 ( #5054 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-23 09:27:27 +01:00
LocalAI [bot]
8ff7b15441
chore: ⬆️ Update ggml-org/llama.cpp to ba932dfb50cc694645b1a148c72f8c06ee080b17 ( #5053 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-22 22:18:55 +00:00
LocalAI [bot]
dd45f85a20
chore: ⬆️ Update ggml-org/llama.cpp to 4375415b4abf94fb36a5fd15f233ac0ee23c0bd1 ( #5052 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-21 21:36:25 +00:00
Ettore Di Giacinto
decdd9e522
chore(model gallery): add luvgpt_phi3-uncensored-chat ( #5051 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-21 09:11:07 +01:00
Ettore Di Giacinto
31a21d4a2c
chore(model gallery): add sao10k_llama-3.3-70b-vulpecula-r1 ( #5050 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-21 09:08:55 +01:00
Ettore Di Giacinto
2c129843a7
chore(model gallery): add qwen-writerdemo-7b-s500-i1 ( #5049 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-21 09:03:39 +01:00
LocalAI [bot]
ce71a0bcfb
chore: ⬆️ Update ggml-org/llama.cpp to e04643063b3d240b8c0fdba98677dff6ba346784 ( #5047 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-20 21:34:51 +00:00
Ettore Di Giacinto
0a32c38317
chore(model gallery): add basic function template for gemma
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-20 09:32:21 +01:00
Ettore Di Giacinto
36f596f260
chore(model gallery): add soob3123_amoral-gemma3-4b ( #5046 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-20 09:30:04 +01:00
Ettore Di Giacinto
953552545b
chore(model gallery): add samsungsailmontreal_bytecraft ( #5045 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-20 09:27:33 +01:00
Ettore Di Giacinto
835e55b1de
chore(model gallery): add rootxhacker_apollo-v3-32b ( #5044 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-20 09:20:42 +01:00
Ettore Di Giacinto
dcd2921eaa
chore(model gallery): add gemma-3-4b-it-uncensored-dbl-x-i1 ( #5043 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-20 09:17:20 +01:00
LocalAI [bot]
5e6459fd18
chore: ⬆️ Update ggml-org/llama.cpp to 568013d0cd3d5add37c376b3d5e959809b711fc7 ( #5042 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-19 21:47:18 +00:00
Ettore Di Giacinto
50ddb3eb59
chore(model gallery): add nvidia_llama-3_3-nemotron-super-49b-v1 ( #5041 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-19 09:37:27 +01:00
Ettore Di Giacinto
5eebfee4b5
chore(model gallery): add gryphe_pantheon-rp-1.8-24b-small-3.1 ( #5040 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-19 09:32:47 +01:00
Ettore Di Giacinto
567919ea90
chore(model gallery): add mistralai_mistral-small-3.1-24b-instruct-2503 ( #5039 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-19 09:29:23 +01:00
LocalAI [bot]
27a3997530
chore(model-gallery): ⬆️ update checksum ( #5036 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-19 09:18:40 +01:00
LocalAI [bot]
192ba2c657
chore: ⬆️ Update ggml-org/llama.cpp to d84635b1b085d54d6a21924e6171688d6e3dfb46 ( #5035 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-18 22:23:39 +00:00
Ettore Di Giacinto
92abac9ca8
chore(model gallery): add soob3123_amoral-gemma3-12b ( #5034 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-18 09:38:05 +01:00
Ettore Di Giacinto
04ebbbd73a
chore(model gallery): add mlabonne_gemma-3-4b-it-abliterated ( #5033 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-18 09:36:14 +01:00
Ettore Di Giacinto
55305e0d95
chore(model gallery): add mlabonne_gemma-3-12b-it-abliterated ( #5032 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-18 09:32:41 +01:00
Ettore Di Giacinto
67623639e4
chore(model gallery): add mlabonne_gemma-3-27b-it-abliterated ( #5031 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-18 09:30:25 +01:00
LocalAI [bot]
cc76def342
chore: ⬆️ Update ggml-org/llama.cpp to b1b132efcba216c873715c483809730bb253f4a1 ( #5029 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-17 21:43:15 +00:00
Ettore Di Giacinto
4967fa5928
chore(model gallery): disable gemma3 mmproj
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-17 12:34:21 +01:00
Ettore Di Giacinto
2b98e4ec56
chore(model gallery): update gemma3 URLs
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-17 12:22:35 +01:00
Ettore Di Giacinto
fa1d058ee2
chore(model gallery): add mproj files for gemma3 models ( #5028 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-17 12:11:46 +01:00
Ettore Di Giacinto
a49a588bfa
chore(model gallery): add readyart_forgotten-safeword-70b-3.6 ( #5027 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-17 11:50:34 +01:00
LocalAI [bot]
ca7dda61c6
chore: ⬆️ Update ggml-org/llama.cpp to 8ba95dca2065c0073698afdfcda4c8a8f08bf0d9 ( #5026 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-16 21:42:17 +00:00
Ettore Di Giacinto
ffedddd76d
chore(model gallery): add beaverai_mn-2407-dsk-qwqify-v0.1-12b ( #5024 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-16 09:33:19 +01:00
Ettore Di Giacinto
766c76ae8e
chore(model gallery): add pocketdoc_dans-sakurakaze-v1.0.0-12b ( #5023 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-16 09:29:48 +01:00
LocalAI [bot]
3096ff33e9
chore: ⬆️ Update ggml-org/llama.cpp to f4c3dd5daa3a79f713813cf1aabdc5886071061d ( #5022 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-15 21:43:48 +00:00
Ettore Di Giacinto
90a7451da4
chore(model gallery): add allura-org_bigger-body-70b ( #5021 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-15 14:43:51 +01:00
LocalAI [bot]
529a4b9ee8
chore: ⬆️ Update ggml-org/llama.cpp to 9f2250ba722738ec0e6ab684636268a79160c854 ( #5019 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-14 21:45:54 +00:00
Ettore Di Giacinto
0567e104eb
chore(model gallery): add eurollm-9b-instruct ( #5017 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-14 09:25:44 +01:00
Ettore Di Giacinto
ecbeacd022
chore(model gallery): add prithivmlmods_viper-coder-32b-elite13 ( #5016 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-14 09:20:27 +01:00
Ettore Di Giacinto
2772960e41
chore(model gallery): add nousresearch_deephermes-3-llama-3-3b-preview ( #5015 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-14 09:16:17 +01:00
Ettore Di Giacinto
1b694191e2
chore(model gallery): add nousresearch_deephermes-3-mistral-24b-preview ( #5014 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-14 09:13:27 +01:00
Ettore Di Giacinto
69578a5f8f
chore(model gallery): add models/qgallouedec_gemma-3-27b-it-codeforces-sft ( #5013 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-14 09:11:13 +01:00
LocalAI [bot]
7d96cfe72b
chore: ⬆️ Update ggml-org/llama.cpp to 84d547554123a62e9ac77107cb20e4f6cc503af4 ( #5011 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-13 22:30:17 +00:00
Ettore Di Giacinto
423514a5a5
fix(clip): do not imply GPU offload by default ( #5010 )
...
* fix(clip): do not imply GPUs by default
Until a better solution is found upstream, be conservative and default
to GPU.
https://github.com/ggml-org/llama.cpp/pull/12322
https://github.com/ggml-org/llama.cpp/pull/12322#issuecomment-2720970695
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* allow to override gpu via backend options
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-13 15:14:11 +01:00
Ettore Di Giacinto
12568c7d6d
chore(model gallery): add gemma-3-1b-it ( #5009 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-13 09:48:40 +01:00
Ettore Di Giacinto
8d16a0a536
chore(model gallery): add gemma-3-4b-it ( #5008 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-13 09:47:01 +01:00
Ettore Di Giacinto
87ca801f00
chore(model gallery): add gemma-3-12b-it ( #5007 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-13 09:44:49 +01:00
Ettore Di Giacinto
e4ecbb6c30
chore(model gallery): add gemma-3-27b-it ( #5003 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-13 08:28:28 +01:00
LocalAI [bot]
b1a67de2b9
chore: ⬆️ Update ggml-org/llama.cpp to f08f4b3187b691bb08a8884ed39ebaa94e956707 ( #5006 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-13 01:01:30 +00:00
LocalAI [bot]
71a23910fe
chore: ⬆️ Update ggml-org/llama.cpp to 80a02aa8588ef167d616f76f1781b104c245ace0 ( #5004 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-12 16:26:09 +00:00
LocalAI [bot]
0ede31f9cf
chore: ⬆️ Update ggml-org/llama.cpp to 10f2e81809bbb69ecfe64fc8b4686285f84b0c07 ( #4996 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-12 14:13:04 +00:00
Ettore Di Giacinto
9f5dcf2d1e
feat(aio): update AIO image defaults ( #5002 )
...
* feat(aio): update AIO image defaults
cpu:
- text-to-text: llama3.1
- embeddings: granite-embeddings
- vision: moonream2
gpu/intel:
- text-to-text: localai-functioncall-qwen2.5-7b-v0.5
- embeddings: granite-embeddings
- vision: minicpm
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat(aio): use minicpm as moondream2 stopped working
https://github.com/ggml-org/llama.cpp/pull/12322#issuecomment-2717483759
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-12 12:55:06 +01:00
Ettore Di Giacinto
e878556e98
chore(model gallery): add trashpanda-org_qwq-32b-snowdrop-v0 ( #5000 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-12 08:26:09 +01:00
Ettore Di Giacinto
b096928172
chore(model gallery): add open-r1_olympiccoder-7b ( #4999 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-12 08:24:35 +01:00
Ettore Di Giacinto
db7442ae67
chore(model gallery): add open-r1_olympiccoder-32b ( #4998 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-12 08:23:01 +01:00
Ettore Di Giacinto
b6cd430e08
chore(model gallery): add thedrummer_gemmasutra-small-4b-v1 ( #4997 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-12 08:19:51 +01:00
LocalAI [bot]
478e50cda2
chore: ⬆️ Update ggml-org/llama.cpp to 2c9f833d17bb5b8ea89dec663b072b5420fc5438 ( #4991 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-03-11 11:19:03 +00:00
Ettore Di Giacinto
1db2b9943c
chore(deps): Bump grpcio to 1.71.0 ( #4993 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-11 09:44:21 +01:00
Ettore Di Giacinto
ac41aa8b67
chore(model gallery): add openpipe_deductive-reasoning-qwen-32b ( #4995 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-11 09:44:07 +01:00
Ettore Di Giacinto
156a98e2e7
chore(model gallery): add openpipe_deductive-reasoning-qwen-14b ( #4994 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-11 09:40:38 +01:00
dependabot[bot]
d88ec1209e
chore(deps): Bump docs/themes/hugo-theme-relearn from 4a4b60e to 9a020e7 ( #4988 )
...
chore(deps): Bump docs/themes/hugo-theme-relearn
Bumps [docs/themes/hugo-theme-relearn](https://github.com/McShelby/hugo-theme-relearn ) from `4a4b60e` to `9a020e7`.
- [Release notes](https://github.com/McShelby/hugo-theme-relearn/releases )
- [Commits](4a4b60ef04...9a020e7ead )
---
updated-dependencies:
- dependency-name: docs/themes/hugo-theme-relearn
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-03-11 09:39:04 +01:00
dependabot[bot]
fde8dbfc80
chore(deps): Bump appleboy/ssh-action from 1.2.1 to 1.2.2 ( #4978 )
...
Bumps [appleboy/ssh-action](https://github.com/appleboy/ssh-action ) from 1.2.1 to 1.2.2.
- [Release notes](https://github.com/appleboy/ssh-action/releases )
- [Changelog](https://github.com/appleboy/ssh-action/blob/master/.goreleaser.yaml )
- [Commits](https://github.com/appleboy/ssh-action/compare/v1.2.1...v1.2.2 )
---
updated-dependencies:
- dependency-name: appleboy/ssh-action
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-03-11 08:29:31 +01:00
Ettore Di Giacinto
879dc73eba
Revert "chore(deps): Bump intel-extension-for-pytorch from 2.3.110+xpu to 2.6.10+xpu in /backend/python/diffusers" ( #4992 )
...
Revert "chore(deps): Bump intel-extension-for-pytorch from 2.3.110+xpu to 2.6…"
This reverts commit 1dfc52de16 .
2025-03-11 08:29:05 +01:00
dependabot[bot]
1dfc52de16
chore(deps): Bump intel-extension-for-pytorch from 2.3.110+xpu to 2.6.10+xpu in /backend/python/diffusers ( #4973 )
...
chore(deps): Bump intel-extension-for-pytorch
Bumps intel-extension-for-pytorch from 2.3.110+xpu to 2.6.10+xpu.
---
updated-dependencies:
- dependency-name: intel-extension-for-pytorch
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-03-10 21:14:43 +00:00
Ettore Di Giacinto
1331129485
fix(routes): do not gate generated artifacts via key ( #4971 )
...
fix(routes): do not gate generated images via key
We generate unique uris for images.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-10 15:58:25 +01:00
Ettore Di Giacinto
1cd98062e5
chore(model gallery): add hyperllama3.1-v2-i1 ( #4970 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-10 10:52:36 +01:00
Ettore Di Giacinto
9791d9b77a
chore(model gallery): add opencrystal-l3-15b-v2.1-i1 ( #4969 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-10 10:50:02 +01:00
Ettore Di Giacinto
8956452a45
chore(model gallery): add llmevollama-3.1-8b-v0.1-i1 ( #4968 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-10 10:46:49 +01:00
LocalAI [bot]
f3659fa49c
chore: ⬆️ Update ggml-org/llama.cpp to 1e2f78a00450593e2dfa458796fcdd9987300dfc ( #4966 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-09 21:44:52 +00:00
Ettore Di Giacinto
585f2be793
chore(model gallery): add tower-babel_babel-9b-chat ( #4964 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-09 12:09:56 +01:00
LocalAI [bot]
d13f160222
chore: ⬆️ Update ggml-org/llama.cpp to 0fd7ca7a210bd4abc995cd728491043491dbdef7 ( #4963 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-08 21:41:26 +00:00
Ettore Di Giacinto
db5495b9d7
chore(model gallery): add goppa-ai_goppa-logillama ( #4962 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-08 11:37:40 +01:00
Ettore Di Giacinto
3def1ae232
chore(model gallery): add huihui-ai_qwq-32b-abliterated ( #4961 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-08 11:34:37 +01:00
Ettore Di Giacinto
c6ebead8e5
chore(model gallery): add steelskull_l3.3-electra-r1-70b ( #4960 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-08 11:23:42 +01:00
LocalAI [bot]
cff4a950e0
chore: ⬆️ Update ggml-org/llama.cpp to 7ab364390f92b0b8d83f69821a536b424838f3f8 ( #4959 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-07 22:54:28 +00:00
Ettore Di Giacinto
e4fa894153
fix(llama.cpp): correctly handle embeddings in batches ( #4957 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-07 19:29:52 +01:00
Ettore Di Giacinto
69caccfa82
chore(model gallery): add granite embeddings models ( #4956 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-06 23:17:40 +01:00
Ettore Di Giacinto
ab50c13160
chore(model gallery): add nomic-embed-text-v1.5 ( #4955 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-06 23:10:39 +01:00
LocalAI [bot]
56d4e82b14
chore: ⬆️ Update ggml-org/llama.cpp to 3d652bfddfba09022525067e672c3c145c074649 ( #4954 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-06 21:54:14 +00:00
Ettore Di Giacinto
09b5bd48bc
chore(model gallery): add rombo-org_rombo-llm-v3.1-qwq-32b ( #4953 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-06 10:42:45 +01:00
Ettore Di Giacinto
957dcfb6a9
chore(model gallery): add qwen_qwq-32b ( #4952 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-06 10:28:03 +01:00
Ettore Di Giacinto
67f7bffd18
chore(deps): update llama.cpp and sync with upstream changes ( #4950 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-06 00:40:58 +01:00
Ettore Di Giacinto
de81b42b49
feat(ui): remove api key handling and small ui adjustments ( #4948 )
...
* chore(ui): drop set api key button
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(ui): shore in-progress installs in model view
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat(ui): improve text to image view
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-05 19:37:36 +01:00
Ettore Di Giacinto
06eb7e9fa7
chore(model gallery): add llama-3.3-magicalgirl-2.5-i1 ( #4946 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-05 09:35:48 +01:00
Ettore Di Giacinto
45bc1ac566
chore(model gallery): add lolzinventor_meta-llama-3.1-8b-survivev3 ( #4945 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-05 09:33:41 +01:00
Ettore Di Giacinto
02aafeff75
chore(model gallery): add llama-3.1-8b-instruct-uncensored-delmat-i1 ( #4944 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-05 09:31:09 +01:00
Ettore Di Giacinto
6b46c52789
feat(ui): complete design overhaul ( #4942 )
...
This PR changes entirely the UI look and feeling. It updates all
sections and makes it also mobile-ready.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-05 08:27:03 +01:00
LocalAI [bot]
d732e261a4
chore: ⬆️ Update ggml-org/llama.cpp to 5bbe6a9fe9a8796a9389c85accec89dbc4d91e39 ( #4943 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-04 21:46:40 +00:00
Ettore Di Giacinto
807c574e91
chore(model gallery): add azura-qwen2.5-32b-i1 ( #4941 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-04 10:33:15 +01:00
Ettore Di Giacinto
bb171a39b3
chore(model gallery): add llama-3.3-magicalgirl-2 ( #4940 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-04 10:07:38 +01:00
Ettore Di Giacinto
941a4fc50e
chore(model gallery): add boomer_qwen_72b-i1 ( #4939 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-04 10:01:23 +01:00
Ettore Di Giacinto
afe65bd7bf
chore(model gallery): add l3.3-geneticlemonade-unleashed-70b-i1 ( #4938 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-04 09:56:31 +01:00
Ettore Di Giacinto
6f9762049c
chore(model gallery): update qihoo360_tinyr1-32b-preview ( #4937 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-04 09:53:23 +01:00
LocalAI [bot]
122970d70d
chore: ⬆️ Update ggml-org/llama.cpp to dfd6b2c0be191b3abe2fd9c1b25deff01c6249d8 ( #4936 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-03 21:50:31 +00:00
dependabot[bot]
8664b1c7a2
chore(deps): Bump docs/themes/hugo-theme-relearn from 02bba0f to 4a4b60e ( #4934 )
...
chore(deps): Bump docs/themes/hugo-theme-relearn
Bumps [docs/themes/hugo-theme-relearn](https://github.com/McShelby/hugo-theme-relearn ) from `02bba0f` to `4a4b60e`.
- [Release notes](https://github.com/McShelby/hugo-theme-relearn/releases )
- [Commits](02bba0f199...4a4b60ef04 )
---
updated-dependencies:
- dependency-name: docs/themes/hugo-theme-relearn
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-03-03 19:56:41 +00:00
Ettore Di Giacinto
c92166f38a
chore(model gallery): add steelskull_l3.3-mokume-gane-r1-70b-v1.1 ( #4933 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-03 09:22:22 +01:00
LocalAI [bot]
d616058b12
chore: ⬆️ Update ggml-org/llama.cpp to 14dec0c2f29ae56917907dbf2eed6b19438d0a0e ( #4932 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-02 22:27:01 +00:00
Ettore Di Giacinto
a7b4001b75
feat: allow to specify a reply prefix ( #4931 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-02 16:07:32 +01:00
Ettore Di Giacinto
ff85f01459
chore(model gallery): add thedrummer_fallen-llama-3.3-r1-70b-v1 ( #4930 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-02 10:29:28 +01:00
Ettore Di Giacinto
695f81a08b
chore(model gallery): add qihoo360_tinyr1-32b-preview ( #4929 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-02 10:24:17 +01:00
Ettore Di Giacinto
326be287da
chore(model gallery): add ibm-granite_granite-3.2-2b-instruct ( #4928 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-02 10:22:35 +01:00
Ettore Di Giacinto
0404d98190
chore(model gallery): add ibm-granite_granite-3.2-8b-instruct ( #4927 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-02 10:19:27 +01:00
LocalAI [bot]
0a8ec1eb22
chore: ⬆️ Update ggml-org/llama.cpp to 1782cdfed60952f9ff333fc2ab5245f2be702453 ( #4926 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-03-02 10:02:49 +01:00
Ettore Di Giacinto
d860932dcd
fix(chatml): add endoftext stopword
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-01 21:16:10 +01:00
Ettore Di Giacinto
1cb137bd2d
fix(deephermes): correct typo
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-03-01 17:07:12 +01:00
Ettore Di Giacinto
3c279e5568
chore(model gallery): add allenai_olmocr-7b-0225-preview ( #4924 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-01 09:10:04 +01:00
Ettore Di Giacinto
fb55e3df57
chore(model gallery): add ozone-research_0x-lite ( #4923 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-01 09:07:01 +01:00
Ettore Di Giacinto
de46fb6e2e
chore(model gallery): add ozone-research_chirp-01 ( #4922 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-01 09:05:03 +01:00
Ettore Di Giacinto
d7a0e3c5ea
chore(model gallery): add microsoft_phi-4-mini-instruct ( #4921 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-03-01 08:58:01 +01:00
LocalAI [bot]
0533ea817d
chore: ⬆️ Update ggml-org/llama.cpp to 06c2b1561d8b882bc018554591f8c35eb04ad30e ( #4920 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-28 22:18:14 +00:00
Ettore Di Giacinto
755e4fb5f4
feat(ui): improvements to index and models page ( #4918 )
...
- mobile-friendly index
- adjust color palette
- improve search experience
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-28 19:23:32 +01:00
LocalAI [bot]
e4fdde158f
chore: ⬆️ Update ggml-org/llama.cpp to b95c8af37ccf169b0a3216b7ed691af0534e5091 ( #4916 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-28 00:00:39 +00:00
Ettore Di Giacinto
6d0712fa6d
fix(ui): not all models comes from gallery ( #4915 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-27 19:12:41 +01:00
Ettore Di Giacinto
bbbb28e3ca
fix(models): unify usecases identifications ( #4914 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-27 15:51:12 +01:00
Ettore Di Giacinto
3bf2e9d065
fix(ui): not all models have an Icon ( #4913 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-27 10:52:19 +01:00
Ettore Di Giacinto
1461fd8777
chore(model gallery): add locutusque_thespis-llama-3.1-8b ( #4912 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-27 10:02:44 +01:00
LocalAI [bot]
054860539a
chore: ⬆️ Update ggml-org/llama.cpp to a800ae46da2ed7dac236aa6bf2b595da6b6294b5 ( #4911 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-26 22:43:49 +00:00
Ettore Di Giacinto
c87870b18e
feat(ui): improve chat interface ( #4910 )
...
* feat(ui): show more informations in the chat view, minor adjustments to model gallery
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(ui): UI improvements
Visual improvements and bugfixes including:
- disable pagination during search
- fix scrolling on new message
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-26 18:27:18 +01:00
Ettore Di Giacinto
5ad2be9c45
feat(ui): small improvements to chat interface ( #4907 )
...
- Change chat colors
- Improve layout on small windows
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-26 11:10:40 +01:00
LocalAI [bot]
61a24746a1
chore: ⬆️ Update ggml-org/llama.cpp to d7cfe1ffe0f435d0048a6058d529daf76e072d9c ( #4908 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-25 21:58:37 +00:00
Ettore Di Giacinto
d557eb9361
chore(model gallery): add latitudegames_wayfarer-large-70b-llama-3.3 ( #4903 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-25 10:21:54 +01:00
Ettore Di Giacinto
a9a1a361a9
chore(model gallery): add perplexity-ai_r1-1776-distill-llama-70b ( #4902 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-25 09:59:21 +01:00
Ettore Di Giacinto
12d070af80
chore(model gallery): add sicariussicariistuff_phi-line_14b ( #4901 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-25 09:56:44 +01:00
LocalAI [bot]
8d40557bc8
chore: ⬆️ Update ggml-org/llama.cpp to 7a2c913e66353362d7f28d612fd3c9d51a831eda ( #4899 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-25 09:51:02 +01:00
dependabot[bot]
5a5f3a899a
chore(deps): Bump docs/themes/hugo-theme-relearn from 66bc366 to 02bba0f ( #4898 )
...
chore(deps): Bump docs/themes/hugo-theme-relearn
Bumps [docs/themes/hugo-theme-relearn](https://github.com/McShelby/hugo-theme-relearn ) from `66bc366` to `02bba0f`.
- [Release notes](https://github.com/McShelby/hugo-theme-relearn/releases )
- [Commits](66bc366c47...02bba0f199 )
---
updated-dependencies:
- dependency-name: docs/themes/hugo-theme-relearn
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-02-25 09:50:46 +01:00
dependabot[bot]
a2d1f133c8
chore(deps): Bump appleboy/ssh-action from 1.2.0 to 1.2.1 ( #4896 )
...
Bumps [appleboy/ssh-action](https://github.com/appleboy/ssh-action ) from 1.2.0 to 1.2.1.
- [Release notes](https://github.com/appleboy/ssh-action/releases )
- [Changelog](https://github.com/appleboy/ssh-action/blob/master/.goreleaser.yaml )
- [Commits](https://github.com/appleboy/ssh-action/compare/v1.2.0...v1.2.1 )
---
updated-dependencies:
- dependency-name: appleboy/ssh-action
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-02-24 21:01:39 +00:00
LocalAI [bot]
0ae6420c31
chore: ⬆️ Update ggml-org/llama.cpp to 7ad0779f5de84a68143b2c00ab5dc94a948925d3 ( #4890 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-24 11:26:09 +01:00
Ettore Di Giacinto
3a3e05cf18
chore(model gallery): add flux.1dev-abliteratedv2 ( #4895 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-24 10:11:32 +01:00
Ettore Di Giacinto
6a20388e25
chore(model gallery): add nohobby_l3.3-prikol-70b-extra ( #4894 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-24 09:43:50 +01:00
Ettore Di Giacinto
06c836a937
chore(model gallery): add steelskull_l3.3-san-mai-r1-70b ( #4893 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-24 09:41:06 +01:00
Ettore Di Giacinto
049a13fe78
chore(model gallery): add steelskull_l3.3-cu-mai-r1-70b ( #4892 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-24 09:39:12 +01:00
Ettore Di Giacinto
30bf6c962f
chore(stable-diffusion-ggml): update, adapt upstream changes ( #4889 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-23 08:36:41 +01:00
LocalAI [bot]
a72b3a23c3
chore: ⬆️ Update ggml-org/llama.cpp to a28e0d5eb18c18e6a4598286158f427269b1444e ( #4887 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-23 08:36:25 +01:00
Ettore Di Giacinto
e9971b168a
feat(ui): paginate model gallery ( #4886 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-22 21:38:00 +01:00
Ettore Di Giacinto
5b59b5e0c1
chore(model gallery): add steelskull_l3.3-mokume-gane-r1-70b ( #4885 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-22 18:58:06 +01:00
Ettore Di Giacinto
8cfd712428
chore(model gallery): add arcee-ai_arcee-maestro-7b-preview ( #4884 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-22 11:32:25 +01:00
Ettore Di Giacinto
21f7faa80d
chore(model gallery): add ozone-ai_reverb-7b ( #4883 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-22 11:28:27 +01:00
Ettore Di Giacinto
a6a0121118
chore(model gallery): add rombo-org_rombo-llm-v3.0-qwen-72b ( #4882 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-22 11:19:04 +01:00
LocalAI [bot]
ba66aa33c5
chore: ⬆️ Update ggml-org/llama.cpp to 51f311e057723b7454d0ebe20f545a1a2c4db6b2 ( #4881 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-21 21:51:02 +00:00
Ettore Di Giacinto
8fc024a770
chore(model gallery): add pocketdoc_dans-personalityengine-v1.2.0-24b ( #4880 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-21 10:00:23 +01:00
Ettore Di Giacinto
52aa9d08aa
chore(model gallery): add l3.1-8b-rp-ink ( #4879 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-21 09:56:57 +01:00
Ettore Di Giacinto
4c9379c39e
chore(model gallery): add smirki_uigen-t1.1-qwen-7b ( #4878 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-21 09:54:42 +01:00
Ettore Di Giacinto
0ff2c39364
chore(model gallery): add smirki_uigen-t1.1-qwen-14b ( #4877 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-21 09:52:20 +01:00
LocalAI [bot]
1af7e5dc49
chore: ⬆️ Update ggml-org/llama.cpp to c392e5094deaf2d1a7c18683214f007fad3fe42b ( #4876 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-20 22:03:52 +00:00
Ettore Di Giacinto
af3bb64e42
fix(coqui): pin transformers ( #4875 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-20 16:16:54 +01:00
Ettore Di Giacinto
77281f836e
chore(model gallery): add internlm_oreal-7b ( #4874 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-20 15:57:21 +01:00
Ettore Di Giacinto
550275811d
chore(model gallery): add internlm_oreal-deepseek-r1-distill-qwen-7b ( #4873 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-20 15:55:13 +01:00
Ettore Di Giacinto
c27ce6c54d
chore(model gallery): add internlm_oreal-32b ( #4872 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-20 15:52:28 +01:00
Ettore Di Giacinto
ac4991b069
chore(docs): update sponsor logo
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-20 15:31:41 +01:00
Ettore Di Giacinto
25bee71bb8
feat(ui): do also filter tts and image models ( #4871 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-20 15:02:18 +01:00
LocalAI [bot]
b993780a3b
chore: ⬆️ Update ggml-org/llama.cpp to d04e7163c85a847bc61d58c22f2c503596db7aa8 ( #4870 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-20 09:42:57 +01:00
Ettore Di Giacinto
ea0c9f1168
feat(ui): show only text models in the chat interface ( #4869 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-19 17:34:30 +01:00
Ettore Di Giacinto
08311f275a
chore(model gallery): add sentientagi_dobby-unhinged-llama-3.3-70b ( #4868 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-19 10:36:36 +01:00
Ettore Di Giacinto
4de0f2f737
chore(model gallery): add open-r1_openr1-qwen-7b ( #4867 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-19 10:04:01 +01:00
Ettore Di Giacinto
42ae807c41
chore(model gallery): add pygmalionai_pygmalion-3-12b ( #4866 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-19 10:02:35 +01:00
LocalAI [bot]
94593ba4c3
chore: ⬆️ Update ggml-org/llama.cpp to 63e489c025d61c7ca5ec06c5d10f36e2b76aaa1d ( #4865 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-19 09:19:28 +01:00
Brandon Beiler
6a6e1a0ea9
feat(vllm): Additional vLLM config options (Disable logging, dtype, and Per-Prompt media limits) ( #4855 )
...
* Adding the following vLLM config options: disable_log_status, dtype, limit_mm_per_prompt
Signed-off-by: TheDropZone <brandonbeiler@gmail.com >
* using " marks in the config.yaml file
Signed-off-by: TheDropZone <brandonbeiler@gmail.com >
* adding in missing colon
Signed-off-by: TheDropZone <brandonbeiler@gmail.com >
---------
Signed-off-by: TheDropZone <brandonbeiler@gmail.com >
2025-02-18 19:27:58 +01:00
Ettore Di Giacinto
5b19af99ff
feat(ui): detect model usage and display link ( #4864 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-18 19:27:07 +01:00
Ettore Di Giacinto
28fb8e607a
chore(model gallery): add nbeerbower_dumpling-qwen2.5-72b ( #4862 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-18 12:44:59 +01:00
Ettore Di Giacinto
bb85b6ef00
feat: improve ui models list in the index ( #4863 )
...
* feat(ui): improve index
- Redirect to the chat view when clicking on a model
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Display chat icon nearby the model
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-18 12:44:44 +01:00
Ettore Di Giacinto
b9b5a635ca
chore(model gallery): add nbeerbower_dumpling-qwen2.5-32b-v2 ( #4861 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-18 11:53:23 +01:00
Ettore Di Giacinto
131ea5b627
chore(model gallery): add nbeerbower_dumpling-qwen2.5-14b ( #4860 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-18 11:51:29 +01:00
Ettore Di Giacinto
fac70e9642
chore(model gallery): add allenai_llama-3.1-tulu-3.1-8b ( #4859 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-18 11:49:26 +01:00
Ettore Di Giacinto
7e76ea40fb
chore(model gallery): add kubeguru-llama3.2-3b-v0.1 ( #4858 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-18 11:47:00 +01:00
LocalAI [bot]
de09ae42ef
chore: ⬆️ Update ggml-org/llama.cpp to 73e2ed3ce3492d3ed70193dd09ae8aa44779651d ( #4854 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-18 09:11:07 +01:00
Ettore Di Giacinto
6424f0666d
chore(deps): Bump edgevpn to v0.30.1 ( #4840 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-17 16:51:22 +01:00
Ettore Di Giacinto
f3ae94ca70
chore: update Image generation docs and examples ( #4841 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-17 16:51:06 +01:00
LocalAI [bot]
09c9f67a02
chore: ⬆️ Update ggml-org/llama.cpp to 2eea03d86a2d132c8245468c26290ce07a27a8e8 ( #4839 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-17 10:55:30 +01:00
Ettore Di Giacinto
c264ca542d
fix(ci): update repository for llama.cpp
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-02-17 09:33:34 +01:00
Bas Hulsken
bbf30d416d
fix: change initialization order of llama-cpp-avx512 to go before avx2 variant ( #4837 )
...
changed to initialization order of the avx512 version of llama.cpp, now tries before avx2
Signed-off-by: Bas Hulsken <bhulsken@hotmail.com >
2025-02-17 09:32:21 +01:00
Ettore Di Giacinto
27617a1b06
chore(model gallery): add ozone-ai_0x-lite ( #4835 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-16 09:23:26 +01:00
Ettore Di Giacinto
e84081769e
chore(ci): cleanup before pulling images again
2025-02-16 09:20:22 +01:00
LocalAI [bot]
20119fc580
docs: ⬆️ update docs version mudler/LocalAI ( #4834 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-15 22:45:11 +00:00
Ettore Di Giacinto
09941c0bfb
chore(docs): update license year
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-15 18:17:15 +01:00
Ettore Di Giacinto
cabe0f4993
chore(model gallery): add davidbrowne17_llamathink-8b-instruct ( #4833 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-15 17:31:46 +01:00
Ettore Di Giacinto
1977c7f190
chore(model gallery): add pygmalionai_eleusis-12b ( #4832 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-15 17:21:30 +01:00
Ettore Di Giacinto
061e7c4eae
chore(model gallery): add rombo-org_rombo-llm-v3.0-qwen-32b ( #4830 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-15 10:58:27 +01:00
LocalAI [bot]
5313e660f6
chore: ⬆️ Update ggerganov/llama.cpp to 300907b2110cc17b4337334dc397e05de2d8f5e0 ( #4829 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-14 21:51:49 +00:00
Ettore Di Giacinto
9e32fda304
fix(llama.cpp): improve context shift handling ( #4820 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-14 14:55:03 +01:00
Ettore Di Giacinto
83202cae54
chore(model gallery): add nousresearch_deephermes-3-llama-3-8b-preview ( #4828 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-14 12:25:00 +01:00
Ettore Di Giacinto
d96addfa9d
chore(model gallery): add open-thoughts_openthinker-32b ( #4827 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-14 12:03:46 +01:00
Ettore Di Giacinto
a715fe588d
chore(model gallery): add sicariussicariistuff_phi-lthy4 ( #4826 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-14 11:57:14 +01:00
LocalAI [bot]
2ac4a86bb4
chore: ⬆️ Update ggerganov/llama.cpp to 8a8c4ceb6050bd9392609114ca56ae6d26f5b8f5 ( #4825 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-13 21:49:57 +00:00
Ettore Di Giacinto
8670d480a6
chore(model gallery): add nvidia_aceinstruct-72b ( #4822 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-13 09:58:34 +01:00
Ettore Di Giacinto
af0b4ff237
chore(ci): update labels
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-02-13 09:58:19 +01:00
Ettore Di Giacinto
e694764065
chore(model gallery): add nvidia_aceinstruct-7b ( #4821 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-13 09:44:53 +01:00
Ettore Di Giacinto
f3c27e0381
chore(model gallery): add nvidia_aceinstruct-1.5b ( #4819 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-13 09:33:40 +01:00
LocalAI [bot]
bf44319d0d
chore: ⬆️ Update ggerganov/llama.cpp to 0fb77f821f6e70ad8b8247a97d1022f0fef78991 ( #4814 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-12 22:41:53 +00:00
Ettore Di Giacinto
5b133a640b
chore(model gallery): add theskullery_l3.3-exp-unnamed-model-70b-v0.5 ( #4813 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-12 11:05:51 +01:00
Ettore Di Giacinto
0030a3fe75
chore(model gallery): add simplescaling_s1.1-32b ( #4812 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-12 11:03:05 +01:00
Ettore Di Giacinto
0a748b009e
chore(ci): avoit cache hits until the ci gRPC job is fixed
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-12 09:11:40 +01:00
LocalAI [bot]
257e951def
chore: ⬆️ Update ggerganov/llama.cpp to 90e4dba461b07e635fd1daf3b491c978c7dd0013 ( #4810 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-12 00:13:28 +01:00
LocalAI [bot]
fbd82a2dd0
feat(swagger): update swagger ( #4809 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-11 21:54:40 +00:00
Ettore Di Giacinto
5db321dad2
chore(ci): do not always regenerate the cache
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-11 16:36:00 +01:00
Ettore Di Giacinto
f5638a6354
feat(diffusers): allow to override image gen options ( #4807 )
...
Use the options field in the model to override kwargs if needed.
This allows to specify from the model yaml config:
```yaml
options:
- foo:bar
```
And each option will be used directly when calling the diffusers
pipeline, e.g:
```python
pipe(
foo="bar",
)
```
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-11 10:16:32 +01:00
Ettore Di Giacinto
5f64cc6328
Revert "chore(deps): Bump docs/themes/lotusdocs from f5785a2 to 975da91" ( #4808 )
...
Revert "chore(deps): Bump docs/themes/lotusdocs from `f5785a2` to `975da91` (…"
This reverts commit e57b750ca3 .
2025-02-11 10:05:57 +01:00
Ettore Di Giacinto
28b10e8804
chore(swagger): update ( #4805 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-11 09:51:01 +01:00
Ettore Di Giacinto
3277f5095d
chore(model gallery): add agentica-org_deepscaler-1.5b-preview ( #4804 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-11 09:47:19 +01:00
Ettore Di Giacinto
fe3ced2919
chore(ci): try again to bump parallelism in grpc jobs
...
As we moved these out to self-hosted
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-11 09:31:00 +01:00
LocalAI [bot]
45e37a07bb
chore: ⬆️ Update ggerganov/llama.cpp to 19b392d58dc08c366d0b29bd3b9c6991fa4e1662 ( #4803 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-11 09:17:02 +01:00
dependabot[bot]
e57b750ca3
chore(deps): Bump docs/themes/lotusdocs from f5785a2 to 975da91 ( #4801 )
...
Bumps [docs/themes/lotusdocs](https://github.com/colinwilson/lotusdocs ) from `f5785a2` to `975da91`.
- [Release notes](https://github.com/colinwilson/lotusdocs/releases )
- [Commits](f5785a2399...975da91e83 )
---
updated-dependencies:
- dependency-name: docs/themes/lotusdocs
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-02-10 22:27:14 +00:00
Ettore Di Giacinto
49df492268
chore(ci): run grpc build on self-hosted
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-10 19:44:50 +01:00
Ettore Di Giacinto
516cd660f1
chore(grpcio): reduce parallelism ( #4799 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-10 18:56:13 +01:00
Ettore Di Giacinto
8fd3ace9a1
chore(grpcio): bump to 1.70 ( #4798 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-10 18:38:53 +01:00
Ettore Di Giacinto
099469cb05
chore(tests): decrease parallelism for gRPC builds ( #4797 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-10 12:59:59 +01:00
Ettore Di Giacinto
6be8c0c618
chore(model gallery): add localai-functioncall-qwen2.5-7b-v0.5 ( #4796 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-10 12:07:35 +01:00
Dave
3cddf24747
feat: Centralized Request Processing middleware ( #3847 )
...
* squash past, centralize request middleware PR
Signed-off-by: Dave Lee <dave@gray101.com >
* migrate bruno request files to examples repo
Signed-off-by: Dave Lee <dave@gray101.com >
* fix
Signed-off-by: Dave Lee <dave@gray101.com >
* Update tests/e2e-aio/e2e_test.go
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
---------
Signed-off-by: Dave Lee <dave@gray101.com >
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-02-10 12:06:16 +01:00
Ettore Di Giacinto
c330360785
chore(model gallery): add ilsp_llama-krikri-8b-instruct ( #4795 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-10 09:54:54 +01:00
LocalAI [bot]
8cd51570e5
chore: ⬆️ Update ggerganov/llama.cpp to 19d3c8293b1f61acbe2dab1d49a17950fd788a4a ( #4793 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-09 22:12:01 +00:00
Ettore Di Giacinto
0e7aa5cd15
chore(model gallery): add subtleone_qwen2.5-32b-erudite-writer ( #4792 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-09 10:59:46 +01:00
Ettore Di Giacinto
e06a5f49de
chore(model gallery): add huihui-ai_deepseek-r1-distill-llama-70b-abliterated ( #4790 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-09 10:53:48 +01:00
Dave
fb2f847507
chore: migrate bruno request files to examples repo ( #4788 )
...
migrate bruno request files to examples repo
Signed-off-by: Dave Lee <dave@gray101.com >
2025-02-09 10:52:28 +01:00
LocalAI [bot]
e01acc88c9
chore: ⬆️ Update ggerganov/llama.cpp to e6e658319952f7ad269dc11275b9edddc721fc6d ( #4787 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-08 21:57:40 +00:00
LocalAI [bot]
7a5912908a
chore: ⬆️ Update ggerganov/llama.cpp to d2fe216fb2fb7ca8627618c9ea3a2e7886325780 ( #4780 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-08 09:44:34 +01:00
Ettore Di Giacinto
4b1b942a7f
chore(model gallery): add sicariussicariistuff_redemption_wind_24b ( #4781 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-08 09:04:18 +01:00
Ettore Di Giacinto
230fe0098f
chore(model gallery): add cognitivecomputations_dolphin3.0-mistral-24b ( #4779 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-07 13:33:24 +01:00
Ettore Di Giacinto
cc163429dc
chore(model gallery): add cognitivecomputations_dolphin3.0-r1-mistral-24b ( #4778 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-07 13:31:49 +01:00
Ettore Di Giacinto
f670e0a91c
chore(model gallery): add nohobby_l3.3-prikol-70b-v0.5 ( #4777 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-07 13:29:53 +01:00
LocalAI [bot]
731674eee7
chore: ⬆️ Update ggerganov/llama.cpp to 8a59053f63fffc24e730cd3ea067760abfe4a919 ( #4776 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-06 22:02:00 +00:00
Ettore Di Giacinto
cc1f6f913f
fix(llama.cpp): disable mirostat as default ( #2911 )
...
Even if increasing the quality of the output, it has shown to have
performance drawbacks to be so noticeable that the confuses users about
speed of LocalAI ( see also
https://github.com/mudler/LocalAI/issues/2780 ).
This changeset disables Mirostat by default (which can
be still enabled manually).
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Co-authored-by: Dave <dave@gray101.com >
2025-02-06 19:39:59 +01:00
Ettore Di Giacinto
7f90ff7aec
chore(llama-ggml): drop deprecated backend ( #4775 )
...
The GGML format is now dead, since in the next version of LocalAI we
already bring many breaking compatibility changes, taking the occasion
also to drop ggml support (pre-gguf).
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-06 18:36:23 +01:00
Ettore Di Giacinto
8d45670e41
fix(openai): consistently return stop reason ( #4771 )
...
We were not returning a stop reason when no tool was actually called
(even if specified).
Fixes: https://github.com/mudler/LocalAI/issues/4716
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-06 12:41:08 +01:00
Ettore Di Giacinto
e4b8ddb6a1
chore(model gallery): add black-ink-guild_pernicious_prophecy_70b ( #4774 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-06 12:03:59 +01:00
Ettore Di Giacinto
a801561f81
chore(model gallery): add tiger-lab_qwen2.5-32b-instruct-cft ( #4773 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-06 12:01:56 +01:00
Ettore Di Giacinto
16ced07102
chore(model gallery): add arliai_llama-3.3-70b-arliai-rpmax-v1.4 ( #4772 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-06 11:59:14 +01:00
LocalAI [bot]
d35595372d
chore: ⬆️ Update ggerganov/llama.cpp to d774ab3acc4fee41fbed6dbfc192b57d5f79f34b ( #4770 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-06 09:02:51 +01:00
LocalAI [bot]
81be192279
chore: ⬆️ Update leejet/stable-diffusion.cpp to d46ed5e184b97c2018dc2e8105925bdb8775e02c ( #4769 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-05 23:49:15 +00:00
Ettore Di Giacinto
28a1310890
chore(docs): enhance visibility
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-05 19:50:32 +01:00
Ettore Di Giacinto
2a702e9ca4
chore(docs): small updates
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-05 19:49:11 +01:00
Ettore Di Giacinto
3ecaea1b6e
chore(docs): update sponsors in the website
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-05 19:41:55 +01:00
Ettore Di Giacinto
7daf5ac3e3
fix(gallery): do not return overrides and additional config ( #4768 )
...
When hitting /models/available we are intersted in the model
description, name and small metadatas. Configuration and overrides are
part of internals which are required only for installation.
This also solves a current bug when hitting /models/available fails if
one of the gallery items have overrides with parameters defined
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-05 18:37:09 +01:00
Ettore Di Giacinto
7bc80c17f8
chore(model gallery): add LocalAI-functioncall-llama3.2-3b-v0.5 ( #4766 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-05 10:19:31 +01:00
Ettore Di Giacinto
1996ceb293
chore(model gallery): add krutrim-ai-labs_krutrim-2-instruct ( #4765 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-05 10:17:05 +01:00
Ettore Di Giacinto
0bc3dc43da
chore(model gallery): add rubenroy_gilgamesh-72b ( #4764 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-05 10:13:21 +01:00
Ettore Di Giacinto
3324c4e6cb
chore(model gallery): add agi-0_art-skynet-3b ( #4763 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-05 10:09:33 +01:00
LocalAI [bot]
7329db4e78
chore: ⬆️ Update ggerganov/llama.cpp to 3ec9fd4b77b6aca03a3c2bf678eae3f9517d6904 ( #4762 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-04 21:48:49 +00:00
Ettore Di Giacinto
464686aee6
chore(model gallery): add suayptalha_maestro-10b ( #4760 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-04 09:51:54 +01:00
Ettore Di Giacinto
bfa3d4ccff
chore(model gallery): add nohobby_l3.3-prikol-70b-v0.4 ( #4759 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-04 09:50:18 +01:00
Ettore Di Giacinto
6a91288c8c
chore(model gallery): add fblgit_miniclaus-qw1.5b-unamgs-grpo ( #4758 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-04 09:45:52 +01:00
dependabot[bot]
96cb407ee0
chore(deps): Bump docs/themes/hugo-theme-relearn from 5bcb9fe to 66bc366 ( #4750 )
...
chore(deps): Bump docs/themes/hugo-theme-relearn
Bumps [docs/themes/hugo-theme-relearn](https://github.com/McShelby/hugo-theme-relearn ) from `5bcb9fe` to `66bc366`.
- [Release notes](https://github.com/McShelby/hugo-theme-relearn/releases )
- [Commits](5bcb9fe5e6...66bc366c47 )
---
updated-dependencies:
- dependency-name: docs/themes/hugo-theme-relearn
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-02-04 08:57:19 +01:00
dependabot[bot]
5a19094d3a
chore(deps): Bump sentence-transformers from 3.4.0 to 3.4.1 in /backend/python/transformers ( #4748 )
...
chore(deps): Bump sentence-transformers in /backend/python/transformers
Bumps [sentence-transformers](https://github.com/UKPLab/sentence-transformers ) from 3.4.0 to 3.4.1.
- [Release notes](https://github.com/UKPLab/sentence-transformers/releases )
- [Commits](https://github.com/UKPLab/sentence-transformers/compare/v3.4.0...v3.4.1 )
---
updated-dependencies:
- dependency-name: sentence-transformers
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-02-04 08:56:51 +01:00
LocalAI [bot]
e3b943ffcb
chore: ⬆️ Update ggerganov/llama.cpp to 5598f475be3e31430fbe17ebb85654ec90dc201e ( #4757 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-04 08:56:11 +01:00
dependabot[bot]
df30d6a482
chore(deps): Bump GrantBirki/git-diff-action from 2.7.0 to 2.8.0 ( #4746 )
...
Bumps [GrantBirki/git-diff-action](https://github.com/grantbirki/git-diff-action ) from 2.7.0 to 2.8.0.
- [Release notes](https://github.com/grantbirki/git-diff-action/releases )
- [Commits](https://github.com/grantbirki/git-diff-action/compare/v2.7.0...v2.8.0 )
---
updated-dependencies:
- dependency-name: GrantBirki/git-diff-action
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-02-03 22:21:40 +00:00
Ettore Di Giacinto
c3c27b7e3d
chore(model gallery): small fixups to llama3.2-fcall template
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-03 17:58:57 +01:00
Ettore Di Giacinto
431716d4d6
fix(gallery): remove box token to llama3.2-fcall
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-02-03 16:10:33 +01:00
Ettore Di Giacinto
d290fd159f
chore(model gallery): add LocalAI-functioncall-llama3.2-1b-v0.4 ( #4740 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-03 15:55:49 +01:00
Ettore Di Giacinto
051faaf771
chore(model gallery): add uncensoredai_uncensoredlm-deepseek-r1-distill-qwen-14b ( #4739 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-03 10:46:47 +01:00
Ettore Di Giacinto
41a2dfb0d9
chore(model gallery): add thedrummer_gemmasutra-pro-27b-v1.1 ( #4738 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-03 10:37:24 +01:00
Ettore Di Giacinto
ed0094c3d0
chore(model gallery): add steelskull_l3.3-damascus-r1 ( #4737 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-03 10:30:07 +01:00
LocalAI [bot]
52fadeded1
feat(swagger): update swagger ( #4735 )
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-03 10:16:42 +01:00
LocalAI [bot]
a37fa8d9c4
chore: ⬆️ Update ggerganov/llama.cpp to 90f9b88afb6447d3929843a2aa98c0f11074762d ( #4736 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-02 22:18:30 +00:00
Shraddha
03974a4dd4
feat: tokenization with llama.cpp ( #4724 )
...
feat: tokenization
Signed-off-by: shraddhazpy <shraddha@shraddhafive.in >
2025-02-02 17:39:43 +00:00
Ettore Di Giacinto
1d6afbd65d
feat(llama.cpp): Add support to grammar triggers ( #4733 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-02 13:25:03 +01:00
LocalAI [bot]
d79f02ea09
chore: ⬆️ Update ggerganov/llama.cpp to 53debe6f3c9cca87e9520a83ee8c14d88977afa4 ( #4732 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-02-01 21:45:26 +00:00
Ettore Di Giacinto
ba2f426e3e
chore(model gallery): add fuseo1-deekseekr1-qwq-skyt1-32b-preview ( #4731 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-02-01 10:12:15 +01:00
LocalAI [bot]
732042e5c6
chore: ⬆️ Update ggerganov/llama.cpp to aa6fb1321333fae8853d0cdc26bcb5d438e650a1 ( #4728 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-01-31 22:31:00 +00:00
Ettore Di Giacinto
f1763aabf2
chore(model gallery): add taid-llm-1.5b ( #4727 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-01-31 14:53:39 +01:00
Ettore Di Giacinto
e0d90b173b
chore(model gallery): add tinyswallow-1.5b-instruct ( #4726 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-01-31 14:49:02 +01:00
Ettore Di Giacinto
ff07612bfa
chore(model gallery): add mistral-small-24b-instruct-2501 ( #4725 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-01-31 14:45:42 +01:00
LocalAI [bot]
7badaf78a0
chore: ⬆️ Update ggerganov/llama.cpp to 8b576b6c55bc4e6be898b47522f0ef402b93ef62 ( #4722 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-01-31 11:31:46 +00:00
Ettore Di Giacinto
af41436f1b
fix(tests): pin to branch for config used in tests ( #4721 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-01-31 09:57:58 +01:00
LocalAI [bot]
cd5489ce47
chore(model-gallery): ⬆️ update checksum ( #4723 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-01-31 08:51:32 +01:00
Ettore Di Giacinto
60ec2cf751
chore(model gallery): add openthinker-7b ( #4720 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-01-30 16:44:44 +01:00
Ettore Di Giacinto
244f4b564f
chore(model gallery): add selene-1-mini-llama-3.1-8b ( #4719 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-01-30 16:42:48 +01:00
Ettore Di Giacinto
f1d6d65417
chore(model gallery): add virtuoso-lite ( #4718 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-01-30 16:38:35 +01:00
Ettore Di Giacinto
72e52c4f6a
chore: drop embedded models ( #4715 )
...
Since the remote gallery was introduced this is now completely
superseded by it. In order to keep the code clean and remove redudant
parts let's simplify the usage.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-01-30 00:03:01 +01:00
LocalAI [bot]
1656e1a88e
chore: ⬆️ Update ggerganov/llama.cpp to eb7cf15a808d4d7a71eef89cc6a9b96fe82989dc ( #4717 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-01-29 21:45:38 +00:00
Ettore Di Giacinto
7f62b418a4
chore(docs): add documentation for l4t images
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-01-29 15:16:07 +01:00
Maximilian Kenfenheuer
1f4e66d638
chore(model gallery): add specific message templates for llama3.2 based models ( #4707 )
...
* chore(model gallery): add specific message templates for llama3.2 based models
Signed-off-by: Maximilian Kenfenheuer <maximilian.kenfenheuer@ksol.it >
* fix: yaml lint in llama3.2-quantized.yaml
Signed-off-by: Maximilian Kenfenheuer <maximilian.kenfenheuer@ksol.it >
* fix: yaml lint in llama3.2-quantized.yaml
Signed-off-by: Maximilian Kenfenheuer <maximilian.kenfenheuer@ksol.it >
---------
Signed-off-by: Maximilian Kenfenheuer <maximilian.kenfenheuer@ksol.it >
2025-01-29 10:19:48 +01:00
Maximilian Kenfenheuer
a37b2c765c
docs: update advanced-usage.md to reflect changes in #4700 ( #4709 )
...
Signed-off-by: Maximilian Kenfenheuer <maximilian.kenfenheuer@ksol.it >
2025-01-28 22:58:35 +01:00
Maximilian Kenfenheuer
b4b67e00bd
refactor: function argument parsing using named regex ( #4708 )
...
Signed-off-by: Maximilian Kenfenheuer <maximilian.kenfenheuer@ksol.it >
2025-01-28 22:58:02 +01:00
LocalAI [bot]
91e1ff5a95
chore: ⬆️ Update ggerganov/llama.cpp to cae9fb4361138b937464524eed907328731b81f6 ( #4711 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-01-28 21:45:14 +00:00
dependabot[bot]
d9204ea3b5
chore(deps): Bump dependabot/fetch-metadata from 2.2.0 to 2.3.0 ( #4701 )
...
Bumps [dependabot/fetch-metadata](https://github.com/dependabot/fetch-metadata ) from 2.2.0 to 2.3.0.
- [Release notes](https://github.com/dependabot/fetch-metadata/releases )
- [Commits](https://github.com/dependabot/fetch-metadata/compare/v2.2.0...v2.3.0 )
---
updated-dependencies:
- dependency-name: dependabot/fetch-metadata
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-01-28 11:50:09 +01:00
LocalAI [bot]
3d0fbcb4f7
chore: ⬆️ Update ggerganov/llama.cpp to a4417ddda98fd0558fb4d802253e68a933704b59 ( #4705 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-01-28 09:13:43 +01:00
dependabot[bot]
03f3df9a82
chore(deps): Bump docs/themes/hugo-theme-relearn from 8dad5ee to 5bcb9fe ( #4704 )
...
chore(deps): Bump docs/themes/hugo-theme-relearn
Bumps [docs/themes/hugo-theme-relearn](https://github.com/McShelby/hugo-theme-relearn ) from `8dad5ee` to `5bcb9fe`.
- [Release notes](https://github.com/McShelby/hugo-theme-relearn/releases )
- [Commits](8dad5ee419...5bcb9fe5e6 )
---
updated-dependencies:
- dependency-name: docs/themes/hugo-theme-relearn
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-01-28 09:13:00 +01:00
dependabot[bot]
fff35d5528
chore(deps): Bump sentence-transformers from 3.3.1 to 3.4.0 in /backend/python/transformers ( #4702 )
...
chore(deps): Bump sentence-transformers in /backend/python/transformers
Bumps [sentence-transformers](https://github.com/UKPLab/sentence-transformers ) from 3.3.1 to 3.4.0.
- [Release notes](https://github.com/UKPLab/sentence-transformers/releases )
- [Commits](https://github.com/UKPLab/sentence-transformers/compare/v3.3.1...v3.4.0 )
---
updated-dependencies:
- dependency-name: sentence-transformers
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-01-27 21:09:50 +00:00
Maximilian Kenfenheuer
539e94db73
feat: function argument parsing using named regex ( #4700 )
...
Signed-off-by: Maximilian Kenfenheuer <maximilian.kenfenheuer@ksol.it >
2025-01-27 15:53:05 +00:00
Ettore Di Giacinto
0f4f62cf3c
chore(model gallery): add fuseo1-deepseekr1-qwq-32b-preview ( #4699 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-01-27 09:51:06 +01:00
Ettore Di Giacinto
e7cffd7afa
chore(model gallery): add fuseo1-deepseekr1-qwen2.5-instruct-32b-preview ( #4698 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-01-27 09:31:47 +01:00
Ettore Di Giacinto
26d790a2b6
chore(model gallery): add fuseo1-deepseekr1-qwen2.5-coder-32b-preview-v0.1 ( #4697 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-01-27 09:28:29 +01:00
Ettore Di Giacinto
5cf838c08d
chore(model gallery): add confucius-o1-14b ( #4696 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-01-27 09:26:00 +01:00
LocalAI [bot]
4db8f5cbce
chore: ⬆️ Update ggerganov/llama.cpp to 178a7eb952d211b8d4232d5e50ae1b64519172a9 ( #4694 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-01-26 21:44:54 +00:00
Ettore Di Giacinto
3b6b37a81b
chore(model gallery): add deepseek-r1-qwen-2.5-32b-ablated ( #4693 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-01-26 10:06:06 +01:00
Ettore Di Giacinto
8f5aa2d9de
chore(model gallery): add dumpling-qwen2.5-32b ( #4692 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-01-26 10:03:46 +01:00
Ettore Di Giacinto
a6bc8aa7c7
chore(model gallery): add l3.3-nevoria-r1-70b ( #4691 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-01-26 10:01:37 +01:00
LocalAI [bot]
4ab107bc1a
chore: ⬆️ Update ggerganov/llama.cpp to 26771a1491f3a4c3d5b99c4c267b81aca9a7dfa0 ( #4690 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-01-25 21:44:14 +00:00
Ettore Di Giacinto
4c3710a531
chore(model gallery): add chuluun-qwen2.5-72b-v0.08 ( #4689 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-01-25 11:07:31 +01:00
Ettore Di Giacinto
901b06284a
chore(model gallery): add art-v0-3b ( #4688 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-01-25 11:06:05 +01:00
Ettore Di Giacinto
8eef5a2c5e
chore(model gallery): add lamarck-14b-v0.7 ( #4687 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-01-25 11:04:12 +01:00
Gianluca Boiano
e9cace137b
chore(model gallery): update deepseek-r1 prompt template ( #4686 )
...
Signed-off-by: Gianluca Boiano <morf3089@gmail.com >
2025-01-25 09:04:38 +01:00
LocalAI [bot]
9409c99738
chore: ⬆️ Update ggerganov/llama.cpp to c5d9effb49649db80a52caf5c0626de6f342f526 ( #4685 )
...
⬆️ Update ggerganov/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-01-24 21:45:54 +00:00
Ettore Di Giacinto
4d44ebc2f2
chore(deps): bump grpcio to 1.70.0 ( #4682 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-01-24 10:18:22 +01:00