mirror of
https://github.com/mudler/LocalAI.git
synced 2026-06-02 13:22:34 -04:00
* feat(parakeet-cpp): L0 backend scaffold, LoadModel + AudioTranscription (text) Add a Go gRPC backend that bridges LocalAI to parakeet.cpp via the flat C-API (parakeet_capi.h), loaded with purego (cgo-less, mirrors the whisper / vibevoice-cpp backends). L0 scope: - main.go: dlopen libparakeet.so (override via PARAKEET_LIBRARY), register the C-API entry points, start the gRPC server. - goparakeetcpp.go: Load (parakeet_capi_load), AudioTranscription (parakeet_capi_transcribe_path, decoder=0 = per-arch default head), Free, serialized through base.SingleThread since the C engine is a thread-unsafe singleton. char* returns are bound as uintptr so the malloc'd buffer is freed via parakeet_capi_free_string after copy. - AudioTranscriptionStream returns a clear "not implemented in L0" error (closes the channel so the server doesn't hang), wired in L2. - Makefile: clone-at-pin + cmake (PARAKEET_VERSION for bump_deps.sh), with a local-symlink dev shortcut; run.sh / package.sh mirror whisper. - Test auto-skips without PARAKEET_BACKEND_TEST_MODEL/_WAV fixtures. Builds clean (CGO_ENABLED=0), gofmt clean, test passes. The single unsafeptr vet note in goStringFromCPtr is documented and matches the whisper backend's tolerated pattern. Word/segment timestamps (L1) and cache-aware streaming (L2) follow. Assisted-by: Claude:claude-opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(parakeet-cpp): L1 word/segment timestamps via transcribe_path_json AudioTranscription now calls parakeet_capi_transcribe_path_json and shapes the per-word / per-token timestamps into the TranscriptResult: - Bind parakeet_capi_transcribe_path_json (purego, char* as uintptr like the other returns) and register it in main.go + the test loader. - Parse the JSON document ({"text","words":[{w,start,end,conf}], "tokens":[{id,t,conf}]}) into typed structs. - Synthesise a single whole-clip segment (parakeet emits no native segment boundaries) spanning the first word start to the last word end; token ids populate Segment.Tokens. - Attach word-level timings only when timestamp_granularities=["word"], matching the OpenAI API (segment-level default). secondsToNanos mirrors the whisper backend's nanosecond convention. Verified end-to-end against tdt_ctc-110m (f16): both the default and word-granularity specs pass; builds clean, gofmt clean, vet shows only the one documented unsafeptr note shared with the whisper backend. Cache-aware streaming (L2) follows. Assisted-by: Claude:claude-opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(parakeet-cpp): L2 cache-aware streaming with EOU segmentation Wire AudioTranscriptionStream to the streaming RNN-T C-API: - Bind parakeet_capi_stream_{begin,feed,finalize,free}; feed takes 16 kHz mono float PCM ([]float32 via purego) and writes *eou_out on <EOU>/<EOB>. - Decode opts.Dst to 16 kHz mono PCM (utils.AudioToWav + go-audio, same as the whisper backend), feed it in 1 s chunks, and emit each newly-finalized text run as a TranscriptStreamResponse delta. - <EOU>/<EOB> events close the current segment; a closing FinalResult carries the full transcript plus the per-utterance segments (with a whole-clip fallback segment when no EOU fired). - stream_begin returns 0 for non-streaming models, surfaced as a clear error instead of an empty stream. Honours context cancellation between chunks. Frees every malloc'd delta and the session. Verified end-to-end against realtime_eou_120m-v1 (f16): the streamed transcript matches the offline 110m reference word-for-word, deltas reconstruct the final text, and the spec passes alongside the offline specs. Builds clean, gofmt clean, vet shows only the shared documented unsafeptr note. Assisted-by: Claude:claude-opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(parakeet-cpp): L3 register backend in build/CI/gallery (whisper parity) Wire the new Go gRPC parakeet-cpp backend (parakeet.cpp ggml port of NVIDIA NeMo Parakeet ASR) into LocalAI's build/CI/gallery surfaces, matching the existing ggml whisper Go backend 1:1. - .github/backend-matrix.yml: add 11 linux entries + 1 darwin entry mirroring every whisper build (cpu amd64/arm64, intel sycl f32/f16, vulkan amd64/arm64, nvidia cuda-12, nvidia cuda-13, nvidia-l4t-arm64, nvidia-l4t-cuda-13-arm64, rocm hipblas, metal-darwin-arm64), all on ./backend/Dockerfile.golang with backend: "parakeet-cpp" and -*-parakeet-cpp tag-suffixes. - scripts/changed-backends.js: explicit inferBackendPath branch resolving parakeet-cpp to backend/go/parakeet-cpp/ before the generic golang branch. - .github/workflows/bump_deps.yaml: track the PARAKEET_VERSION pin in backend/go/parakeet-cpp/Makefile (repo mudler/parakeet.cpp, branch master). - backend/index.yaml: add ¶keetcpp meta + latest/development image entries for every matrix tag-suffix. - Makefile: add backends/parakeet-cpp to .NOTPARALLEL, BACKEND_PARAKEET_CPP definition, docker-build target eval, and test-extra-backend-parakeet-cpp- transcription target (mirrors test-extra-backend-whisper-transcription). Assisted-by: Claude:claude-opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(parakeet-cpp): L4 gallery importer for parakeet GGUFs Add ParakeetCppImporter so parakeet.cpp GGUFs auto-detect on /import-model and route to the parakeet-cpp backend (it also surfaces in /backends/known, which drives the import dropdown). - Match is narrow: a .gguf whose name carries a parakeet architecture token (<arch>-<size>-<quant>.gguf, e.g. tdt_ctc-110m-f16.gguf, rnnt-0.6b-q4_k.gguf, realtime_eou_120m-v1-q8_0.gguf), a direct URL to one, or preferences.backend="parakeet-cpp". It deliberately does NOT claim arbitrary llama-style GGUFs, nor the upstream nvidia/parakeet-* NeMo repos (.nemo, not runnable here). - Registered in the ASR batch BEFORE LlamaCPPImporter so its GGUFs aren't swallowed by the generic .gguf importer. - Import nests files under parakeet-cpp/models/<name>/, defaults to the smallest quant (q4_k, near-lossless on parakeet) with a size-ladder fallback, and honours preferences.quantizations / name / description. Tested with synthetic HF details (no network): metadata, positive matches (HF repo, direct URL, preference), narrowness negatives (llama GGUF, NeMo repo), and import (default quant, override, direct URL), 9 specs pass, build/vet/gofmt clean. Assisted-by: Claude:claude-opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * docs(parakeet-cpp): document the parakeet-cpp transcription backend Add parakeet-cpp to the audio-to-text backend list and a dedicated usage section: direct GGUF import (auto-detects to the backend), model YAML, word-level timestamps via timestamp_granularities[]=word, and cache-aware streaming with the realtime_eou model. Points at the mudler/parakeet-cpp-gguf collection repo. Assisted-by: Claude:claude-opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ci(parakeet-cpp): wire transcription gRPC e2e test into test-extra The L3 commit added the test-extra-backend-parakeet-cpp-transcription Makefile target but never invoked it in CI. Mirror the whisper job: - Add a parakeet-cpp output to detect-changes (emitted by changed-backends.js from the matrix entry). - Add tests-parakeet-cpp-grpc-transcription, gated on the parakeet-cpp path filter / run-all, building the backend image and running the transcription e2e against tdt_ctc-110m + the JFK clip. Assisted-by: Claude:claude-opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * style(parakeet-cpp): drop em dashes from comments and docs Replace em dashes with plain punctuation in the backend comments, the importer, package.sh, and the audio-to-text docs section (and use "and" instead of the multiplication sign). No behaviour change. Assisted-by: Claude:claude-opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(gallery): add parakeet-cpp f16 models to the model gallery Add the 10 NVIDIA Parakeet models (f16, the recommended quality/speed default) as gallery entries that install on the parakeet-cpp backend from mudler/parakeet-cpp-gguf: tdt_ctc-110m/1.1b, tdt-0.6b-v2/v3, tdt-1.1b, ctc-0.6b/1.1b, rnnt-0.6b/1.1b, and the cache-aware streaming realtime_eou_120m-v1. Each pins the file sha256 and routes transcript usecases to the backend. Assisted-by: Claude:claude-opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(parakeet-cpp): satisfy govet lint + bump PARAKEET_VERSION - goparakeetcpp.go: //nolint:govet on the C-owned-pointer unsafe.Pointer conversion (golangci-lint reports new-only issues, so unlike the whisper backend's identical line this one is flagged). - Makefile: bump PARAKEET_VERSION to the current parakeet.cpp master commit (the previous pin's commit no longer exists after upstream history was squashed), so the backend image clone/build resolves again. Assisted-by: Claude:claude-opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(parakeet-cpp): pin PARAKEET_VERSION to a tag-stable commit The previous SHA pin was orphaned when parakeet.cpp's single-commit master was amended/force-pushed, so the backend image clone (git fetch <sha>) failed across every build variant. Repoint to 845c29e, which upstream now keeps permanently fetchable via the `localai-backend-pin` tag, so future upstream amends no longer break the backend build. Assisted-by: Claude:claude-opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(parakeet-cpp): init the ggml submodule in the backend image clone The backend Dockerfile clones parakeet.cpp at PARAKEET_VERSION with a shallow fetch + checkout but never initialised submodules, so third_party/ggml was empty and the parakeet.cpp cmake build failed at `add_subdirectory(third_party/ggml)` (CMakeLists.txt:53) on every build variant. Add `git submodule update --init --recursive --depth 1 --single-branch` after checkout, mirroring the whisper backend. Verified locally: clone + submodule + cmake configure now succeeds. Assisted-by: Claude:claude-opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(parakeet-cpp): statically link ggml into libparakeet.so The shared libparakeet.so linked ggml's shared libs (libggml*.so), but the package only ships libparakeet.so, so at runtime dlopen failed with "libggml.so.0: cannot open shared object file" (the e2e transcription test panicked on load). Build ggml static + PIC (BUILD_SHARED_LIBS=OFF, CMAKE_POSITION_INDEPENDENT_CODE=ON) so libparakeet.so embeds ggml and depends only on system libs already present in the runtime image. Verified locally: ldd shows no libggml dependency. Assisted-by: Claude:claude-opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(parakeet-cpp): non-streaming fallback in AudioTranscriptionStream The e2e streaming test ran AudioTranscriptionStream against tdt_ctc-110m (not a cache-aware streaming model), so stream_begin returned 0 and the call errored. Per LocalAI's streaming contract (and the whisper backend), a non-streaming model should fall back to a single offline transcription emitted as one delta plus a closing FinalResult. Do that instead of erroring, so the streaming endpoint works for every parakeet model. Verified locally: the streaming spec passes against the non-streaming 110m model via fallback. Assisted-by: Claude:claude-opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Co-authored-by: Ettore Di Giacinto <mudler@localai.io>
4214 lines
123 KiB
YAML
4214 lines
123 KiB
YAML
---
|
|
# Matrix data for backend container image builds.
|
|
# Consumed by scripts/changed-backends.js for both backend.yml and backend_pr.yml.
|
|
# This file is NOT a workflow — it has no top-level 'on:' or 'jobs:'.
|
|
|
|
# Linux matrix (consumed by backend-jobs).
|
|
include:
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-diffusers'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
skip-drivers: 'true'
|
|
backend: "diffusers"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-vllm'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'true'
|
|
backend: "vllm"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-sglang'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'true'
|
|
backend: "sglang"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-diffusers'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'true'
|
|
backend: "diffusers"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-chatterbox'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'true'
|
|
backend: "chatterbox"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-moonshine'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'true'
|
|
backend: "moonshine"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# tinygrad ships a single image — its CPU device uses bundled
|
|
# libLLVM, and its CUDA / HIP / Metal devices dlopen the host
|
|
# driver libraries at runtime via tinygrad's ctypes autogen
|
|
# wrappers. There is no toolkit-version split because tinygrad
|
|
# generates kernels itself (PTX renderer for CUDA) and never
|
|
# links against cuDNN/cuBLAS/torch.
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-tinygrad'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'true'
|
|
backend: "tinygrad"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-whisperx'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'true'
|
|
backend: "whisperx"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-whisperx'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'true'
|
|
backend: "whisperx"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-faster-whisper'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'true'
|
|
backend: "faster-whisper"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-faster-whisper'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'true'
|
|
backend: "faster-whisper"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-ace-step'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'true'
|
|
backend: "ace-step"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-trl'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'true'
|
|
backend: "trl"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-llama-cpp-quantization'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'true'
|
|
backend: "llama-cpp-quantization"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-llama-cpp-quantization'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'true'
|
|
backend: "llama-cpp-quantization"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-mlx'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'true'
|
|
backend: "mlx"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-mlx-vlm'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'true'
|
|
backend: "mlx-vlm"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-mlx-audio'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'true'
|
|
backend: "mlx-audio"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-mlx-distributed'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'true'
|
|
backend: "mlx-distributed"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# CUDA 12 builds
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-vibevoice'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "vibevoice"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-liquid-audio'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "liquid-audio"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-qwen-asr'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "qwen-asr"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-nemo'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "nemo"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-qwen-tts'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "qwen-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-fish-speech'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "fish-speech"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-faster-qwen3-tts'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "faster-qwen3-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-voxcpm'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "voxcpm"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-pocket-tts'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "pocket-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-rerankers'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "rerankers"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-llama-cpp'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-cuda-12-amd64'
|
|
# bigger-runner: cold builds for this entry consistently take 5h+ on
|
|
# ubuntu-latest (observed 5h36m on v4.2.1). Move back to bigger-runner
|
|
# so the build finishes well within GHA's 6h job timeout. Phase 5.3 of
|
|
# the free-tier migration (PR #9730) flipped this to ubuntu-latest as
|
|
# a 'highest-risk batch' with explicit per-entry revert.
|
|
runs-on: 'bigger-runner'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "llama-cpp"
|
|
dockerfile: "./backend/Dockerfile.llama-cpp"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-turboquant'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-cuda-12-amd64'
|
|
# bigger-runner: same rationale as -gpu-nvidia-cuda-12-llama-cpp above
|
|
# (observed 6h5m wall-clock on v4.2.1, just past the 6h job timeout).
|
|
runs-on: 'bigger-runner'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "turboquant"
|
|
dockerfile: "./backend/Dockerfile.turboquant"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-vllm'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "vllm"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-vllm-omni'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "vllm-omni"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-sglang'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "sglang"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-transformers'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "transformers"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-diffusers'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "diffusers"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-ace-step'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "ace-step"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-trl'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "trl"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-kokoro'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "kokoro"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-faster-whisper'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "faster-whisper"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-whisperx'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "whisperx"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "9"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-coqui'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "coqui"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-outetts'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "outetts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-chatterbox'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "chatterbox"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-moonshine'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "moonshine"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-mlx'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "mlx"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-mlx-vlm'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "mlx-vlm"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-mlx-audio'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "mlx-audio"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-mlx-distributed'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "mlx-distributed"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-stablediffusion-ggml'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "stablediffusion-ggml"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-sam3-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "sam3-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-rfdetr-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "rfdetr-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-whisper'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "whisper"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-parakeet-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "parakeet-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-acestep-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "acestep-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-qwen3-tts-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "qwen3-tts-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-vibevoice-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "vibevoice-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-rfdetr'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "rfdetr"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-insightface'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "insightface"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-speaker-recognition'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "speaker-recognition"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-neutts'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "neutts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# cuda 13
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-rerankers'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "rerankers"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-vibevoice'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "vibevoice"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-liquid-audio'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "liquid-audio"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-qwen-asr'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "qwen-asr"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-nemo'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "nemo"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-qwen-tts'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "qwen-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-fish-speech'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "fish-speech"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-faster-qwen3-tts'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "faster-qwen3-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-voxcpm'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "voxcpm"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-pocket-tts'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "pocket-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-llama-cpp'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-cuda-13-amd64'
|
|
# bigger-runner: cold builds for this entry take 5h+ on ubuntu-latest
|
|
# (observed 5h37m on v4.2.1). Same rationale as the cuda-12 variant.
|
|
runs-on: 'bigger-runner'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "llama-cpp"
|
|
dockerfile: "./backend/Dockerfile.llama-cpp"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-turboquant'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-cuda-13-amd64'
|
|
# bigger-runner: observed 6h5m wall-clock on v4.2.1 — at the GHA timeout.
|
|
runs-on: 'bigger-runner'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "turboquant"
|
|
dockerfile: "./backend/Dockerfile.turboquant"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'false'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-llama-cpp'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-cuda-13-arm64'
|
|
base-image: "ubuntu:24.04"
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
ubuntu-version: '2404'
|
|
backend: "llama-cpp"
|
|
dockerfile: "./backend/Dockerfile.llama-cpp"
|
|
context: "./"
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'false'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-turboquant'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-cuda-13-arm64'
|
|
base-image: "ubuntu:24.04"
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
ubuntu-version: '2404'
|
|
backend: "turboquant"
|
|
dockerfile: "./backend/Dockerfile.turboquant"
|
|
context: "./"
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-ds4'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "nvidia/cuda:13.0.0-devel-ubuntu24.04"
|
|
skip-drivers: 'true'
|
|
backend: "ds4"
|
|
dockerfile: "./backend/Dockerfile.ds4"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'true'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-ds4'
|
|
base-image: "nvidia/cuda:13.0.0-devel-ubuntu24.04"
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
ubuntu-version: '2404'
|
|
backend: "ds4"
|
|
dockerfile: "./backend/Dockerfile.ds4"
|
|
context: "./"
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-vllm'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "vllm"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-vllm-omni'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "vllm-omni"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-sglang'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "sglang"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-transformers'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "transformers"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-diffusers'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "diffusers"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-ace-step'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "ace-step"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-trl'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "trl"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-vibevoice'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
ubuntu-version: '2404'
|
|
backend: "vibevoice"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-liquid-audio'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
ubuntu-version: '2404'
|
|
backend: "liquid-audio"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-qwen-asr'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
ubuntu-version: '2404'
|
|
backend: "qwen-asr"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-qwen-tts'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
ubuntu-version: '2404'
|
|
backend: "qwen-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-fish-speech'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
ubuntu-version: '2404'
|
|
backend: "fish-speech"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-faster-qwen3-tts'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
ubuntu-version: '2404'
|
|
backend: "faster-qwen3-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-pocket-tts'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
ubuntu-version: '2404'
|
|
backend: "pocket-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-chatterbox'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
ubuntu-version: '2404'
|
|
backend: "chatterbox"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-diffusers'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
ubuntu-version: '2404'
|
|
backend: "diffusers"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-vllm'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
ubuntu-version: '2404'
|
|
backend: "vllm"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-vllm-omni'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
ubuntu-version: '2404'
|
|
backend: "vllm-omni"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-sglang'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
ubuntu-version: '2404'
|
|
backend: "sglang"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-mlx'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
ubuntu-version: '2404'
|
|
backend: "mlx"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-mlx-vlm'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
ubuntu-version: '2404'
|
|
backend: "mlx-vlm"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-mlx-audio'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
ubuntu-version: '2404'
|
|
backend: "mlx-audio"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-mlx-distributed'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
ubuntu-version: '2404'
|
|
backend: "mlx-distributed"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-whisperx'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
ubuntu-version: '2404'
|
|
backend: "whisperx"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-faster-whisper'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
ubuntu-version: '2404'
|
|
backend: "faster-whisper"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-kokoro'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "kokoro"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-faster-whisper'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "faster-whisper"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-whisperx'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "whisperx"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-chatterbox'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "chatterbox"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-moonshine'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "moonshine"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-mlx'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "mlx"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-mlx-vlm'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "mlx-vlm"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-mlx-audio'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "mlx-audio"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-mlx-distributed'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "mlx-distributed"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-stablediffusion-ggml'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "stablediffusion-ggml"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'false'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-stablediffusion-ggml'
|
|
base-image: "ubuntu:24.04"
|
|
ubuntu-version: '2404'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
backend: "stablediffusion-ggml"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-sam3-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "sam3-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-rfdetr-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "rfdetr-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'false'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-sam3-cpp'
|
|
base-image: "ubuntu:24.04"
|
|
ubuntu-version: '2404'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
backend: "sam3-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'false'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-rfdetr-cpp'
|
|
base-image: "ubuntu:24.04"
|
|
ubuntu-version: '2404'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
backend: "rfdetr-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-whisper'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "whisper"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-parakeet-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "parakeet-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'false'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-whisper'
|
|
base-image: "ubuntu:24.04"
|
|
ubuntu-version: '2404'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
backend: "whisper"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'false'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-parakeet-cpp'
|
|
base-image: "ubuntu:24.04"
|
|
ubuntu-version: '2404'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
backend: "parakeet-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-acestep-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "acestep-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-qwen3-tts-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "qwen3-tts-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-vibevoice-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "vibevoice-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'false'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-acestep-cpp'
|
|
base-image: "ubuntu:24.04"
|
|
ubuntu-version: '2404'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
backend: "acestep-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'false'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-qwen3-tts-cpp'
|
|
base-image: "ubuntu:24.04"
|
|
ubuntu-version: '2404'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
backend: "qwen3-tts-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'false'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-cuda-13-arm64-vibevoice-cpp'
|
|
base-image: "ubuntu:24.04"
|
|
ubuntu-version: '2404'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
backend: "vibevoice-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-rfdetr'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "rfdetr"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# hipblas builds
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-rerankers'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "rerankers"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-llama-cpp'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-rocm-amd64'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "llama-cpp"
|
|
dockerfile: "./backend/Dockerfile.llama-cpp"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-turboquant'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-rocm-amd64'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "turboquant"
|
|
dockerfile: "./backend/Dockerfile.turboquant"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-vllm'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "vllm"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-vllm-omni'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "vllm-omni"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-sglang'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "sglang"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-transformers'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "transformers"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-diffusers'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "diffusers"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-ace-step'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "ace-step"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# ROCm additional backends
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-kokoro'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "kokoro"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-vibevoice'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "vibevoice"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-liquid-audio'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "liquid-audio"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-qwen-asr'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "qwen-asr"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-nemo'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "nemo"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-qwen-tts'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "qwen-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-fish-speech'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "fish-speech"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-voxcpm'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "voxcpm"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-pocket-tts'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "pocket-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-faster-whisper'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "faster-whisper"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-coqui'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "coqui"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# sycl builds
|
|
- build-type: 'intel'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-rerankers'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.2-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "rerankers"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'sycl_f32'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sycl-f32-llama-cpp'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-intel-amd64'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.2-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "llama-cpp"
|
|
dockerfile: "./backend/Dockerfile.llama-cpp"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'sycl_f32'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sycl-f32-turboquant'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-intel-amd64'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "turboquant"
|
|
dockerfile: "./backend/Dockerfile.turboquant"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'sycl_f16'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sycl-f16-llama-cpp'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-intel-amd64'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "llama-cpp"
|
|
dockerfile: "./backend/Dockerfile.llama-cpp"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'sycl_f16'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sycl-f16-turboquant'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-intel-amd64'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "turboquant"
|
|
dockerfile: "./backend/Dockerfile.turboquant"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'intel'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-vllm'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "vllm"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'intel'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sglang'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "sglang"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'intel'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-transformers'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "transformers"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'intel'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-diffusers'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "diffusers"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'intel'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-ace-step'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "ace-step"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-vibevoice'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
skip-drivers: 'true'
|
|
backend: "vibevoice"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-qwen-asr'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
skip-drivers: 'true'
|
|
backend: "qwen-asr"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-qwen-tts'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
skip-drivers: 'true'
|
|
backend: "qwen-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-fish-speech'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
skip-drivers: 'true'
|
|
backend: "fish-speech"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-faster-qwen3-tts'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
skip-drivers: 'true'
|
|
backend: "faster-qwen3-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-pocket-tts'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
skip-drivers: 'true'
|
|
backend: "pocket-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-kokoro'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
skip-drivers: 'true'
|
|
backend: "kokoro"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-mlx'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
skip-drivers: 'true'
|
|
backend: "mlx"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-mlx-vlm'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
skip-drivers: 'true'
|
|
backend: "mlx-vlm"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-mlx-audio'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
skip-drivers: 'true'
|
|
backend: "mlx-audio"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-mlx-distributed'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
skip-drivers: 'true'
|
|
backend: "mlx-distributed"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-whisperx'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
skip-drivers: 'true'
|
|
backend: "whisperx"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-faster-whisper'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
skip-drivers: 'true'
|
|
backend: "faster-whisper"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
# SYCL additional backends
|
|
- build-type: 'intel'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-kokoro'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "kokoro"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'intel'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-faster-whisper'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "faster-whisper"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'intel'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-vibevoice'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "vibevoice"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'intel'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-liquid-audio'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "liquid-audio"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'intel'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-qwen-asr'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "qwen-asr"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'intel'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-nemo'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "nemo"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'intel'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-qwen-tts'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "qwen-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'intel'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-fish-speech'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "fish-speech"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'intel'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-voxcpm'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "voxcpm"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'intel'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-pocket-tts'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "pocket-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'intel'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-coqui'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "coqui"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# piper
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-piper'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "piper"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-piper'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "piper"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-llama-cpp'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-amd64'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "llama-cpp"
|
|
dockerfile: "./backend/Dockerfile.llama-cpp"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-llama-cpp'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-arm64'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "llama-cpp"
|
|
dockerfile: "./backend/Dockerfile.llama-cpp"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-turboquant'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-amd64'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "turboquant"
|
|
dockerfile: "./backend/Dockerfile.turboquant"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-turboquant'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-arm64'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "turboquant"
|
|
dockerfile: "./backend/Dockerfile.turboquant"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-ds4'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "nvidia/cuda:13.0.0-devel-ubuntu24.04"
|
|
skip-drivers: 'true'
|
|
backend: "ds4"
|
|
dockerfile: "./backend/Dockerfile.ds4"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-ds4'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "nvidia/cuda:13.0.0-devel-ubuntu24.04"
|
|
skip-drivers: 'true'
|
|
backend: "ds4"
|
|
dockerfile: "./backend/Dockerfile.ds4"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-ik-llama-cpp'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-amd64'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "ik-llama-cpp"
|
|
dockerfile: "./backend/Dockerfile.ik-llama-cpp"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'false'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-arm64-llama-cpp'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-l4t-cuda-12-arm64'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
backend: "llama-cpp"
|
|
dockerfile: "./backend/Dockerfile.llama-cpp"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'false'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-arm64-turboquant'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-l4t-cuda-12-arm64'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
backend: "turboquant"
|
|
dockerfile: "./backend/Dockerfile.turboquant"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-llama-cpp'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-vulkan-amd64'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "llama-cpp"
|
|
dockerfile: "./backend/Dockerfile.llama-cpp"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-llama-cpp'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-vulkan-arm64'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "llama-cpp"
|
|
dockerfile: "./backend/Dockerfile.llama-cpp"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-turboquant'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-vulkan-amd64'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "turboquant"
|
|
dockerfile: "./backend/Dockerfile.turboquant"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# Stablediffusion-ggml
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-turboquant'
|
|
builder-base-image: 'quay.io/go-skynet/ci-cache:base-grpc-vulkan-arm64'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "turboquant"
|
|
dockerfile: "./backend/Dockerfile.turboquant"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# Stablediffusion-ggml
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-stablediffusion-ggml'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "stablediffusion-ggml"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# sam3-cpp
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-sam3-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "sam3-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'sycl_f32'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sycl-f32-sam3-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "sam3-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'sycl_f16'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sycl-f16-sam3-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "sam3-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-sam3-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "sam3-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-sam3-cpp'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "sam3-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# rfdetr-cpp
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-rfdetr-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "rfdetr-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'sycl_f32'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sycl-f32-rfdetr-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "rfdetr-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'sycl_f16'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sycl-f16-rfdetr-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "rfdetr-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-rfdetr-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "rfdetr-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-rfdetr-cpp'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "rfdetr-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'sycl_f32'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sycl-f32-stablediffusion-ggml'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "stablediffusion-ggml"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'sycl_f16'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sycl-f16-stablediffusion-ggml'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "stablediffusion-ggml"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-stablediffusion-ggml'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "stablediffusion-ggml"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-stablediffusion-ggml'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "stablediffusion-ggml"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'false'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-arm64-stablediffusion-ggml'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
backend: "stablediffusion-ggml"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'false'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-arm64-sam3-cpp'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
backend: "sam3-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'false'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-arm64-rfdetr-cpp'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
backend: "rfdetr-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
# whisper
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-whisper'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "whisper"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-whisper'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "whisper"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'sycl_f32'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sycl-f32-whisper'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "whisper"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'sycl_f16'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sycl-f16-whisper'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "whisper"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-whisper'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "whisper"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-whisper'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "whisper"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'false'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-arm64-whisper'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
backend: "whisper"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-whisper'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
runs-on: 'ubuntu-latest'
|
|
skip-drivers: 'false'
|
|
backend: "whisper"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# parakeet-cpp
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-parakeet-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "parakeet-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-parakeet-cpp'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "parakeet-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'sycl_f32'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sycl-f32-parakeet-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "parakeet-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'sycl_f16'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sycl-f16-parakeet-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "parakeet-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-parakeet-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "parakeet-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-parakeet-cpp'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "parakeet-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'false'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-arm64-parakeet-cpp'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
backend: "parakeet-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-parakeet-cpp'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
runs-on: 'ubuntu-latest'
|
|
skip-drivers: 'false'
|
|
backend: "parakeet-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# acestep-cpp
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-acestep-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "acestep-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-acestep-cpp'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "acestep-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'sycl_f32'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sycl-f32-acestep-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "acestep-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'sycl_f16'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sycl-f16-acestep-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "acestep-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-acestep-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "acestep-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-acestep-cpp'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "acestep-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'false'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-arm64-acestep-cpp'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
backend: "acestep-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-acestep-cpp'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
runs-on: 'ubuntu-latest'
|
|
skip-drivers: 'false'
|
|
backend: "acestep-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# qwen3-tts-cpp
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-qwen3-tts-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "qwen3-tts-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-qwen3-tts-cpp'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "qwen3-tts-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'sycl_f32'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sycl-f32-qwen3-tts-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "qwen3-tts-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'sycl_f16'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sycl-f16-qwen3-tts-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "qwen3-tts-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-qwen3-tts-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "qwen3-tts-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-qwen3-tts-cpp'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "qwen3-tts-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'false'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-arm64-qwen3-tts-cpp'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
backend: "qwen3-tts-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-qwen3-tts-cpp'
|
|
base-image: "rocm/dev-ubuntu-24.04:6.4.4"
|
|
runs-on: 'ubuntu-latest'
|
|
skip-drivers: 'false'
|
|
backend: "qwen3-tts-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# vibevoice-cpp
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-vibevoice-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "vibevoice-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-vibevoice-cpp'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "vibevoice-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-localvqe'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "localvqe"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-localvqe'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "localvqe"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'sycl_f32'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sycl-f32-vibevoice-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "vibevoice-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'sycl_f16'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-sycl-f16-vibevoice-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "vibevoice-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-vibevoice-cpp'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "vibevoice-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-vibevoice-cpp'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "vibevoice-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-localvqe'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "localvqe"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'vulkan'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-vulkan-localvqe'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "localvqe"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'false'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-arm64-vibevoice-cpp'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
backend: "vibevoice-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-vibevoice-cpp'
|
|
base-image: "rocm/dev-ubuntu-24.04:6.4.4"
|
|
runs-on: 'ubuntu-latest'
|
|
skip-drivers: 'false'
|
|
backend: "vibevoice-cpp"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# voxtral
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-voxtral'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "voxtral"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
#opus
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-voxtral'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "voxtral"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
#opus
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-opus'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "opus"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
#silero-vad
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-opus'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "opus"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
#silero-vad
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-silero-vad'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "silero-vad"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# kokoros (Rust TTS)
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-silero-vad'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "silero-vad"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# kokoros (Rust TTS)
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-kokoros'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "kokoros"
|
|
dockerfile: "./backend/Dockerfile.rust"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# local-store
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-local-store'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "local-store"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# rfdetr
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-local-store'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "local-store"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# rfdetr
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-rfdetr'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "rfdetr"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# insightface (face recognition)
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-rfdetr'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "rfdetr"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# insightface (face recognition)
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-insightface'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "insightface"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# speaker-recognition (voice/speaker biometrics)
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-insightface'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "insightface"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# speaker-recognition (voice/speaker biometrics)
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-speaker-recognition'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "speaker-recognition"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-speaker-recognition'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "speaker-recognition"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'intel'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-intel-rfdetr'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "intel/oneapi-basekit:2025.3.0-0-devel-ubuntu24.04"
|
|
skip-drivers: 'false'
|
|
backend: "rfdetr"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'true'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-arm64-rfdetr'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
backend: "rfdetr"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
- build-type: 'l4t'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/arm64'
|
|
skip-drivers: 'true'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-nvidia-l4t-arm64-chatterbox'
|
|
base-image: "nvcr.io/nvidia/l4t-jetpack:r36.4.0"
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
backend: "chatterbox"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2204'
|
|
# runs out of space on the runner
|
|
# - build-type: 'hipblas'
|
|
# cuda-major-version: ""
|
|
# cuda-minor-version: ""
|
|
# platforms: 'linux/amd64'
|
|
# tag-latest: 'auto'
|
|
# tag-suffix: '-gpu-hipblas-rfdetr'
|
|
# base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
# runs-on: 'ubuntu-latest'
|
|
# skip-drivers: 'false'
|
|
# backend: "rfdetr"
|
|
# dockerfile: "./backend/Dockerfile.python"
|
|
# context: "./"
|
|
# kitten-tts
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-kitten-tts'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "kitten-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# neutts
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-kitten-tts'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "kitten-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# neutts
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-neutts'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "neutts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-neutts'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "neutts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: 'hipblas'
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-rocm-hipblas-neutts'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "rocm/dev-ubuntu-24.04:7.2.1"
|
|
skip-drivers: 'false'
|
|
backend: "neutts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-vibevoice'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "vibevoice"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-liquid-audio'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "liquid-audio"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-vibevoice'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "vibevoice"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-qwen-asr'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "qwen-asr"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-qwen-asr'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "qwen-asr"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-nemo'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "nemo"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-nemo'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "nemo"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-qwen-tts'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "qwen-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-qwen-tts'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "qwen-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-fish-speech'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "fish-speech"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-fish-speech'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "fish-speech"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-voxcpm'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "voxcpm"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-pocket-tts'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "pocket-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-pocket-tts'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "pocket-tts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-outetts'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'true'
|
|
backend: "outetts"
|
|
dockerfile: "./backend/Dockerfile.python"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# sherpa-onnx CPU
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/amd64'
|
|
platform-tag: 'amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-sherpa-onnx'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "sherpa-onnx"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# sherpa-onnx CUDA 12
|
|
- build-type: ''
|
|
cuda-major-version: ""
|
|
cuda-minor-version: ""
|
|
platforms: 'linux/arm64'
|
|
platform-tag: 'arm64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-cpu-sherpa-onnx'
|
|
runs-on: 'ubuntu-24.04-arm'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "sherpa-onnx"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# sherpa-onnx CUDA 12
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "12"
|
|
cuda-minor-version: "8"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-12-sherpa-onnx'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "sherpa-onnx"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
# sherpa-onnx CUDA 13 — requires onnxruntime 1.24.x+ for the
|
|
# gpu_cuda13 tarball; sherpa-onnx SHERPA_COMMIT pins to v1.12.39.
|
|
- build-type: 'cublas'
|
|
cuda-major-version: "13"
|
|
cuda-minor-version: "0"
|
|
platforms: 'linux/amd64'
|
|
tag-latest: 'auto'
|
|
tag-suffix: '-gpu-nvidia-cuda-13-sherpa-onnx'
|
|
runs-on: 'ubuntu-latest'
|
|
base-image: "ubuntu:24.04"
|
|
skip-drivers: 'false'
|
|
backend: "sherpa-onnx"
|
|
dockerfile: "./backend/Dockerfile.golang"
|
|
context: "./"
|
|
ubuntu-version: '2404'
|
|
|
|
# Darwin matrix (consumed by backend-jobs-darwin).
|
|
includeDarwin:
|
|
- backend: "diffusers"
|
|
tag-suffix: "-metal-darwin-arm64-diffusers"
|
|
build-type: "mps"
|
|
- backend: "ace-step"
|
|
tag-suffix: "-metal-darwin-arm64-ace-step"
|
|
build-type: "mps"
|
|
- backend: "mlx"
|
|
tag-suffix: "-metal-darwin-arm64-mlx"
|
|
build-type: "mps"
|
|
- backend: "chatterbox"
|
|
tag-suffix: "-metal-darwin-arm64-chatterbox"
|
|
build-type: "mps"
|
|
- backend: "mlx-vlm"
|
|
tag-suffix: "-metal-darwin-arm64-mlx-vlm"
|
|
build-type: "mps"
|
|
- backend: "mlx-audio"
|
|
tag-suffix: "-metal-darwin-arm64-mlx-audio"
|
|
build-type: "mps"
|
|
- backend: "mlx-distributed"
|
|
tag-suffix: "-metal-darwin-arm64-mlx-distributed"
|
|
build-type: "mps"
|
|
- backend: "llama-cpp"
|
|
tag-suffix: "-metal-darwin-arm64-llama-cpp"
|
|
lang: "go"
|
|
- backend: "stablediffusion-ggml"
|
|
tag-suffix: "-metal-darwin-arm64-stablediffusion-ggml"
|
|
build-type: "metal"
|
|
lang: "go"
|
|
- backend: "whisper"
|
|
tag-suffix: "-metal-darwin-arm64-whisper"
|
|
build-type: "metal"
|
|
lang: "go"
|
|
- backend: "parakeet-cpp"
|
|
tag-suffix: "-metal-darwin-arm64-parakeet-cpp"
|
|
build-type: "metal"
|
|
lang: "go"
|
|
- backend: "acestep-cpp"
|
|
tag-suffix: "-metal-darwin-arm64-acestep-cpp"
|
|
build-type: "metal"
|
|
lang: "go"
|
|
- backend: "qwen3-tts-cpp"
|
|
tag-suffix: "-metal-darwin-arm64-qwen3-tts-cpp"
|
|
build-type: "metal"
|
|
lang: "go"
|
|
- backend: "vibevoice-cpp"
|
|
tag-suffix: "-metal-darwin-arm64-vibevoice-cpp"
|
|
build-type: "metal"
|
|
lang: "go"
|
|
- backend: "voxtral"
|
|
tag-suffix: "-metal-darwin-arm64-voxtral"
|
|
build-type: "metal"
|
|
lang: "go"
|
|
- backend: "vibevoice"
|
|
tag-suffix: "-metal-darwin-arm64-vibevoice"
|
|
build-type: "mps"
|
|
- backend: "qwen-asr"
|
|
tag-suffix: "-metal-darwin-arm64-qwen-asr"
|
|
build-type: "mps"
|
|
- backend: "nemo"
|
|
tag-suffix: "-metal-darwin-arm64-nemo"
|
|
build-type: "mps"
|
|
- backend: "qwen-tts"
|
|
tag-suffix: "-metal-darwin-arm64-qwen-tts"
|
|
build-type: "mps"
|
|
- backend: "fish-speech"
|
|
tag-suffix: "-metal-darwin-arm64-fish-speech"
|
|
build-type: "mps"
|
|
- backend: "voxcpm"
|
|
tag-suffix: "-metal-darwin-arm64-voxcpm"
|
|
build-type: "mps"
|
|
- backend: "pocket-tts"
|
|
tag-suffix: "-metal-darwin-arm64-pocket-tts"
|
|
build-type: "mps"
|
|
- backend: "moonshine"
|
|
tag-suffix: "-metal-darwin-arm64-moonshine"
|
|
build-type: "mps"
|
|
- backend: "whisperx"
|
|
tag-suffix: "-metal-darwin-arm64-whisperx"
|
|
build-type: "mps"
|
|
- backend: "rerankers"
|
|
tag-suffix: "-metal-darwin-arm64-rerankers"
|
|
build-type: "mps"
|
|
- backend: "transformers"
|
|
tag-suffix: "-metal-darwin-arm64-transformers"
|
|
build-type: "mps"
|
|
- backend: "kokoro"
|
|
tag-suffix: "-metal-darwin-arm64-kokoro"
|
|
build-type: "mps"
|
|
- backend: "faster-whisper"
|
|
tag-suffix: "-metal-darwin-arm64-faster-whisper"
|
|
build-type: "mps"
|
|
- backend: "coqui"
|
|
tag-suffix: "-metal-darwin-arm64-coqui"
|
|
build-type: "mps"
|
|
- backend: "rfdetr"
|
|
tag-suffix: "-metal-darwin-arm64-rfdetr"
|
|
build-type: "mps"
|
|
- backend: "kitten-tts"
|
|
tag-suffix: "-metal-darwin-arm64-kitten-tts"
|
|
build-type: "mps"
|
|
- backend: "piper"
|
|
tag-suffix: "-metal-darwin-arm64-piper"
|
|
build-type: "metal"
|
|
lang: "go"
|
|
- backend: "opus"
|
|
tag-suffix: "-metal-darwin-arm64-opus"
|
|
build-type: "metal"
|
|
lang: "go"
|
|
- backend: "silero-vad"
|
|
tag-suffix: "-metal-darwin-arm64-silero-vad"
|
|
build-type: "metal"
|
|
lang: "go"
|
|
- backend: "local-store"
|
|
tag-suffix: "-metal-darwin-arm64-local-store"
|
|
build-type: "metal"
|
|
lang: "go"
|
|
- backend: "llama-cpp-quantization"
|
|
tag-suffix: "-metal-darwin-arm64-llama-cpp-quantization"
|
|
build-type: "mps"
|