LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-06-13 03:09:03 -04:00

Files

LocalAI [bot] 50dea8c983 feat(crispasr): bundle espeak-ng and add piper TTS voices to the gallery (#10283 )

CrispASR's piper backend phonemizes non-English text via espeak-ng (dlopen,
the MIT-clean path; English uses a built-in G2P). The FROM scratch crispasr
image shipped none of it, so non-English piper voices loaded but failed
synthesis with "phonemization failed". Bundle the espeak-ng runtime so they
work:

- Dockerfile.golang: install espeak-ng-data + libespeak-ng1 and its libpcaudio0
  / libsonic0 deps in the crispasr builder (espeak's dlopen fails without the
  latter two).
- package.sh: copy libespeak-ng.so.1, libpcaudio.so.0, libsonic.so.0 into
  package/lib/ and the espeak-ng-data dir into the package root.
- run.sh: export CRISPASR_ESPEAK_DATA_PATH so the bundled data is found.

Add 9 single-speaker piper voices (de/en/it, incl. Italian paola + riccardo) to
the gallery, run through backend:piper, hosted at
LocalAI-Community/piper-voices-GGUF (converted from rhasspy/piper-voices with
CrispASR's convert-piper-to-gguf.py). Only single-speaker low/medium voices are
included; the engine does not yet support multi-speaker or high-quality piper
decoders.

All 9 verified end-to-end: each synthesizes a WAV at the model's native sample
rate using only the image-bundled espeak payload.


Assisted-by: Claude:claude-opus-4-8 [Claude Code]

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Co-authored-by: Ettore Di Giacinto <mudler@localai.io>

2026-06-12 23:10:30 +02:00

alpaca.yaml

…

arch-function.yaml

…

bge-m3-colbert.yaml

…

cerbero.yaml

…

chatml-hercules.yaml

…

chatml.yaml

…

codellama.yaml

…

command-r.yaml

…

deephermes.yaml

…

deepseek-r1.yaml

…

deepseek.yaml

…

dreamshaper.yaml

…

falcon3.yaml

…

flux-ggml.yaml

…

flux.yaml

…

gemma.yaml

…

granite3-2.yaml

…

granite4.yaml

…

granite.yaml

…

harmony.yaml

…

hermes-2-pro-mistral.yaml

…

hermes-vllm.yaml

…

index.yaml

feat(crispasr): bundle espeak-ng and add piper TTS voices to the gallery (#10283 )

2026-06-12 23:10:30 +02:00

jamba.yaml

…

kokoros.yaml

…

lfm.yaml

…

liquid-audio.yaml

…

llama3-instruct.yaml

…

llama3.1-instruct-grammar.yaml

…

llama3.1-instruct.yaml

…

llama3.1-reflective.yaml

…

llama3.2-fcall.yaml

…

llama3.2-quantized.yaml

…

llava.yaml

…

ltx-ggml.yaml

…

mathstral.yaml

…

mistral-0.3.yaml

…

moondream.yaml

…

mudler.yaml

…

nanbeige4.1.yaml

…

noromaid.yaml

…

openvino.yaml

…

parler-tts.yaml

…

phi-2-chat.yaml

…

phi-2-orange.yaml

…

phi-3-chat.yaml

…

phi-3-vision.yaml

…

phi-4-chat-fcall.yaml

…

phi-4-chat.yaml

…

piper.yaml

…

pocket-tts.yaml

…

qwen3-deepresearch.yaml

…

qwen3-openbuddy.yaml

…

qwen3.yaml

…

qwen-fcall.yaml

…

qwen-image.yaml

…

rerankers.yaml

…

rwkv.yaml

…

sd-ggml.yaml

…

sentencetransformers.yaml

…

sglang-gemma-4-e2b-mtp.yaml

…

sglang-gemma-4-e4b-mtp.yaml

…

sglang-mimo-7b-mtp.yaml

…

sglang.yaml

…

sherpa-onnx-asr.yaml

…

sherpa-onnx-tts.yaml

…

sherpa-onnx-vad.yaml

…

smolvlm.yaml

…

stablediffusion3.yaml

…

tuluv2.yaml

…

vibevoice.yaml

…

vicuna-chat.yaml

…

virtual.yaml

…

vllm.yaml

…

wan-ggml.yaml

…

whisper-base.yaml

…

wizardlm2.yaml

…

z-image-ggml.yaml

…