LocalAI/core/http/endpoints/openai at 076dcdbed876b68ec2e138434226e6dca2e6d543 - LocalAI - Gitea: Git with a cup of tea

mirror/LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-06-07 00:06:51 -04:00

Files

History

Ettore Di Giacinto 076dcdbed8 refactor(realtime): buffer whole message for TTS, drop sentence segmenter

Per review (richiejp): the sentence segmenter pipelined unary TTS by
splitting on ASCII .!?/newline, which does nothing for languages without
those boundaries (CJK/Thai) — there it already degraded to buffering the
whole message anyway.

Replace it with a uniform model: stream the LLM transcript live, buffer the
full message, then synthesize it once. emitSpeech already streams the audio
chunks when the backend implements TTSStream and falls back to a single
unary delta otherwise, so this is real streaming TTS where supported and a
clean whole-message synthesis elsewhere — no per-sentence emulation, no
language assumptions. speechStreamer becomes transcriptStreamer (transcript
deltas only); the whole-message synthesis moves into streamLLMResponse.

Assisted-by: Claude:claude-opus-4-8 go test, golangci-lint
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

2026-06-05 14:03:36 +00:00

..

feat(realtime): WebRTC support (#8790 )

2026-03-13 21:37:15 +01:00

chat_assistant_gate_test.go

chore: Security hardening (#9719 )

2026-05-08 16:25:45 +02:00

chat_assistant_gate.go

chore: Security hardening (#9719 )

2026-05-08 16:25:45 +02:00

chat_emit_test.go

fix(streaming): comply with OpenAI usage / stream_options spec (#9815 )

2026-05-14 08:53:46 +02:00

chat_emit.go

fix(openai): stream usage non-zero when tools are enabled (#9941 )

2026-05-22 10:13:41 +02:00

chat_stream_reasoning_test.go

fix(streaming/tools): don't leak prefill-misclassified content as trailing reasoning chunk (#10000 )

2026-05-26 08:34:26 +02:00

chat_stream_usage_test.go

fix(openai): stream usage non-zero when tools are enabled (#9941 )

2026-05-22 10:13:41 +02:00

chat_stream_workers_test.go

fix(streaming/tools): stop healing-marker stubs from gating off content (#9999 )

2026-05-25 23:55:35 +02:00

chat_stream_workers.go

fix(openai): stop streaming tool-call double-emission when autoparser is active (#10055 )

2026-05-29 11:39:09 +02:00

chat_test.go

fix(reasoning): stop <think> leaking into content when autoparser is in pure-content mode (#9991 )

2026-05-25 22:39:50 +02:00

chat.go

fix(reasoning): stop <think> leaking into content when autoparser is in pure-content mode (#9991 )

2026-05-25 22:39:50 +02:00

completion.go

feat(middleware): Model routing, PII filtering, Cloud model proxies (#9802 )

2026-05-25 09:28:27 +02:00

constants.go

fix(api): SSE streaming format to comply with specification (#7182 )

2025-11-09 22:00:27 +01:00

diarization_test.go

feat(api): add /v1/audio/diarization endpoint with sherpa-onnx + vibevoice.cpp (#9654 )

2026-05-05 15:10:13 +02:00

diarization.go

feat(whisper): honor client cancellation via ggml abort_callback (#9710 )

2026-05-08 01:44:47 +02:00

edit.go

feat(middleware): Model routing, PII filtering, Cloud model proxies (#9802 )

2026-05-25 09:28:27 +02:00

embeddings.go

feat(distributed): gated X-LocalAI-Node response header (middleware + wrapper) (#9976 )

2026-05-25 10:51:48 +02:00

image_test.go

Fix image upload processing and img2img pipeline in diffusers backend (#8879 )

2026-03-11 08:05:50 +01:00

image.go

security(http): refuse redirects on outbound clients via hardened pkg/httpclient (#10087 )

2026-05-30 12:04:10 +02:00

inference_test.go

fix: thinking models with tools returning empty content (reasoning-only retry loop) (#9290 )

2026-04-09 18:30:31 +02:00

inference.go

fix: thinking models with tools returning empty content (reasoning-only retry loop) (#9290 )

2026-04-09 18:30:31 +02:00

inpainting_test.go

feat(distributed): gated X-LocalAI-Node response header (middleware + wrapper) (#9976 )

2026-05-25 10:51:48 +02:00

inpainting.go

feat(distributed): gated X-LocalAI-Node response header (middleware + wrapper) (#9976 )

2026-05-25 10:51:48 +02:00

list.go

feat(api): Allow coding agents to interactively discover how to control and configure LocalAI (#9084 )

2026-04-04 15:14:35 +02:00

openai_suite_test.go

Fix image upload processing and img2img pipeline in diffusers backend (#8879 )

2026-03-11 08:05:50 +01:00

realtime_doubles_test.go

refactor(realtime): buffer whole message for TTS, drop sentence segmenter

2026-06-05 14:03:36 +00:00

realtime_gate_test.go

feat(realtime): Add Liquid Audio s2s model and assistant mode on talk page (#9801 )

2026-05-13 21:57:27 +02:00

realtime_modality_test.go

realtime: honor output_modalities to skip TTS in text-only mode (#9838 )

2026-05-15 12:39:47 +02:00

realtime_model.go

feat(realtime): pipeline disable_thinking maps to enable_thinking off

2026-06-05 14:03:36 +00:00

realtime_reasoning_test.go

feat: forward reasoning_effort to the backend so jinja models honor it (#10184 )

2026-06-05 13:45:43 +00:00

realtime_reasoning.go

feat: forward reasoning_effort to the backend so jinja models honor it (#10184 )

2026-06-05 13:45:43 +00:00

realtime_speech_test.go

fix(realtime): register pipeline streaming/thinking config fields

2026-06-05 14:03:36 +00:00

realtime_speech.go

fix(realtime): clean TTS temp path before read (gosec G304)

2026-06-05 14:03:36 +00:00

realtime_stream_test.go

refactor(realtime): buffer whole message for TTS, drop sentence segmenter

2026-06-05 14:03:36 +00:00

realtime_stream.go

refactor(realtime): buffer whole message for TTS, drop sentence segmenter

2026-06-05 14:03:36 +00:00

realtime_thinking_test.go

fix(realtime): always strip reasoning from spoken output

2026-06-05 14:03:36 +00:00

realtime_thinking.go

fix(realtime): always strip reasoning from spoken output

2026-06-05 14:03:36 +00:00

realtime_transcription_test.go

feat(realtime): streaming transcription text deltas

2026-06-05 14:03:36 +00:00

realtime_transcription.go

feat(realtime): streaming transcription text deltas

2026-06-05 14:03:36 +00:00

realtime_transport_webrtc.go

feat: add distributed mode (#9124 )

2026-03-30 00:47:27 +02:00

realtime_transport_ws.go

feat(realtime): WebRTC support (#8790 )

2026-03-13 21:37:15 +01:00

realtime_transport.go

feat(realtime): WebRTC support (#8790 )

2026-03-13 21:37:15 +01:00

realtime_webrtc.go

feat(realtime): Add Liquid Audio s2s model and assistant mode on talk page (#9801 )

2026-05-13 21:57:27 +02:00

realtime.go

fix(realtime): always strip reasoning from spoken output

2026-06-05 14:03:36 +00:00

transcription.go

feat(whisper): honor client cancellation via ggml abort_callback (#9710 )

2026-05-08 01:44:47 +02:00