LocalAI/core/http/endpoints/openai at 1c4bdfd1d6cc0c18c7b1eee9ccb07fd1301cfed7 - LocalAI - Gitea: Git with a cup of tea

mirror/LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2026-06-03 13:56:46 -04:00

Files

History

Ettore Di Giacinto 1c4bdfd1d6 refactor(openai): route node-header through middleware wrapper

Wire middleware.ExposeNodeHeader onto the OpenAI inference routes
(chat, completions, embeddings) plus the Anthropic /v1/messages shim
and the Ollama chat/generate/embed shims. The wrapper handles
X-LocalAI-Node attribution from a single place, so the per-handler
maybeSetNodeHeader calls and the per-request nodeIDCh rendezvous /
applyNodeIDHeader plumbing in chat.go and completion.go are removed.

For SSE: the wrapper's lazy stamp on the first Write / WriteHeader /
Flush picks up the post-ml.Load node ID from the loader, replacing the
chan signal the worker used to publish. The role=assistant first chunk
emission stays where it is (inside the first token callback) so all
writes still happen AFTER ml.Load has stamped the per-modelID node ID.

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Assisted-by: Claude:claude-opus-4-7[1m]

2026-05-24 21:23:04 +00:00

..

feat(realtime): WebRTC support (#8790 )

2026-03-13 21:37:15 +01:00

chat_assistant_gate_test.go

chore: Security hardening (#9719 )

2026-05-08 16:25:45 +02:00

chat_assistant_gate.go

chore: Security hardening (#9719 )

2026-05-08 16:25:45 +02:00

chat_emit_test.go

fix(streaming): comply with OpenAI usage / stream_options spec (#9815 )

2026-05-14 08:53:46 +02:00

chat_emit.go

fix(openai): stream usage non-zero when tools are enabled (#9941 )

2026-05-22 10:13:41 +02:00

chat_stream_usage_test.go

fix(distributed): per-request node ID rendezvous for streaming header

2026-05-24 20:47:26 +00:00

chat_stream_workers.go

fix(distributed): per-request node ID rendezvous for streaming header

2026-05-24 20:47:26 +00:00

chat_test.go

chore: refactor endpoints to use same inferencing path, add automatic retrial mechanism in case of errors (#9029 )

2026-03-16 21:31:02 +01:00

chat.go

refactor(openai): route node-header through middleware wrapper

2026-05-24 21:23:04 +00:00

completion.go

refactor(openai): route node-header through middleware wrapper

2026-05-24 21:23:04 +00:00

constants.go

fix(api): SSE streaming format to comply with specification (#7182 )

2025-11-09 22:00:27 +01:00

diarization_test.go

feat(api): add /v1/audio/diarization endpoint with sherpa-onnx + vibevoice.cpp (#9654 )

2026-05-05 15:10:13 +02:00

diarization.go

feat(whisper): honor client cancellation via ggml abort_callback (#9710 )

2026-05-08 01:44:47 +02:00

edit.go

fix(streaming): comply with OpenAI usage / stream_options spec (#9815 )

2026-05-14 08:53:46 +02:00

embeddings.go

refactor(openai): route node-header through middleware wrapper

2026-05-24 21:23:04 +00:00

image_test.go

Fix image upload processing and img2img pipeline in diffusers backend (#8879 )

2026-03-11 08:05:50 +01:00

image.go

fix(streaming): comply with OpenAI usage / stream_options spec (#9815 )

2026-05-14 08:53:46 +02:00

inference_test.go

fix: thinking models with tools returning empty content (reasoning-only retry loop) (#9290 )

2026-04-09 18:30:31 +02:00

inference.go

fix: thinking models with tools returning empty content (reasoning-only retry loop) (#9290 )

2026-04-09 18:30:31 +02:00

inpainting_test.go

feat(realtime): WebRTC support (#8790 )

2026-03-13 21:37:15 +01:00

inpainting.go

fix(streaming): comply with OpenAI usage / stream_options spec (#9815 )

2026-05-14 08:53:46 +02:00

list.go

feat(api): Allow coding agents to interactively discover how to control and configure LocalAI (#9084 )

2026-04-04 15:14:35 +02:00

node_header_stream_test.go

test(distributed): cover streaming X-LocalAI-Node header end-to-end

2026-05-24 20:48:06 +00:00

node_header_test.go

refactor(openai): drop MaybeSetNodeHeaderForTest shim, move test internal

2026-05-24 20:47:43 +00:00

node_header.go

refactor(openai): drop MaybeSetNodeHeaderForTest shim, move test internal

2026-05-24 20:47:43 +00:00

openai_suite_test.go

Fix image upload processing and img2img pipeline in diffusers backend (#8879 )

2026-03-11 08:05:50 +01:00

realtime_gate_test.go

feat(realtime): Add Liquid Audio s2s model and assistant mode on talk page (#9801 )

2026-05-13 21:57:27 +02:00

realtime_modality_test.go

realtime: honor output_modalities to skip TTS in text-only mode (#9838 )

2026-05-15 12:39:47 +02:00

realtime_model.go

feat(whisper): honor client cancellation via ggml abort_callback (#9710 )

2026-05-08 01:44:47 +02:00

realtime_transport_webrtc.go

feat: add distributed mode (#9124 )

2026-03-30 00:47:27 +02:00

realtime_transport_ws.go

feat(realtime): WebRTC support (#8790 )

2026-03-13 21:37:15 +01:00

realtime_transport.go

feat(realtime): WebRTC support (#8790 )

2026-03-13 21:37:15 +01:00

realtime_webrtc.go

feat(realtime): Add Liquid Audio s2s model and assistant mode on talk page (#9801 )

2026-05-13 21:57:27 +02:00

realtime.go

realtime: honor output_modalities to skip TTS in text-only mode (#9838 )

2026-05-15 12:39:47 +02:00

transcription.go

feat(whisper): honor client cancellation via ggml abort_callback (#9710 )

2026-05-08 01:44:47 +02:00