mirror of
https://github.com/mudler/LocalAI.git
synced 2026-06-22 15:49:12 -04:00
* feat(config): add model alias field and self-validation Add ModelConfig.Alias (yaml: alias), IsAlias(), and an alias short-circuit at the top of Validate() that rejects self-reference and forbids setting backend/parameters.model on a pure-redirect alias. Assisted-by: Claude:claude-opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(config): resolve and validate model alias targets in the loader Assisted-by: Claude:opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(middleware): resolve model aliases and stamp requested/served identity Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(modeladmin): reject alias configs with invalid targets on create/edit Validate alias targets at create/swap entry points (ImportModelEndpoint, EditYAML, PatchConfig) so a dangling, chained, or disabled alias target is rejected at save time rather than surfacing as a runtime error. Assisted-by: Claude:opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(api): add GET /api/aliases to list model aliases Adds an admin-gated read-only endpoint that lists every model alias config as {name, target} pairs, backed by the loader's existing GetAllModelsConfigs(). Assisted-by: Claude:opus-4.8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(mcp): add set_alias and list_aliases tools Expose model-alias management over the LocalAI Assistant MCP surface: list_aliases (read-only, GET /api/aliases) and set_alias (mutating). SetAlias is swap-first: PATCH /api/models/config-json/:name swaps an existing alias's target (validated, non-destructive) and a 404 falls back to POST /models/import to create a fresh {name, alias} config. The inproc client mirrors this via ConfigService.PatchConfig + a create path modeled on ImportModelEndpoint. Deletion reuses delete_model. Assisted-by: Claude:claude-opus-4 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * style(mcp): replace em dashes in alias tool comments Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(config-meta): expose alias as a model-select field Add an 'alias' section to DefaultSections() and an 'alias' field override in DefaultRegistry() so the schema-driven React editor renders the new top-level ModelConfig.Alias field as a model picker in its own section. Assisted-by: Claude:opus-4.8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(ui): add alias template card and Manage alias badge Add an 'Alias / Routing' template to the create-flow gallery that seeds a minimal name + alias config, and a read-only 'alias -> target' badge on the Manage Models tab. The capabilities row payload does not carry the alias field, so the badge resolves targets from GET /api/aliases looked up by name. Assisted-by: Claude:claude-opus-4 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * docs: document model aliases Assisted-by: Claude:claude-opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * docs(swagger): regenerate for GET /api/aliases Adds the /api/aliases path and AliasInfo schema generated from the ListAliasesEndpoint annotation. Assisted-by: Claude:claude-opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * test(localai): check os.RemoveAll error in aliases_test Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: correct alias conversion docs and advertise /api/aliases in instructions Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(mcp): write alias config 0600 to satisfy gosec G306 The inproc createAlias path wrote the alias YAML with 0644, which gosec flags as a new G306 finding on the PR. The LocalAI process is the sole reader/writer of model configs, so 0600 is correct and keeps the scan clean. Assisted-by: Claude:claude-opus-4-8 [Claude Code] Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Co-authored-by: Ettore Di Giacinto <mudler@localai.io>
105 lines
4.9 KiB
Go
105 lines
4.9 KiB
Go
package localaitools
|
|
|
|
import (
|
|
"context"
|
|
|
|
"github.com/mudler/LocalAI/core/config"
|
|
"github.com/mudler/LocalAI/core/gallery"
|
|
"github.com/mudler/LocalAI/core/schema"
|
|
"github.com/mudler/LocalAI/core/services/modeladmin"
|
|
"github.com/mudler/LocalAI/pkg/vram"
|
|
)
|
|
|
|
// LocalAIClient is the surface tools depend on. It has two implementations:
|
|
//
|
|
// - inproc.Client (in-process; calls LocalAI services directly)
|
|
// - httpapi.Client (out-of-process; calls the LocalAI REST API)
|
|
//
|
|
// Tool handlers and the embedded skill prompts are agnostic to which
|
|
// implementation backs the client.
|
|
//
|
|
// Where the same shape already exists elsewhere in the codebase
|
|
// (config.Gallery, gallery.Metadata, schema.KnownBackend, vram.EstimateResult,
|
|
// modeladmin.Action/Capability) we surface it directly rather than maintain
|
|
// a parallel DTO — keeping the LLM-visible wire format aligned with the
|
|
// rest of LocalAI by construction.
|
|
type LocalAIClient interface {
|
|
// ---- Models / gallery (read) ----
|
|
GallerySearch(ctx context.Context, q GallerySearchQuery) ([]gallery.Metadata, error)
|
|
ListInstalledModels(ctx context.Context, capability Capability) ([]InstalledModel, error)
|
|
ListGalleries(ctx context.Context) ([]config.Gallery, error)
|
|
GetJobStatus(ctx context.Context, jobID string) (*JobStatus, error)
|
|
GetModelConfig(ctx context.Context, name string) (*ModelConfigView, error)
|
|
|
|
// ---- Models / gallery (write) ----
|
|
InstallModel(ctx context.Context, req InstallModelRequest) (jobID string, err error)
|
|
DeleteModel(ctx context.Context, name string) error
|
|
EditModelConfig(ctx context.Context, name string, patch map[string]any) error
|
|
ReloadModels(ctx context.Context) error
|
|
ImportModelURI(ctx context.Context, req ImportModelURIRequest) (*ImportModelURIResponse, error)
|
|
|
|
// ---- Model aliases ----
|
|
// SetAlias creates the alias `name` pointing at `target`, or swaps an
|
|
// existing alias's target. The server validates that `target` is an
|
|
// existing, non-alias, enabled model. Deletion reuses DeleteModel.
|
|
SetAlias(ctx context.Context, name, target string) error
|
|
// ListAliases returns every configured alias and its target.
|
|
ListAliases(ctx context.Context) ([]AliasInfo, error)
|
|
|
|
// ---- Backends ----
|
|
// ListBackends returns installed backends. The shape stays a thin
|
|
// localaitools.Backend rather than gallery.SystemBackend because the
|
|
// latter carries filesystem paths (RunFile, Metadata) the LLM
|
|
// shouldn't see.
|
|
ListBackends(ctx context.Context) ([]Backend, error)
|
|
// ListKnownBackends returns the same shape as REST /backends/known.
|
|
ListKnownBackends(ctx context.Context) ([]schema.KnownBackend, error)
|
|
InstallBackend(ctx context.Context, req InstallBackendRequest) (jobID string, err error)
|
|
UpgradeBackend(ctx context.Context, name string) (jobID string, err error)
|
|
|
|
// ---- System ----
|
|
SystemInfo(ctx context.Context) (*SystemInfo, error)
|
|
ListNodes(ctx context.Context) ([]Node, error)
|
|
VRAMEstimate(ctx context.Context, req VRAMEstimateRequest) (*vram.EstimateResult, error)
|
|
|
|
// ---- State ----
|
|
// ToggleModelState accepts modeladmin.ActionEnable / ActionDisable.
|
|
ToggleModelState(ctx context.Context, name string, action modeladmin.Action) error
|
|
// ToggleModelPinned accepts modeladmin.ActionPin / ActionUnpin.
|
|
ToggleModelPinned(ctx context.Context, name string, action modeladmin.Action) error
|
|
|
|
// ---- Branding / whitelabeling ----
|
|
// GetBranding returns the configured instance branding (name, tagline,
|
|
// asset URLs).
|
|
GetBranding(ctx context.Context) (*Branding, error)
|
|
// SetBranding updates the text branding fields. Asset uploads are not
|
|
// exposed over MCP — admins use the Settings UI for binary files.
|
|
SetBranding(ctx context.Context, req SetBrandingRequest) (*Branding, error)
|
|
|
|
// ---- Usage / billing ----
|
|
|
|
// GetUsageStats returns aggregated token usage. In single-user
|
|
// no-auth mode this reports the synthetic local user's usage. The
|
|
// implementation enforces "admin required to query other users".
|
|
GetUsageStats(ctx context.Context, q UsageStatsQuery) (*UsageStats, error)
|
|
|
|
// ---- PII filter ----
|
|
// GetPIIEvents returns recent redaction events. Implementation
|
|
// enforces "admin required" when auth is on. The regex pattern tools
|
|
// were removed — detection policy lives on each detector model's
|
|
// pii_detection block, managed via the model-config tools.
|
|
GetPIIEvents(ctx context.Context, q PIIEventsQuery) ([]PIIEvent, error)
|
|
|
|
// ---- Middleware admin ----
|
|
// GetMiddlewareStatus returns the aggregated state surfaced on the
|
|
// /app/middleware page: active PII patterns, per-model resolved
|
|
// enabled state, recent event count, router placeholder.
|
|
GetMiddlewareStatus(ctx context.Context) (*MiddlewareStatus, error)
|
|
|
|
// ---- Router (intelligent routing) ----
|
|
// GetRouterDecisions returns recent routing decisions for the
|
|
// /app/middleware Routing tab and for agent-driven introspection.
|
|
// Admin-required when auth is on.
|
|
GetRouterDecisions(ctx context.Context, q RouterDecisionsQuery) ([]RouterDecision, error)
|
|
}
|