mirror of
https://github.com/mudler/LocalAI.git
synced 2026-05-17 21:21:23 -04:00
* feat: Rework VRAM estimation and use known_usecases in gallery Signed-off-by: Richard Palethorpe <io@richiejp.com> Assisted-by: Claude:claude-opus-4-7[1m] [Claude Code] * chore(gallery): regenerate gallery index and add known_usecases to model entries Signed-off-by: Richard Palethorpe <io@richiejp.com> --------- Signed-off-by: Richard Palethorpe <io@richiejp.com>
16 lines
331 B
YAML
16 lines
331 B
YAML
config_file: |
|
|
backend: llama-cpp
|
|
function:
|
|
grammar:
|
|
disable: true
|
|
known_usecases:
|
|
- chat
|
|
- completion
|
|
options:
|
|
- use_jinja:true
|
|
parameters:
|
|
model: llama-cpp/models/nanbeige4.1-3b-q8_0.gguf
|
|
template:
|
|
use_tokenizer_template: true
|
|
name: nanbeige4.1
|