From e41d8b65ce07b037433f79fa5a452792acf431d8 Mon Sep 17 00:00:00 2001 From: "LocalAI [bot]" <139863280+localai-bot@users.noreply.github.com> Date: Fri, 24 Oct 2025 09:04:00 +0200 Subject: [PATCH] chore(model gallery): :robot: add 1 new models via gallery agent (#6706) chore(model gallery): :robot: add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> --- gallery/index.yaml | 53 ++++++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 53 insertions(+) diff --git a/gallery/index.yaml b/gallery/index.yaml index 108a7a044..5900442a2 100644 --- a/gallery/index.yaml +++ b/gallery/index.yaml @@ -22489,3 +22489,56 @@ - filename: PokeeAI.pokee_research_7b.Q4_K_M.gguf sha256: 4cf6e52b51f3595631b6e17ad2d5a5c8499d2e646cf9452dd32fb5ff7bbb37dd uri: huggingface://DevQuasar/PokeeAI.pokee_research_7b-GGUF/PokeeAI.pokee_research_7b.Q4_K_M.gguf +- !!merge <<: *gptoss + name: "gpt-oss-20b-esper3.1-i1" + urls: + - https://huggingface.co/mradermacher/gpt-oss-20b-Esper3.1-i1-GGUF + description: | + **Model Name:** gpt-oss-20b-Esper3.1 + **Repository:** [ValiantLabs/gpt-oss-20b-Esper3.1](https://huggingface.co/ValiantLabs/gpt-oss-20b-Esper3.1) + **Base Model:** openai/gpt-oss-20b + **Type:** Instruction-tuned, reasoning-focused language model + **Size:** 20 billion parameters + **License:** Apache 2.0 + + --- + + ### 🔍 **Overview** + gpt-oss-20b-Esper3.1 is a specialized, instruction-tuned variant of the 20B open-source GPT model, developed by **Valiant Labs**. It excels in **advanced coding, software architecture, and DevOps reasoning**, making it ideal for technical problem-solving and AI-driven engineering tasks. + + ### ✨ **Key Features** + - **Expert in DevOps & Cloud Systems:** Trained on high-difficulty datasets (e.g., Titanium3, Tachibana3, Mitakihara), it delivers precise, actionable guidance for AWS, Kubernetes, Terraform, Ansible, Docker, Jenkins, and more. + - **Strong Code Reasoning:** Optimized for complex programming tasks, including full-stack development, scripting, and debugging. + - **High-Quality Inference:** Uses `bf16` precision for full-precision performance; quantized versions (e.g., GGUF) available for efficient local inference. + - **Open-Source & Free to Use:** Fully open-access, built on the public gpt-oss-20b foundation and trained with community datasets. + + ### 📌 **Use Cases** + - Designing scalable cloud architectures + - Writing and optimizing infrastructure-as-code + - Debugging complex DevOps pipelines + - AI-assisted software development and documentation + - Real-time technical troubleshooting + + ### 💡 **Getting Started** + Use the standard `text-generation` pipeline with the `transformers` library. Supports role-based prompting (e.g., `user`, `assistant`) and performs best with high-reasoning prompts. + + ```python + from transformers import pipeline + + pipe = pipeline("text-generation", model="ValiantLabs/gpt-oss-20b-Esper3.1", torch_dtype="auto", device_map="auto") + messages = [{"role": "user", "content": "Design a Kubernetes cluster for a high-traffic web app with CI/CD via GitHub Actions."}] + outputs = pipe(messages, max_new_tokens=2000) + print(outputs[0]["generated_text"][-1]) + ``` + + --- + + > 🔗 **Model Gallery Entry**: + > *gpt-oss-20b-Esper3.1 – A powerful, open-source 20B model tuned for expert-level DevOps, coding, and system architecture. Built by Valiant Labs using high-quality technical datasets. Perfect for engineers, architects, and AI developers.* + overrides: + parameters: + model: gpt-oss-20b-Esper3.1.i1-Q4_K_M.gguf + files: + - filename: gpt-oss-20b-Esper3.1.i1-Q4_K_M.gguf + sha256: 079683445913d12e70449a10b9e1bfc8adaf1e7917e86cf3be3cb29cca186f11 + uri: huggingface://mradermacher/gpt-oss-20b-Esper3.1-i1-GGUF/gpt-oss-20b-Esper3.1.i1-Q4_K_M.gguf