mirror of
https://github.com/mudler/LocalAI.git
synced 2026-05-18 05:33:09 -04:00
chore(gallery): cleanup old (superseded) archs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
@@ -14068,427 +14068,6 @@
     - filename: deepseek-ai_DeepSeek-R1-0528-Qwen3-8B-Q4_K_M.gguf
       sha256: e0c2f118fd59f3a16f20d18b0e7f79e960c84bc8c66d94fd71a691e05151d54f
       uri: huggingface://bartowski/deepseek-ai_DeepSeek-R1-0528-Qwen3-8B-GGUF/deepseek-ai_DeepSeek-R1-0528-Qwen3-8B-Q4_K_M.gguf
-- &qwen2
-  url: "github:mudler/LocalAI/gallery/chatml.yaml@master" ## Start QWEN2
-  name: "qwen2-7b-instruct"
-  icon: https://avatars.githubusercontent.com/u/141221163
-  license: apache-2.0
-  description: |
-    Qwen2 is the new series of Qwen large language models. For Qwen2, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters, including a Mixture-of-Experts model. This repo contains the instruction-tuned 7B Qwen2 model.
-  urls:
-    - https://huggingface.co/Qwen/Qwen2-7B-Instruct
-    - https://huggingface.co/bartowski/Qwen2-7B-Instruct-GGUF
-  tags:
-    - llm
-    - gguf
-    - gpu
-    - qwen
-    - cpu
-  overrides:
-    parameters:
-      model: Qwen2-7B-Instruct-Q4_K_M.gguf
-  files:
-    - filename: Qwen2-7B-Instruct-Q4_K_M.gguf
-      sha256: 8d0d33f0d9110a04aad1711b1ca02dafc0fa658cd83028bdfa5eff89c294fe76
-      uri: huggingface://bartowski/Qwen2-7B-Instruct-GGUF/Qwen2-7B-Instruct-Q4_K_M.gguf
-- !!merge <<: *qwen2
-  name: "dolphin-2.9.2-qwen2-72b"
-  icon: https://cdn-uploads.huggingface.co/production/uploads/63111b2d88942700629f5771/ldkN1J0WIDQwU4vutGYiD.png
-  urls:
-    - https://huggingface.co/cognitivecomputations/dolphin-2.9.2-qwen2-72b-gguf
-  description: "Dolphin 2.9.2 Qwen2 72B \U0001F42C\n\nCurated and trained by Eric Hartford, Lucas Atkins, and Fernando Fernandes, and Cognitive Computations\n"
-  overrides:
-    parameters:
-      model: dolphin-2.9.2-qwen2-Q4_K_M.gguf
-  files:
-    - filename: dolphin-2.9.2-qwen2-Q4_K_M.gguf
-      sha256: 44a0e82cbc2a201b2f4b9e16099a0a4d97b6f0099d45bcc5b354601f38dbb709
-      uri: huggingface://cognitivecomputations/dolphin-2.9.2-qwen2-72b-gguf/qwen2-Q4_K_M.gguf
-- !!merge <<: *qwen2
-  name: "dolphin-2.9.2-qwen2-7b"
-  description: "Dolphin 2.9.2 Qwen2 7B \U0001F42C\n\nCurated and trained by Eric Hartford, Lucas Atkins, and Fernando Fernandes, and Cognitive Computations\n"
-  urls:
-    - https://huggingface.co/cognitivecomputations/dolphin-2.9.2-qwen2-7b
-    - https://huggingface.co/cognitivecomputations/dolphin-2.9.2-qwen2-7b-gguf
-  icon: https://cdn-uploads.huggingface.co/production/uploads/63111b2d88942700629f5771/ldkN1J0WIDQwU4vutGYiD.png
-  overrides:
-    parameters:
-      model: dolphin-2.9.2-qwen2-7b-Q4_K_M.gguf
-  files:
-    - filename: dolphin-2.9.2-qwen2-7b-Q4_K_M.gguf
-      sha256: a15b5db4df6be4f4bfb3632b2009147332ef4c57875527f246b4718cb0d3af1f
-      uri: huggingface://cognitivecomputations/dolphin-2.9.2-qwen2-7b-gguf/dolphin-2.9.2-qwen2-7b-Q4_K_M.gguf
-- !!merge <<: *qwen2
-  name: "samantha-qwen-2-7B"
-  description: |
-    Samantha based on qwen2
-  urls:
-    - https://huggingface.co/bartowski/Samantha-Qwen-2-7B-GGUF
-    - https://huggingface.co/macadeliccc/Samantha-Qwen2-7B
-  overrides:
-    parameters:
-      model: Samantha-Qwen-2-7B-Q4_K_M.gguf
-  files:
-    - filename: Samantha-Qwen-2-7B-Q4_K_M.gguf
-      sha256: 5d1cf1c35a7a46c536a96ba0417d08b9f9e09c24a4e25976f72ad55d4904f6fe
-      uri: huggingface://bartowski/Samantha-Qwen-2-7B-GGUF/Samantha-Qwen-2-7B-Q4_K_M.gguf
-- !!merge <<: *qwen2
-  name: "magnum-72b-v1"
-  icon: https://files.catbox.moe/ngqnb1.png
-  description: |
-    This is the first in a series of models designed to replicate the prose quality of the Claude 3 models, specifically Sonnet and Opus. This model is fine-tuned on top of Qwen-2 72B Instruct.
-  urls:
-    - https://huggingface.co/alpindale/magnum-72b-v1
-    - https://huggingface.co/bartowski/magnum-72b-v1-GGUF
-  overrides:
-    parameters:
-      model: magnum-72b-v1-Q4_K_M.gguf
-  files:
-    - filename: magnum-72b-v1-Q4_K_M.gguf
-      sha256: 046ec48665ce64a3a4965509dee2d9d8e5d81cb0b32ca0ddf130d2b59fa4ca9a
-      uri: huggingface://bartowski/magnum-72b-v1-GGUF/magnum-72b-v1-Q4_K_M.gguf
-- !!merge <<: *qwen2
-  name: "qwen2-1.5b-ita"
-  description: |
-    Qwen2 1.5B is a compact language model specifically fine-tuned for the Italian language. Despite its relatively small size of 1.5 billion parameters, Qwen2 1.5B demonstrates strong performance, nearly matching the capabilities of larger models, such as the 9 billion parameter ITALIA model by iGenius. The fine-tuning process focused on optimizing the model for various language tasks in Italian, making it highly efficient and effective for Italian language applications.
-  urls:
-    - https://huggingface.co/DeepMount00/Qwen2-1.5B-Ita
-    - https://huggingface.co/DeepMount00/Qwen2-1.5B-Ita-GGUF
-  overrides:
-    parameters:
-      model: qwen2-1.5b-instruct-q8_0.gguf
-  files:
-    - filename: qwen2-1.5b-instruct-q8_0.gguf
-      sha256: c9d33989d77f4bd6966084332087921b9613eda01d5f44dc0b4e9a7382a2bfbb
-      uri: huggingface://DeepMount00/Qwen2-1.5B-Ita-GGUF/qwen2-1.5b-instruct-q8_0.gguf
-- !!merge <<: *qwen2
-  name: "einstein-v7-qwen2-7b"
-  icon: https://cdn-uploads.huggingface.co/production/uploads/6468ce47e134d050a58aa89c/KLQP1jK-DIzpwHzYRIH-Q.png
-  description: |
-    This model is a full fine-tuned version of Qwen/Qwen2-7B on diverse datasets.
-  urls:
-    - https://huggingface.co/Weyaxi/Einstein-v7-Qwen2-7B
-    - https://huggingface.co/bartowski/Einstein-v7-Qwen2-7B-GGUF
-  overrides:
-    parameters:
-      model: Einstein-v7-Qwen2-7B-Q4_K_M.gguf
-  files:
-    - filename: Einstein-v7-Qwen2-7B-Q4_K_M.gguf
-      sha256: 277b212ea65894723d2b86fb0f689fa5ecb54c9794f0fd2fb643655dc62812ce
-      uri: huggingface://bartowski/Einstein-v7-Qwen2-7B-GGUF/Einstein-v7-Qwen2-7B-Q4_K_M.gguf
-- !!merge <<: *qwen2
-  name: "arcee-spark"
-  icon: https://avatars.githubusercontent.com/u/126496414
-  description: |
-    Arcee Spark is a powerful 7B parameter language model that punches well above its weight class. Initialized from Qwen2, this model underwent a sophisticated training process:
-
-    Fine-tuned on 1.8 million samples
-    Merged with Qwen2-7B-Instruct using Arcee's mergekit
-    Further refined using Direct Preference Optimization (DPO)
-
-    This meticulous process results in exceptional performance, with Arcee Spark achieving the highest score on MT-Bench for models of its size, outperforming even GPT-3.5 on many tasks.
-  urls:
-    - https://huggingface.co/arcee-ai/Arcee-Spark-GGUF
-  overrides:
-    parameters:
-      model: Arcee-Spark-Q4_K_M.gguf
-  files:
-    - filename: Arcee-Spark-Q4_K_M.gguf
-      sha256: 44123276d7845dc13f73ca4aa431dc4c931104eb7d2186f2a73d076fa0ee2330
-      uri: huggingface://arcee-ai/Arcee-Spark-GGUF/Arcee-Spark-Q4_K_M.gguf
-- !!merge <<: *qwen2
-  name: "hercules-5.0-qwen2-7b"
-  description: |
-    Locutusque/Hercules-5.0-Qwen2-7B is a fine-tuned language model derived from Qwen2-7B. It is specifically designed to excel in instruction following, function calls, and conversational interactions across various scientific and technical domains. This fine-tuning has hercules-v5.0 with enhanced abilities in:
-
-    Complex Instruction Following: Understanding and accurately executing multi-step instructions, even those involving specialized terminology.
-    Function Calling: Seamlessly interpreting and executing function calls, providing appropriate input and output values.
-    Domain-Specific Knowledge: Engaging in informative and educational conversations about Biology, Chemistry, Physics, Mathematics, Medicine, Computer Science, and more.
-  urls:
-    - https://huggingface.co/Locutusque/Hercules-5.0-Qwen2-7B
-    - https://huggingface.co/bartowski/Hercules-5.0-Qwen2-7B-GGUF
-  overrides:
-    parameters:
-      model: Hercules-5.0-Qwen2-7B-Q4_K_M.gguf
-  files:
-    - filename: Hercules-5.0-Qwen2-7B-Q4_K_M.gguf
-      sha256: 8ebae4ffd43b906ddb938c3a611060ee5f99c35014e5ffe23ca35714361b5693
-      uri: huggingface://Hercules-5.0-Qwen2-7B-Q4_K_M.gguf/Hercules-5.0-Qwen2-7B-Q4_K_M.gguf
-- !!merge <<: *qwen2
-  name: "arcee-agent"
-  icon: https://avatars.githubusercontent.com/u/126496414
-  description: |
-    Arcee Agent is a cutting-edge 7B parameter language model specifically designed for function calling and tool use. Initialized from Qwen2-7B, it rivals the performance of much larger models while maintaining efficiency and speed. This model is particularly suited for developers, researchers, and businesses looking to implement sophisticated AI-driven solutions without the computational overhead of larger language models. Compute for training Arcee-Agent was provided by CrusoeAI. Arcee-Agent was trained using Spectrum.
-  urls:
-    - https://huggingface.co/crusoeai/Arcee-Agent-GGUF
-    - https://huggingface.co/arcee-ai/Arcee-Agent
-  overrides:
-    parameters:
-      model: arcee-agent.Q4_K_M.gguf
-  files:
-    - filename: arcee-agent.Q4_K_M.gguf
-      sha256: ebb49943a66c1e717f9399a555aee0af28a40bfac7500f2ad8dd05f211b62aac
-      uri: huggingface://crusoeai/Arcee-Agent-GGUF/arcee-agent.Q4_K_M.gguf
-- !!merge <<: *qwen2
-  name: "qwen2-7b-instruct-v0.8"
-  icon: https://huggingface.co/MaziyarPanahi/Qwen2-7B-Instruct-v0.8/resolve/main/qwen2-fine-tunes-maziyar-panahi.webp
-  description: |
-    MaziyarPanahi/Qwen2-7B-Instruct-v0.8
-
-    This is a fine-tuned version of the Qwen/Qwen2-7B model. It aims to improve the base model across all benchmarks.
-  urls:
-    - https://huggingface.co/MaziyarPanahi/Qwen2-7B-Instruct-v0.8
-    - https://huggingface.co/MaziyarPanahi/Qwen2-7B-Instruct-v0.8-GGUF
-  overrides:
-    parameters:
-      model: Qwen2-7B-Instruct-v0.8.Q4_K_M.gguf
-  files:
-    - filename: Qwen2-7B-Instruct-v0.8.Q4_K_M.gguf
-      sha256: 8c1b3efe9fa6ae1b37942ef26473cb4e0aed0f8038b60d4b61e5bffb61e49b7e
-      uri: huggingface://MaziyarPanahi/Qwen2-7B-Instruct-v0.8-GGUF/Qwen2-7B-Instruct-v0.8.Q4_K_M.gguf
-- !!merge <<: *qwen2
-  name: "qwen2-wukong-7b"
-  icon: https://cdn-uploads.huggingface.co/production/uploads/655dc641accde1bbc8b41aec/xOe1Nb3S9Nb53us7_Ja3s.jpeg
-  urls:
-    - https://huggingface.co/bartowski/Qwen2-Wukong-7B-GGUF
-  description: |
-    Qwen2-Wukong-7B is a dealigned chat finetune of the original fantastic Qwen2-7B model by the Qwen team.
-
-    This model was trained on the teknium OpenHeremes-2.5 dataset and some supplementary datasets from Cognitive Computations
-
-    This model was trained for 3 epochs with a custom FA2 implementation for AMD cards.
-  overrides:
-    parameters:
-      model: Qwen2-Wukong-7B-Q4_K_M.gguf
-  files:
-    - filename: Qwen2-Wukong-7B-Q4_K_M.gguf
-      sha256: 6b8ca6649c33fc84d4892ebcff1214f0b34697aced784f0d6d32e284a15943ad
-      uri: huggingface://bartowski/Qwen2-Wukong-7B-GGUF/Qwen2-Wukong-7B-Q4_K_M.gguf
-- !!merge <<: *qwen2
-  name: "calme-2.8-qwen2-7b"
-  icon: https://huggingface.co/MaziyarPanahi/calme-2.8-qwen2-7b/resolve/main/qwen2-fine-tunes-maziyar-panahi.webp
-  urls:
-    - https://huggingface.co/MaziyarPanahi/calme-2.8-qwen2-7b
-    - https://huggingface.co/MaziyarPanahi/calme-2.8-qwen2-7b-GGUF
-  description: |
-    This is a fine-tuned version of the Qwen/Qwen2-7B model. It aims to improve the base model across all benchmarks.
-  overrides:
-    parameters:
-      model: Qwen2-7B-Instruct-v0.8.Q4_K_M.gguf
-  files:
-    - filename: Qwen2-7B-Instruct-v0.8.Q4_K_M.gguf
-      sha256: 8c1b3efe9fa6ae1b37942ef26473cb4e0aed0f8038b60d4b61e5bffb61e49b7e
-      uri: huggingface://MaziyarPanahi/calme-2.8-qwen2-7b-GGUF/Qwen2-7B-Instruct-v0.8.Q4_K_M.gguf
-- !!merge <<: *qwen2
-  name: "stellardong-72b-i1"
-  icon: https://huggingface.co/smelborp/StellarDong-72b/resolve/main/stellardong.png
-  urls:
-    - https://huggingface.co/smelborp/StellarDong-72b
-    - https://huggingface.co/mradermacher/StellarDong-72b-i1-GGUF
-  description: |
-    Magnum + Nova = you won't believe how stellar this dong is!!
-  overrides:
-    parameters:
-      model: StellarDong-72b.i1-Q4_K_M.gguf
-  files:
-    - filename: StellarDong-72b.i1-Q4_K_M.gguf
-      sha256: 4c5012f0a034f40a044904891343ade2594f29c28a8a9d8052916de4dc5a61df
-      uri: huggingface://mradermacher/StellarDong-72b-i1-GGUF/StellarDong-72b.i1-Q4_K_M.gguf
-- !!merge <<: *qwen2
-  name: "magnum-32b-v1-i1"
-  icon: https://cdn-uploads.huggingface.co/production/uploads/635567189c72a7e742f1419c/PK7xRSd18Du0bX-w_t-9c.png
-  urls:
-    - https://huggingface.co/anthracite-org/magnum-32b-v1
-    - https://huggingface.co/mradermacher/magnum-32b-v1-i1-GGUF
-  description: |
-    This is the second in a series of models designed to replicate the prose quality of the Claude 3 models, specifically Sonnet and Opus. This model is fine-tuned on top of Qwen1.5 32B.
-  overrides:
-    parameters:
-      model: magnum-32b-v1.i1-Q4_K_M.gguf
-  files:
-    - filename: magnum-32b-v1.i1-Q4_K_M.gguf
-      sha256: a31704ce0d7e5b774f155522b9ab7ef6015a4ece4e9056bf4dfc6cac561ff0a3
-      uri: huggingface://mradermacher/magnum-32b-v1-i1-GGUF/magnum-32b-v1.i1-Q4_K_M.gguf
-- !!merge <<: *qwen2
-  name: "tifa-7b-qwen2-v0.1"
-  urls:
-    - https://huggingface.co/Tifa-RP/Tifa-7B-Qwen2-v0.1-GGUF
-  description: |
-    The Tifa role-playing language model is a high-performance language model based on a self-developed 220B model distillation, with a new base model of qwen2-7B. The model has been converted to gguf format for running in the Ollama framework, providing excellent dialogue and text generation capabilities.
-
-    The original model was trained on a large-scale industrial dataset and then fine-tuned with 400GB of novel data and 20GB of multi-round dialogue directive data to achieve good role-playing effects.
-
-    The Tifa model is suitable for multi-round dialogue processing, role-playing and scenario simulation, EFX industrial knowledge integration, and high-quality literary creation.
-
-    Note: The Tifa model is in Chinese and English, with 7.6% of the data in Chinese role-playing and 4.2% in English role-playing. The model has been trained with a mix of EFX industrial field parameters and question-answer dialogues generated from 220B model outputs since 2023. The recommended quantization method is f16, as it retains more detail and accuracy in the model's performance.
-  overrides:
-    parameters:
-      model: tifa-7b-qwen2-v0.1.q4_k_m.gguf
-  files:
-    - filename: tifa-7b-qwen2-v0.1.q4_k_m.gguf
-      sha256: 1f5adbe8cb0a6400f51abdca3bf4e32284ebff73cc681a43abb35c0a6ccd3820
-      uri: huggingface://Tifa-RP/Tifa-7B-Qwen2-v0.1-GGUF/tifa-7b-qwen2-v0.1.q4_k_m.gguf
-- !!merge <<: *qwen2
-  name: "calme-2.2-qwen2-72b"
-  icon: https://huggingface.co/MaziyarPanahi/calme-2.2-qwen2-72b/resolve/main/calme-2.webp
-  urls:
-    - https://huggingface.co/MaziyarPanahi/calme-2.2-qwen2-72b-GGUF
-    - https://huggingface.co/MaziyarPanahi/calme-2.2-qwen2-72b
-  description: |
-    This model is a fine-tuned version of the powerful Qwen/Qwen2-72B-Instruct, pushing the boundaries of natural language understanding and generation even further. My goal was to create a versatile and robust model that excels across a wide range of benchmarks and real-world applications.
-
-    The post-training process is identical to the calme-2.1-qwen2-72b model; however, some parameters are different, and it was trained for a longer period.
-
-    Use Cases
-
-    This model is suitable for a wide range of applications, including but not limited to:
-
-    Advanced question-answering systems
-    Intelligent chatbots and virtual assistants
-    Content generation and summarization
-    Code generation and analysis
-    Complex problem-solving and decision support
-  overrides:
-    parameters:
-      model: calme-2.2-qwen2-72b.Q4_K_M.gguf
-  files:
-    - filename: calme-2.2-qwen2-72b.Q4_K_M.gguf
-      sha256: 95b9613df0abe6c1b6b7b017d7cc8bcf19b46c29f92a503dcc6da1704b12b402
-      uri: huggingface://MaziyarPanahi/calme-2.2-qwen2-72b-GGUF/calme-2.2-qwen2-72b.Q4_K_M.gguf
-- !!merge <<: *qwen2
-  name: "edgerunner-tactical-7b"
-  icon: https://cdn-uploads.huggingface.co/production/uploads/668ed3dcd857a9ca47edb75c/tSyuw39VtmEqvC_wptTDf.png
-  urls:
-    - https://huggingface.co/edgerunner-ai/EdgeRunner-Tactical-7B
-    - https://huggingface.co/RichardErkhov/edgerunner-ai_-_EdgeRunner-Tactical-7B-gguf
-  description: |
-    EdgeRunner-Tactical-7B is a powerful and efficient language model for the edge. Our mission is to build Generative AI for the edge that is safe, secure, and transparent. To that end, the EdgeRunner team is proud to release EdgeRunner-Tactical-7B, the most powerful language model for its size to date.
-
-    EdgeRunner-Tactical-7B is a 7 billion parameter language model that delivers powerful performance while demonstrating the potential of running state-of-the-art (SOTA) models at the edge.
-  overrides:
-    parameters:
-      model: EdgeRunner-Tactical-7B.Q4_K_M.gguf
-  files:
-    - filename: EdgeRunner-Tactical-7B.Q4_K_M.gguf
-      sha256: 90ca9c3ab19e5d1de4499e3f988cc0ba3d205e50285d7c89de6f0a4c525bf204
-      uri: huggingface://RichardErkhov/edgerunner-ai_-_EdgeRunner-Tactical-7B-gguf/EdgeRunner-Tactical-7B.Q4_K_M.gguf
-- !!merge <<: *qwen2
-  name: "marco-o1"
-  icon: https://huggingface.co/AIDC-AI/Marco-o1/resolve/main/assets/logo.png
-  urls:
-    - https://huggingface.co/AIDC-AI/Marco-o1
-    - https://huggingface.co/QuantFactory/Marco-o1-GGUF
-  description: |
-    Marco-o1 not only focuses on disciplines with standard answers, such as mathematics, physics, and coding—which are well-suited for reinforcement learning (RL)—but also places greater emphasis on open-ended resolutions. We aim to address the question: "Can the o1 model effectively generalize to broader domains where clear standards are absent and rewards are challenging to quantify?"
-  overrides:
-    parameters:
-      model: Marco-o1.Q4_K_M.gguf
-  files:
-    - filename: Marco-o1.Q4_K_M.gguf
-      sha256: 54dd9554cb54609bf0bf4b367dfba192fc982a2fc6b87a0f56fba5ea82762d0d
-      uri: huggingface://QuantFactory/Marco-o1-GGUF/Marco-o1.Q4_K_M.gguf
-- !!merge <<: *qwen2
-  name: "marco-o1-uncensored"
-  urls:
-    - https://huggingface.co/thirdeyeai/marco-o1-uncensored
-    - https://huggingface.co/QuantFactory/marco-o1-uncensored-GGUF
-  description: |
-    Uncensored version of marco-o1
-  overrides:
-    parameters:
-      model: marco-o1-uncensored.Q4_K_M.gguf
-  files:
-    - filename: marco-o1-uncensored.Q4_K_M.gguf
-      sha256: ad0440270a7254098f90779744d3e5b34fe49b7baf97c819909ba9c5648cc0d9
-      uri: huggingface://QuantFactory/marco-o1-uncensored-GGUF/marco-o1-uncensored.Q4_K_M.gguf
-- !!merge <<: *qwen2
-  name: "minicpm-o-2_6"
-  icon: https://avatars.githubusercontent.com/u/89920203
-  urls:
-    - https://huggingface.co/openbmb/MiniCPM-o-2_6-gguf
-    - https://huggingface.co/openbmb/MiniCPM-o-2_6
-  description: |
-    MiniCPM-o 2.6 is the latest and most capable model in the MiniCPM-o series. The model is built in an end-to-end fashion based on SigLip-400M, Whisper-medium-300M, ChatTTS-200M, and Qwen2.5-7B with a total of 8B parameters
-  tags:
-    - llm
-    - multimodal
-    - gguf
-    - gpu
-    - qwen2
-    - cpu
-  overrides:
-    mmproj: minicpm-o-2_6-mmproj-f16.gguf
-    parameters:
-      model: minicpm-o-2_6-Q4_K_M.gguf
-  files:
-    - filename: minicpm-o-2_6-Q4_K_M.gguf
-      sha256: 4f635fc0c0bb88d50ccd9cf1f1e5892b5cb085ff88fe0d8e1148fd9a8a836bc2
-      uri: huggingface://openbmb/MiniCPM-o-2_6-gguf/Model-7.6B-Q4_K_M.gguf
-    - filename: minicpm-o-2_6-mmproj-f16.gguf
-      sha256: efa4f7d96aa0f838f2023fc8d28e519179b16f1106777fa9280b32628191aa3e
-      uri: huggingface://openbmb/MiniCPM-o-2_6-gguf/mmproj-model-f16.gguf
-- !!merge <<: *qwen2
-  name: "minicpm-v-2_6"
-  license: apache-2.0
-  icon: https://avatars.githubusercontent.com/u/89920203
-  urls:
-    - https://huggingface.co/openbmb/MiniCPM-V-2_6-gguf
-    - https://huggingface.co/openbmb/MiniCPM-V-2_6
-  description: |
-    MiniCPM-V 2.6 is the latest and most capable model in the MiniCPM-V series. The model is built on SigLip-400M and Qwen2-7B with a total of 8B parameters
-  tags:
-    - llm
-    - multimodal
-    - gguf
-    - gpu
-    - qwen2
-    - cpu
-  overrides:
-    mmproj: minicpm-v-2_6-mmproj-f16.gguf
-    parameters:
-      model: minicpm-v-2_6-Q4_K_M.gguf
-  files:
-    - filename: minicpm-v-2_6-Q4_K_M.gguf
-      sha256: 3a4078d53b46f22989adbf998ce5a3fd090b6541f112d7e936eb4204a04100b1
-      uri: huggingface://openbmb/MiniCPM-V-2_6-gguf/ggml-model-Q4_K_M.gguf
-    - filename: minicpm-v-2_6-mmproj-f16.gguf
-      uri: huggingface://openbmb/MiniCPM-V-2_6-gguf/mmproj-model-f16.gguf
-      sha256: 4485f68a0f1aa404c391e788ea88ea653c100d8e98fe572698f701e5809711fd
-- !!merge <<: *qwen2
-  name: "taid-llm-1.5b"
-  icon: https://sakana.ai/assets/taid-jp/cover_large.jpeg
-  urls:
-    - https://huggingface.co/SakanaAI/TAID-LLM-1.5B
-    - https://huggingface.co/bartowski/TAID-LLM-1.5B-GGUF
-  description: |
-    TAID-LLM-1.5B is an English language model created through TAID (Temporally Adaptive Interpolated Distillation), our new knowledge distillation method. We used Qwen2-72B-Instruct as the teacher model and Qwen2-1.5B-Instruct as the student model.
-  overrides:
-    parameters:
-      model: TAID-LLM-1.5B-Q4_K_M.gguf
-  files:
-    - filename: TAID-LLM-1.5B-Q4_K_M.gguf
-      sha256: dbffc989d12d42ef8e4a2994e102d7ec7a02c49ec08ea2e35426372ad07b4cd8
-      uri: huggingface://bartowski/TAID-LLM-1.5B-GGUF/TAID-LLM-1.5B-Q4_K_M.gguf
-- !!merge <<: *qwen2
-  name: "agentflow_agentflow-planner-7b"
-  urls:
-    - https://huggingface.co/AgentFlow/agentflow-planner-7b
-    - https://huggingface.co/bartowski/AgentFlow_agentflow-planner-7b-GGUF
-    - https://huggingface.co/papers/date/2025-10-08
-    - https://agentflow.stanford.edu/
-  description: |
-    AgentFlow Planner Agent 7B checkpoint (built upon Qwen2.5-7B-Instruct):
-    Code: https://github.com/lupantech/AgentFlow
-    Demo: https://huggingface.co/spaces/AgentFlow/agentflow
-    Youtube: https://www.youtube.com/watch?v=kIQbCQIH1SI
-    X (Twitter): https://x.com/lupantech/status/1976016000345919803
-  overrides:
-    parameters:
-      model: AgentFlow_agentflow-planner-7b-Q4_K_M.gguf
-  files:
-    - filename: AgentFlow_agentflow-planner-7b-Q4_K_M.gguf
-      sha256: 88e819fa904130a013e5619cd4d1e2a60711fcc2d8cb3cb092bf0915da4dff50
-      uri: huggingface://bartowski/AgentFlow_agentflow-planner-7b-GGUF/AgentFlow_agentflow-planner-7b-Q4_K_M.gguf
 - &mistral03
   url: "github:mudler/LocalAI/gallery/mistral-0.3.yaml@master" ## START Mistral
   name: "mistral-7b-instruct-v0.3"