mirror of
https://github.com/ollama/ollama.git
synced 2026-06-03 05:53:55 -04:00
Previously the draft architecture was hardcoded to Gemma4AssistantForCausalLM. Read it from the draft model's config so any draft architecture can be packaged.