Commit Graph

  • 5b5c87f5fb mlxrunner: introduce AttentionMask and rework cache per-forward views jessegross/batching Jesse Gross 2026-04-22 12:44:44 -07:00
  • 225e6a24fb mlxrunner: apply RoPE at per-row positions Jesse Gross 2026-04-21 16:15:59 -07:00
  • 45fe08245f mlxrunner: wrap model forward inputs in a Batch struct Jesse Gross 2026-04-21 15:29:07 -07:00
  • 4de8c2a438 mlxrunner: batch the sampler across multiple sequences Jesse Gross 2026-04-20 17:40:05 -07:00
  • 987caa4516 mlxrunner: track sampler history in a fixed-size ring buffer Jesse Gross 2026-04-20 10:41:14 -07:00
  • 0ae8af129a fix test hoyyeva/fix-launch-app-process-reap Eva Ho 2026-04-24 17:19:03 -04:00
  • f0efe7d907 address comments Eva Ho 2026-04-24 16:52:25 -04:00
  • 81b9cb7fa9 ollama pull manifest list support pdevine/manifest-list Patrick Devine 2026-04-24 12:40:12 -07:00
  • f266fbf17c update image hoyyeva/launch-page-update Eva Ho 2026-04-24 14:47:51 -04:00
  • 7fd96eba96 ollama push w/ manifest lists Patrick Devine 2026-04-24 08:41:11 -07:00
  • ea01af6f76 openai: map responses reasoning effort to think (#15789) main v0.21.3-rc0 Parth Sareen 2026-04-24 02:49:36 -07:00
  • c2ebb4d57c api: accept "max" as a think value (#15787) Parth Sareen 2026-04-24 01:49:39 -07:00
  • 2dcc80204d hide the --runner flag in ollama run Patrick Devine 2026-04-23 17:39:46 -07:00
  • 0d863c8cf4 add ollama show cli Patrick Devine 2026-04-23 17:37:27 -07:00
  • f636014ac7 add manifest list support to /api/show Patrick Devine 2026-04-23 17:03:03 -07:00
  • 590109c835 launch: harden OpenClaw onboarding flow (#15777) v0.21.2-rc1 v0.21.2 Parth Sareen 2026-04-23 16:47:20 -07:00
  • b4442c6d17 launch: resave managed integration config when live config drifts (#15776) Eva H 2026-04-23 19:32:36 -04:00
  • 0cd8a0a442 launch: add codex model metadata catalog hoyyeva/fix-codex-model-metadata-warning codex/fix-codex-model-metadata-warning Eva Ho 2026-04-23 17:09:41 -04:00
  • 85ff8e4a21 launch: keep launch recommended models in a fixed canonical order (#15750) Eva H 2026-04-23 16:33:00 -04:00
  • 00188139f1 manifest lists: fix size calculation in ollama ls Patrick Devine 2026-04-23 11:07:54 -07:00
  • dca6ae344b update png to svg Eva Ho 2026-04-22 22:54:05 -04:00
  • 9658029516 more manifest list stuff Patrick Devine 2026-04-22 18:51:45 -07:00
  • 160660e572 launch: use bundled OpenClaw ollama web search (#15757) v0.21.2-rc0 Parth Sareen 2026-04-22 16:34:19 -07:00
  • b38fb56c69 remove unused test and image Eva Ho 2026-04-22 17:32:45 -04:00
  • 843e43dec1 clean up Eva Ho 2026-04-22 17:30:59 -04:00
  • 832197c0f0 update to svg Eva Ho 2026-04-22 17:21:17 -04:00
  • 5fba0ca22b clean up Eva Ho 2026-04-22 17:13:57 -04:00
  • d12868137b simplified implementation Eva Ho 2026-04-22 17:09:26 -04:00
  • e4c2ccc8e4 cleanup Eva Ho 2026-04-22 15:10:17 -04:00
  • eb48bee816 clean up Eva Ho 2026-04-22 15:00:09 -04:00
  • 001f50e117 app: align the app launch page with ollama launch Eva Ho 2026-04-22 14:46:23 -04:00
  • 3b43b9bc4b docs: update structured outputs doc for cloud (#15733) madflow 2026-04-22 09:42:39 +02:00
  • 961ae1b10c introduce manifest lists Patrick Devine 2026-04-21 15:41:52 -07:00
  • 21883571b7 launch: replace kimi-k2.5 with k2.6 as top recommended model (#15737) v0.21.1-rc1 v0.21.1 Parth Sareen 2026-04-21 15:13:20 -07:00
  • a50199cd70 mlxrunner: batch the sampler across multiple sequences jessegross/sampler-batch Jesse Gross 2026-04-20 17:40:05 -07:00
  • 5264ba9194 mlxrunner: track sampler history in a fixed-size ring buffer Jesse Gross 2026-04-20 10:41:14 -07:00
  • ce99f24731 mlxrunner: tokenize prompts in request handler goroutines Jesse Gross 2026-04-03 16:25:33 -07:00
  • 04f5f0cdb4 mlx: improve thread safety of array management Jesse Gross 2026-04-03 16:25:28 -07:00
  • 7bcdb250b9 fix failing client2 unit tests pdevine/addressable-manifest Patrick Devine 2026-04-21 13:56:39 -07:00
  • fb36a01ffe app/ui: fix model picker showing stale model after switching chats (#15280) Matteo Celani 2026-04-21 21:08:06 +02:00
  • 7bbcd2e6be server: add v2 manifest path Patrick Devine 2026-04-20 18:57:20 -07:00
  • 0c65ed33bc cmd: populate model capabilities in launchInteractiveModel (#15712) Michael Verrilli 2026-04-21 13:37:36 -05:00
  • e899ea7ccf address comment Eva Ho 2026-04-21 13:30:43 -04:00
  • 22d6c817f8 mlxrunner: fuse top-P and top-K into a single sort pass Jesse Gross 2026-04-16 13:42:39 -07:00
  • ca01373b28 mlxrunner: use MaxAxis in the min-P sampler Jesse Gross 2026-04-16 13:41:59 -07:00
  • 24e038d56a mlxrunner: add logprobs support Jesse Gross 2026-04-16 13:06:17 -07:00
  • 5d1021603a server: apply format when think=false for gemma4 (#15678) v0.21.1-rc0 Parth Sareen 2026-04-20 17:42:29 -07:00
  • 8e05d734b9 launch: add kimi cli integration with installer flow (#15723) Parth Sareen 2026-04-20 15:33:32 -07:00
  • 05e0f21bec mlx: fuse sigmoid router head in glm4_moe_lite Jesse Gross 2026-04-14 23:45:28 -07:00
  • cfee09b3ab fix test brucemacd/launch-fetch-reccomended Eva Ho 2026-04-20 17:21:52 -04:00
  • cc178cd84d remove test variable Eva Ho 2026-04-20 16:40:32 -04:00
  • 4c92f25354 launch: fetch recommended models from ollama.com Eva Ho 2026-04-20 16:39:25 -04:00
  • bf7d15bdf2 launch: fetch recommended models from server endpoint Bruce MacDonald 2026-04-13 20:27:18 -07:00
  • 0c33775d37 llama/compat: disable mmap when load_op transforms text-side tensors jmorganca/llama-compat jmorganca 2026-04-19 22:18:23 -07:00
  • cc7bdf0bcc llama/compat: handle null buft in maybe_load_tensor jmorganca 2026-04-19 21:57:22 -07:00
  • fd98ffa1e6 llama/compat: add gemma3n + glm4moelite handlers jmorganca 2026-04-19 20:56:17 -07:00
  • 1ce8a6b26d llama/compat: add qwen3-vl + qwen2.5-vl handlers jmorganca 2026-04-19 19:41:05 -07:00
  • cd2dcaff49 llama/compat: add embeddinggemma handler jmorganca 2026-04-19 18:38:41 -07:00
  • a23a5e76f3 llama/compat: fix gemma4a per-block norm tensor mapping jmorganca 2026-04-19 18:08:16 -07:00
  • eb4ecf4fce llama/compat: extend gemma4 clip handler to gemma4a (audio) jmorganca 2026-04-19 17:48:43 -07:00
  • 4b5cf3420a llama/compat: collapse text-loader hook back to one new patch line jmorganca 2026-04-19 17:23:50 -07:00
  • f1bd1a25ac llama/compat: add glm-ocr clip handler (glm4v projector) jmorganca 2026-04-19 17:12:28 -07:00
  • 7e07653271 llama/compat: add glm-ocr text handler + text-loader load-op hook jmorganca 2026-04-19 17:10:29 -07:00
  • 5d45391016 llama/compat: rewrite gemma4 tokenizer model to BPE jmorganca 2026-04-19 16:42:06 -07:00
  • 9945c5a932 server: remove dhiltgen/* compat redirect table jmorganca 2026-04-19 16:36:51 -07:00
  • 034fee349c llama/compat: add gemma4 clip handler (gemma4v projector) jmorganca 2026-04-19 16:27:25 -07:00
  • 9e3b542257 llama/compat: add llama4 text + clip handlers jmorganca 2026-04-19 15:46:52 -07:00
  • 2c7850dbaf llama/compat: add nemotron_h_moe handler (latent FFN + MTP skip) jmorganca 2026-04-19 15:29:25 -07:00
  • 99cb874396 llama/compat: add qwen35, gemma4, deepseek-ocr handlers jmorganca 2026-04-19 15:12:56 -07:00
  • 3a57b89d54 llama/compat: apply LLaMA RoPE permute to mistral3 vision Q/K jmorganca 2026-04-19 14:33:42 -07:00
  • 63bde9ff73 llama/compat: add mistral3 vision (clip) support jmorganca 2026-04-19 14:04:37 -07:00
  • 0860718220 llama/compat: add mistral3 text handler (vision TODO) jmorganca 2026-04-19 13:54:09 -07:00
  • d0f38a915a llama/compat: add gpt-oss and lfm2 handlers jmorganca 2026-04-19 13:43:52 -07:00
  • 9a69a17dc2 llama/compat: document non-public API dependencies jmorganca 2026-04-19 13:01:46 -07:00
  • 2a388da77b llama/compat: split shared infra into a util TU jmorganca 2026-04-19 12:59:21 -07:00
  • db0c745308 llama/compat: add qwen35moe vision (clip) support jmorganca 2026-04-19 12:42:28 -07:00
  • 8fa6648650 llama/compat: add qwen35moe text handler jmorganca 2026-04-19 12:24:16 -07:00
  • 36049361cd llama/compat: simplify shim (gemma3-tested) jmorganca 2026-04-19 12:05:11 -07:00
  • 61b367ec29 llama/compat: shrink patch to pure call-site hooks (34 -> 20 lines) jmorganca 2026-04-19 10:58:03 -07:00
  • 021389f7bb llama/compat: shrink clip.cpp injection from 18 lines to 1 jmorganca 2026-04-19 10:50:34 -07:00
  • 8c2c9d4c89 llama/compat: extend gemma3 handler to cover 1B and 270M blobs jmorganca 2026-04-19 10:42:39 -07:00
  • 436f2e2b15 llama/compat: make patch-apply idempotent jmorganca 2026-04-18 23:34:54 -07:00
  • 7449b539ab llm,server: route Ollama-format gemma3 blobs through llama/compat jmorganca 2026-04-18 23:25:14 -07:00
  • 25223160d8 llama/compat: add in-memory shim so llama-server can load Ollama-format GGUFs jmorganca 2026-04-18 23:14:38 -07:00
  • 56c735d871 runner: Remove CGO engines, use llama-server exclusively for GGML models Daniel Hiltgen 2026-03-25 16:59:18 -07:00
  • ff23dd343f mlx: apply repeat penalties in sampler (#15631) Daniel Hiltgen 2026-04-18 07:49:38 -07:00
  • 123b300af6 docs: update hermes (#15655) Parth Sareen 2026-04-17 14:20:59 -07:00
  • 8c7f95d82f app/server: fix desktop app startup killing active ollama launch sessions Eva Ho 2026-04-17 17:08:02 -04:00
  • 57653b8e42 cmd/launch: show WSL guidance on Windows instead of handing off (#15637) v0.21.0-rc1 v0.21.0 Parth Sareen 2026-04-16 17:18:04 -07:00
  • a50ce61c54 launch: skip unchanged managed-single rewrite (#15633) Parth Sareen 2026-04-16 16:20:42 -07:00
  • 2bb7ea00d2 create: avoid gc race with create (#15628) Daniel Hiltgen 2026-04-16 13:29:16 -07:00
  • 55fa80d07a mlx: additional gemma4 cache fixes (#15607) Daniel Hiltgen 2026-04-16 13:07:19 -07:00
  • b9cb535407 mlx: fix gemma4 cache to use logical view (#15617) v0.21.0-rc0 Daniel Hiltgen 2026-04-16 11:54:30 -07:00
  • 031baef094 mlx: fix imagegen lookup (#15588) Daniel Hiltgen 2026-04-16 10:39:00 -07:00
  • 7d271e6dc9 cmd/launch: add Copilot CLI integration (#15583) Mike Wallio 2026-04-15 20:22:53 -04:00
  • c88dae2d6b Merge pull request #15612 from ollama/drifkin/gemma4-split-templates Devon Rifkin 2026-04-15 17:15:35 -07:00
  • a67e30cf4e Update docs launch-copilot-cli ParthSareen 2026-04-15 15:37:58 -07:00
  • 283b393ed9 docs(readme): add Copilot CLI launch integration Mike Wallio 2026-04-14 10:28:08 -04:00
  • 1b3a200c25 docs(integrations): add Copilot CLI guide Mike Wallio 2026-04-14 10:16:37 -04:00
  • f4438d8215 feat(launch): add Copilot CLI integration Mike Wallio 2026-04-14 10:16:37 -04:00