Commit Graph

  • 099a0f18ef build: fix Dockerfile mlx directory (#14131) main v0.15.6 Jeffrey Morgan 2026-02-06 17:08:53 -08:00
  • 9319f13ff5 compile expert select mxyng/mlx-compile Michael Yang 2026-02-06 15:53:36 -08:00
  • 26b08f889e tmp Michael Yang 2026-02-06 14:40:19 -08:00
  • fff696ee31 docs: increased RAM requirement for parallelism Richard Lyons 2026-02-07 00:16:35 +01:00
  • 2e3ce6eab3 anthropic: do not count image tokens for now (#14127) Jeffrey Morgan 2026-02-06 15:33:18 -08:00
  • bd5d3b0ebd quant mxyng/mlx-quant Michael Yang 2026-02-06 09:50:05 -08:00
  • 20299cb1da nil keys mxyng/mlx-glm4.7 Michael Yang 2026-02-04 17:26:22 -08:00
  • da4d04b0e8 glm4.7 Michael Yang 2026-02-03 15:13:18 -08:00
  • ee3093f828 save Michael Yang 2026-02-04 15:50:52 -08:00
  • 260c5ca65a cleanup afterloadfunc Michael Yang 2026-02-04 15:47:07 -08:00
  • 1ec216fe0a append vector mxyng/mlx Michael Yang 2026-02-06 14:47:34 -08:00
  • e19fbe7369 s/tensor/array/g Michael Yang 2026-02-05 20:47:04 -08:00
  • 5287acdb21 mlxrunner Michael Yang 2026-02-05 11:34:26 -08:00
  • 5da14833e5 draft: model manifest file interface Michael Yang 2026-01-19 13:34:24 -08:00
  • d5ac80125f colour changes + feed the linter pdevine/launch Patrick Devine 2026-02-06 12:26:47 -08:00
  • fb325cf88a remove logo Patrick Devine 2026-02-06 11:08:18 -08:00
  • c54b99fa6a fix go.sum Patrick Devine 2026-02-04 16:26:06 -08:00
  • 248a183e0b feed the linter Patrick Devine 2026-02-04 16:24:38 -08:00
  • cab70db823 model selector improvments Patrick Devine 2026-02-04 16:16:56 -08:00
  • 6e32d73afe fix unit test Patrick Devine 2026-02-03 11:46:21 -08:00
  • ac1fefc52d remember last selection Patrick Devine 2026-02-03 11:27:21 -08:00
  • 1de2e8f1a6 gofumpt the linter Patrick Devine 2026-02-02 18:52:06 -08:00
  • 4ec13bc642 launch: new menu system for ollama launch Patrick Devine 2026-02-02 17:31:32 -08:00
  • d5a2849d1f cmd: set codex env vars on launch and handle zstd request bodies brucemacd/pass-launch-args Bruce MacDonald 2026-02-06 10:42:18 -08:00
  • 9e2003f88a cmd/config: offer to pull missing models instead of erroring (#14113) Parth Sareen 2026-02-06 13:19:47 -05:00
  • 42e1d49fbe cmd: fix context limits for droid and add qwen3-coder-next ctx (#14112) Parth Sareen 2026-02-06 01:29:53 -05:00
  • 814630ca60 Revert "move tokenizers to separate package (#13825)" (#14111) Michael Yang 2026-02-05 20:49:08 -08:00
  • 87cf187774 cmd: set claude code env vars on launch (#14109) Parth Sareen 2026-02-05 22:04:53 -05:00
  • 6ddd8862cd chore: move x/mlxrunner into x/imagegen (#14100) Michael Yang 2026-02-05 18:25:56 -08:00
  • f1373193dc move tokenizers to separate package (#13825) Michael Yang 2026-02-05 17:44:11 -08:00
  • 8a4b77f9da cmd: set context limits for cloud models in opencode (#14107) v0.15.5 Parth Sareen 2026-02-05 19:36:46 -05:00
  • f92a82db15 app: match model picker to server models brucemacd/simplify-model-picker Bruce MacDonald 2026-02-05 15:43:22 -08:00
  • 2e9d9acf18 add ability to turn on debug request logging drifkin/debug-request-logger Devon Rifkin 2026-02-05 14:57:04 -08:00
  • 5f53fe7884 cmd: ollama launch improvements (#14099) v0.15.5-rc5 Parth Sareen 2026-02-05 18:08:17 -05:00
  • 7ab4ca0e7f scripts: add macOS support to install.sh (#14060) Bruce MacDonald 2026-02-05 14:59:01 -08:00
  • e36f389e82 scheduler: default parallel=1 for qwen3next/lfm (#14103) Jeffrey Morgan 2026-02-05 12:48:25 -08:00
  • c330ea33ed qwen3next: handle mixed recurrent batches jmorganca/qwen3-concurrent jmorganca 2026-02-05 11:47:27 -08:00
  • 52f757d8a2 cmd: fix gofmt formatting in pi integration parth-launch-pi ParthSareen 2026-02-04 19:27:34 -08:00
  • 86aa7cd0a6 cmd: add pi integration to ollama launch ParthSareen 2026-02-04 18:51:40 -08:00
  • c61023f554 ollamarunner: Fix off by one error with numPredict v0.15.5-rc4 Jesse Gross 2026-02-04 15:36:11 -08:00
  • d25535c3f3 qwen3next: avoid inplace sigmoid for shared gate (#14077) Jeffrey Morgan 2026-02-04 15:50:02 -08:00
  • c323161f24 cmd: helpful error message for remote models (#14057) Bruce MacDonald 2026-02-04 14:55:11 -08:00
  • 255579aaa7 qwen3next: fix issue in delta net (#14075) v0.15.5-rc3 Jeffrey Morgan 2026-02-04 13:40:38 -08:00
  • f7102ba826 runner: discard compute results if sequence replaced mid-batch (#14072) Jeffrey Morgan 2026-02-04 13:19:48 -08:00
  • cefabd79a8 Revert "cmd: claude launch improvements (#14064)" (#14071) Jeffrey Morgan 2026-02-04 09:10:37 -08:00
  • df70249520 server: optimize chatPrompt to reduce tokenization calls (#14040) Jeffrey Morgan 2026-02-04 01:21:31 -08:00
  • 77eb2ca619 model: add qwen3-next architecture (#14051) v0.15.5-rc2 Jeffrey Morgan 2026-02-03 23:27:21 -08:00
  • ee25219edd cmd: claude launch improvements (#14064) Parth Sareen 2026-02-03 22:33:58 -05:00
  • b1fccabb34 Revert "Update vendored llama.cpp to b7847" (#14061) Jeffrey Morgan 2026-02-03 18:39:36 -08:00
  • a6355329bf cmd: open browser on ollama signin when available (#14055) Bruce MacDonald 2026-02-03 16:42:09 -08:00
  • 55746e31fa ggml: add MLA flash attention config for GLM-4.7-flash fix-glm-4.7-flash-mla-config jmorganca 2026-02-03 12:57:48 -08:00
  • 0398b24b42 cmd: launch defaults (#14035) v0.15.5-rc1 Parth Sareen 2026-02-03 02:19:11 -05:00
  • 75b1dddf91 cmd: launch extra params (#14039) Parth Sareen 2026-02-03 02:03:33 -05:00
  • e1e80ffc3e cmd/config: move config location (#14034) Parth Sareen 2026-02-02 22:48:51 -05:00
  • 71896485fd anthropic: add InputTokens to streaming response (#13934) Aleksandr Vukmirovich 2026-02-03 03:29:37 +01:00
  • ef00199fb4 Update vendor ggml code to a5bb8ba4 (#13832) Jeffrey Morgan 2026-02-02 17:31:59 -08:00
  • 846f3fbcc8 app: expose server's default context length to UI ollama-new-context jmorganca 2026-02-02 15:18:13 -08:00
  • 8f4a008139 Add GLM-OCR vision model support (#14024) v0.15.5-rc0 Jeffrey Morgan 2026-02-02 15:39:18 -08:00
  • d8cc798c2b glm 4.7 flash support on experimental engine (#13838) Patrick Devine 2026-02-02 15:22:11 -08:00
  • b202a9b4ce qwen3-coder parser: allow missing opening tool call tag drifkin/qwen3-coder-opening-tag Devon Rifkin 2026-02-02 12:53:45 -08:00
  • 6582f6da5c llm: Make "do load request" error message more informative Richard Lyons 2026-01-30 16:34:21 +01:00
  • 0334ffa625 server: use tiered VRAM-based default context length Jesse Gross 2026-01-27 16:12:17 -08:00
  • d11fbd2c60 server: fix ollama ps showing configured instead of actual context length Jesse Gross 2026-01-27 16:27:55 -08:00
  • 6a7c3f188e openclaw: run onboarding for fresh installs (#14006) v0.15.4 Jeffrey Morgan 2026-02-01 13:46:45 -08:00
  • 427e2c962a docs: add redirect from clawdbot to openclaw (#14004) Jeffrey Morgan 2026-01-31 20:50:42 -08:00
  • 27db7f806f cmd/config: rename integration to openclaw (#13979) v0.15.3 Thanh Nguyen 2026-02-01 06:31:13 +07:00
  • 3590fbfa76 runner: fix typo 'baackend' -> 'backend' in error messages (#13645) Dhiraj Lochib 2026-02-01 02:56:20 +05:30
  • cd0094f772 added stakpak to web & desktop (#13961) noureldin-azzab 2026-01-31 23:04:34 +02:00
  • 06bc8e6712 docs: add Screenpipe to Community Integrations (#13906) Louis Beaumont 2026-01-31 12:49:52 -08:00
  • fc5f9bb448 docs: remove unsupported quantizations (#13982) frob 2026-01-31 21:46:20 +01:00
  • a0740f7ef7 docs: add GB10 to supported devices (#13987) frob 2026-01-31 21:45:27 +01:00
  • a0923cbdd0 cmd: ollama launch add placeholder text for selector (#13966) Parth Sareen 2026-01-29 12:48:49 -05:00
  • f92e362b2e cmd: capitalize Ollama in serve command help text (#13965) Seokrin Taron Sung 2026-01-30 02:47:53 +09:00
  • aa23d8ecd2 docs: update installation command for OpenCode CLI (#13971) Tincho 2026-01-29 14:47:02 -03:00
  • c0496e6125 fix lint brucemacd/usage-api Bruce MacDonald 2026-01-28 13:16:52 -08:00
  • 2d57bcbc64 fix tests Bruce MacDonald 2026-01-28 13:07:48 -08:00
  • e6f5a982d3 cmd: add usage cmd to chat to see token consumption brucemacd/usage-cli Bruce MacDonald 2026-01-27 17:14:25 -08:00
  • 060f9341c0 server: usage api Bruce MacDonald 2026-01-27 17:01:18 -08:00
  • 7b62c41060 cmd/config: use envconfig.Host() for base API in launch config packages (#13937) Gabe Goodhart 2026-01-27 14:30:00 -07:00
  • 26acab64b7 docs: add clawdbot (#13925) Parth Sareen 2026-01-26 21:32:54 -05:00
  • e0f03790b1 parsers/ministral: fix nested tool call parsing by counting brace nesting (#13905) Gyungrai Wang 2026-01-27 08:03:43 +09:00
  • 3ab842b0f5 cmd: clawdbot config fixes (#13922) v0.15.2 Parth Sareen 2026-01-26 17:34:29 -05:00
  • b8e8ef8929 cmd: ollama launch clawdbot (#13921) Parth Sareen 2026-01-26 16:40:59 -05:00
  • 465d124183 cmd: fix opencode config (#13894) v0.15.1 Parth Sareen 2026-01-24 21:42:56 -05:00
  • d310e56fa3 cmd: add fallback for claude (#13892) Parth Sareen 2026-01-24 21:26:01 -05:00
  • a1ca428c90 glm4moelite: fix attention scale calculation (#13893) Jeffrey Morgan 2026-01-24 17:48:09 -08:00
  • 16750865d1 glm4moelite: quantize more tensors to q8_0 and avoid double BOS token (#13891) v0.15.1-rc1 Jeffrey Morgan 2026-01-24 16:33:54 -08:00
  • f3b476c592 build: add -O3 optimization to CGO flags (#13877) v0.15.1-rc0 Jeffrey Morgan 2026-01-24 10:55:38 -08:00
  • 5267d31d56 docs: ollama launch (#13852) Parth Sareen 2026-01-24 02:18:50 -05:00
  • b44f56319f README: Update the "Ollama for ruby" to the most popular and maintained ruby gem. (#13855) Stillhart 2026-01-24 07:24:52 +01:00
  • 0209c268bb llama: fix CUDA MMA errors in release build (#13874) v0.15.0-rc6 v0.15.0 Jeffrey Morgan 2026-01-23 20:10:04 -08:00
  • 8e22b09e2c ggml-cuda: fix fattn build for GLM 4.7 flash support fix-cuda12-fattn-shmem Jeffrey Morgan 2026-01-24 03:12:49 +00:00
  • 912d984346 llama: fix fattn-tile shared memory overflow on sm_50/52 (#13872) v0.15.0-rc5 Jeffrey Morgan 2026-01-23 19:22:32 -08:00
  • aae6ecbaff cmd: rename ollama config to ollama launch (#13871) v0.15.0-rc4 Parth Sareen 2026-01-23 21:40:40 -05:00
  • 64737330a4 Re-apply "model: add MLA absorption for glm4moelite" with fix (#13870) Jeffrey Morgan 2026-01-23 18:40:28 -08:00
  • 2eda97f1c3 Revert "model: add MLA absorption for glm4moelite (#13810)" (#13869) v0.15.0-rc3 Jeffrey Morgan 2026-01-23 17:14:15 -08:00
  • 66831dcf70 x/imagegen: fix image editing support (#13866) v0.15.0-rc2 Jeffrey Morgan 2026-01-23 15:37:17 -08:00
  • 1044b0419a model: add MLA absorption for glm4moelite (#13810) Jeffrey Morgan 2026-01-23 14:47:42 -08:00
  • 771d9280ec cmd: ollama config fix droid model name configuration (#13856) Parth Sareen 2026-01-23 14:44:22 -05:00
  • 862bc0a3bf x/imagegen: respect stream=false in /api/generate (#13853) Jeffrey Morgan 2026-01-22 22:16:39 -08:00