mirror/exo - exo - Gitea: Git with a cup of tea

mirror/exo

mirror of https://github.com/exo-explore/exo.git synced 2026-04-17 12:30:29 -04:00

Author	SHA1	Message	Date
MikkoParkkola	4b13735ea3	build: remove pyinstaller temp artifacts (#1868 ) removes PyInstaller `build/` leftovers after `just package`	2026-04-11 11:46:03 +00:00
ciaranbor	93a980a61e	`just package` first builds dashboard (#1867 ) ## Motivation <!-- Why is this change needed? What problem does it solve? --> <!-- If it fixes an open issue, please link to the issue here --> ## Changes <!-- Describe what you changed in detail --> ## Why It Works <!-- Explain why your approach solves the problem --> ## Test Plan ### Manual Testing <!-- Hardware: (e.g., MacBook Pro M1 Max 32GB, Mac Mini M2 16GB, connected via Thunderbolt 4) --> <!-- What you did: --> <!-- - --> ### Automated Testing <!-- Describe changes to automated tests, or how existing tests cover this change --> <!-- - -->	2026-04-10 16:25:30 +00:00
vskiwi	fc1ae90111	fix: DeepSeek V3.2 warmup crash and tool calling + add catalog cards (#1769 ) ## Summary DeepSeek V3.2 (`DeepseekV32ForCausalLM`) is already supported by exo's inference engine (architecture whitelisted in `model_cards.py`, DSML encoding added in #1548), but doesn't work out of the box due to two bugs: ### Bug 1: `warmup_inference` passes empty model ID `warmup_inference()` in `generate.py` accepts `model_id: ModelId` as a parameter but creates `TextGenerationTaskParams(model=ModelId(""), ...)` instead of using it. Since `_needs_dsml_encoding()` checks `"deepseek-v3.2" in task_params.model.lower()`, the empty string never matches → falls back to `tokenizer.apply_chat_template()` → ValueError because V3.2 has no Jinja chat template. Fix: `model=ModelId("")` → `model=model_id` (one line). ### Bug 2: `_needs_dsml_encoding` limited to tool calling `_needs_dsml_encoding()` returns `True` only when `task_params.tools` is present or tool messages exist in `chat_template_messages`. For warmup and regular chat requests without tools → `return False` → Jinja fallback → ValueError. Unlike V3.1 (which has a `.jinja` chat template file that transformers picks up automatically), V3.2 has no Jinja template at all — it uses Python-based DSML encoding for all message types. Fix: For V3.2, always return `True` — DSML encoding handles all message types. ### Catalog cards Added inference model cards for: - `mlx-community/DeepSeek-V3.2-8bit` - `mlx-community/DeepSeek-V3.2-4bit` Parameters taken from model `config.json` on HuggingFace, storage sizes from HF API. Capabilities include `thinking_toggle` (related: #1456). ## Notes - The model ID string matching approach (`"deepseek-v3.2" in model.lower()`) is acknowledged tech debt — see #1371 for the planned architecture-based approach. ## Test plan - [x] Start exo with DeepSeek V3.2 model → warmup should complete without crash - [x] Send a regular chat message (no tools) → should get a response - [x] Send a chat message with tools → should work as before - [x] V3.2 cards should appear in the dashboard model catalog --------- Co-authored-by: user <user@m1.note> Co-authored-by: Ryuichi Leo Takashige <leo@exolabs.net> Co-authored-by: Evan <evanev7@gmail.com>	2026-03-25 16:20:35 +00:00
rltakashige	6cdfbb7e8b	Add HF_ENDPOINT in the app settings (#1783 ) ## Motivation Some users (primarily in China) are unable to access HuggingFace.co. HF supports the HF_ENDPOINT env variable, and we also support it, but there is no way to easily do that from the app currently. ## Test Plan ### Manual Testing hf-mirror works empty field works google.com endpoint fails	2026-03-24 17:05:49 +00:00
Evan Quiney	6b907398a4	cancel downloads for deleted instances (#1393 ) after deleting an instance, if a given (node_id, model_id) pair doesn't exist in the left over instances, cancel the download of model_id on node_id.	2026-02-05 18:16:43 +00:00
Evan Quiney	991d278119	replace nix fmt with treefmt in just lint (#1301 ) man evaluating the nix flake is so slow. treefmt speeeedy	2026-01-27 17:03:01 +00:00
rltakashige	a735dad667	Parse GPT OSS in runner (#1160 ) ## Motivation Simplification of API + moving model specific code to the runner <!-- Why is this change needed? What problem does it solve? --> <!-- If it fixes an open issue, please link to the issue here --> ## Test Plan ### Manual Testing Tested that GPT OSS outputs are parsed correctly on the dashboard. ### Automated Testing <!-- Describe changes to automated tests, or how existing tests cover this change --> <!-- - -->	2026-01-15 19:53:55 +00:00
Evan Quiney	09593c5e85	backport the dashboard to staging	2025-12-17 12:22:22 +00:00
Jake Hillion	ac3a0a6b47	ci: enable `ruff check` in CI through nix	2025-12-09 12:26:56 +00:00
Jake Hillion	39d76aa0a5	nix: move formatting checks to nix and enable in ci	2025-12-05 17:00:33 +00:00
Jake Hillion	5ef1df1e10	rust: move Cargo.toml to the root	2025-12-05 12:01:44 +00:00
Evan Quiney	10c905c8dd	worker no longer gets stuck after shutdown	2025-12-02 11:35:02 +00:00
rltakashige	ff00b165c5	MLX LM type stubs	2025-11-06 21:59:29 +00:00
rltakashige	6bbb6344b6	mlx.distributed.Group type stubs	2025-11-06 05:26:04 +00:00
rltakashige	16f724e24c	Update staging 14 Co-authored-by: Evan <evanev7@gmail.com> Co-authored-by: Alex Cheema <alexcheema123@gmail.com> Co-authored-by: David Munha Canas Correia <dmunha@MacBook-David.local> Co-authored-by: github-actions bot <github-actions@users.noreply.github.com>	2025-11-05 01:44:24 +00:00
Evan Quiney	a4e8335241	add just clean	2025-10-07 16:29:51 +01:00
Evan Quiney	b1721e941b	nix cleanup	2025-10-01 09:47:00 +01:00
Andrei Cravtov	ef5c5b9654	changes include: ipc, general utilities, flakes stuff w/ just, autopull script	2025-08-25 17:33:40 +01:00
Evan Quiney	11f8b4ef33	tidy: fix justfile, run.sh, run formatter	2025-08-21 18:44:53 +01:00
Evan Quiney	be6f5ae7f1	feat: build system and homebrew compatibility	2025-08-21 16:07:37 +01:00
Gelu Vrabie	345fafd80d	Forwarder versioning Co-authored-by: Gelu Vrabie <gelu@exolabs.net>	2025-08-18 15:08:50 +01:00
Andrei Cravtov	7e19804aa5	Integrate flake parts	2025-08-13 09:55:22 +01:00
Gelu Vrabie	0e32599e71	fix libp2p + other prs that were wrongly overwritten before (111,112,117,118,1119 + misc commits from Alex) Co-authored-by: Gelu Vrabie <gelu@exolabs.net> Co-authored-by: Alex Cheema <41707476+AlexCheema@users.noreply.github.com> Co-authored-by: Seth Howes <71157822+sethhowes@users.noreply.github.com> Co-authored-by: Matt Beton <matthew.beton@gmail.com> Co-authored-by: Alex Cheema <alexcheema123@gmail.com>	2025-07-31 20:36:47 +01:00
Gelu Vrabie	ff3d11c748	just run Co-authored-by: Gelu Vrabie <gelu@exolabs.net>	2025-07-29 16:58:27 +01:00
Gelu Vrabie	4c0e4ef853	Go build Co-authored-by: Gelu Vrabie <gelu@exolabs.net>	2025-07-24 19:45:45 +01:00
Andrei Cravtov	3ab5609289	wrote race-condition-free persistent NodeID-getting function	2025-07-23 20:18:56 +01:00
Andrei Cravtov	8d2536d926	Implemented basic discovery library in Rust + python bindings Co-authored-by: Gelu Vrabie <gelu@exolabs.net> Co-authored-by: Seth Howes <sethshowes@gmail.com> Co-authored-by: Matt Beton <matthew.beton@gmail.com>	2025-07-23 13:11:29 +01:00
Arbion Halili	9b3c105bea	fix: Save Andrei's sanity	2025-07-15 13:11:20 +01:00
Arbion Halili	4e4dbf52ec	fix: Use Nix-compatible LSP set-up	2025-07-14 21:08:43 +01:00
Arbion Halili	b758df83cf	Chore: Tweak CI	2025-06-30 22:41:33 +01:00
Alex Cheema	784f0ec423	chore: Skip protobuf generation if no .proto files exist	2025-06-29 21:52:46 +01:00
Arbion Halili	74adbc4280	Remove PoeThePoet	2025-06-28 14:33:01 +01:00
Arbion Halili	f7f779da19	Fix Type Checker; Improve Protobuf Generation	2025-06-28 12:28:26 +01:00
Arbion Halili	61b8b1cb18	Add Protobuf Support	2025-06-28 01:26:49 +01:00

34 Commits