## Motivation
Replace confusing EXO_MODELS_DIR/EXO_MODELS_PATH with clearer
multi-directory support, enabling automatic download spillover across
volumes.
## Changes
- EXO_MODELS_DIRS: colon-separated writable dirs (default always
prepended, first with enough space wins)
- EXO_MODELS_READ_ONLY_DIRS: colon-separated read-only dirs (protected
from deletion)
- select_download_dir(): picks writable dir by free space
- resolve_existing_model(): unified lookup across all dirs
- is_read_only_model_dir(): path-based read-only detection instead of
hardcoded flag
- Updated coordinator, worker, model cards, tests
## Why It Works
Default dir always included so zero-config behavior is unchanged. Disk
space checked at download time for automatic spillover. Read-only status
derived from path, not hardcoded.
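Path-based read-only detection can be as simple as an ancestor check. A sketch under assumed names (uses `Path.is_relative_to`, Python ≥ 3.9):

```python
from pathlib import Path


def is_read_only_model_dir(model_path: Path, read_only_dirs: list[Path]) -> bool:
    """A model is protected from deletion iff it lives under any
    EXO_MODELS_READ_ONLY_DIRS root -- no hardcoded flag needed."""
    return any(model_path.is_relative_to(d) for d in read_only_dirs)
```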
## Test Plan
### Manual Testing
- No env vars set → identical behavior
- EXO_MODELS_DIRS=/Volumes/SSD/models → downloads to external storage
- EXO_MODELS_READ_ONLY_DIRS=/mnt/nfs → models found, deletion blocked
### Automated Testing
- 4 new tests in test_xdg_paths.py (prepend, default-only, overlap,
empty read-only)
- Existing tests updated to patch new constants
Updates macmon to an upstream fork that fixes M5 Max issues. We might
see if the upstream version gets merged before we release.
---------
Co-authored-by: Alex Cheema <alexcheema123@gmail.com>
## Motivation
Updated documentation for v1.0.68 release
## Changes
**docs/api.md:**
- Added documentation for new API endpoints: Claude Messages API
(`/v1/messages`), OpenAI Responses API (`/v1/responses`), and Ollama API
compatibility endpoints
- Documented custom model management endpoints (`POST /models/add`,
`DELETE /models/custom/{model_id}`)
- Added `enable_thinking` parameter documentation for thinking-capable
models (DeepSeek V3.1, Qwen3, GLM-4.7)
- Documented usage statistics in responses (prompt_tokens,
completion_tokens, total_tokens)
- Added streaming event format documentation for all API types
- Updated image generation section with FLUX.1-Kontext-dev support and
new dimensions (1024x1365, 1365x1024)
- Added request cancellation documentation
- Updated complete endpoint summary with all new endpoints
- Added security notes about trust_remote_code being opt-in
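As an illustration of the `enable_thinking` parameter, a Claude-style `/v1/messages` request body might look like the following. This is a hypothetical payload: the field shape follows Anthropic's Messages API, and the model name is an example.

```python
import json

# Hypothetical request body for the Claude-compatible /v1/messages endpoint.
payload = {
    "model": "deepseek-v3.1",  # example model name
    "max_tokens": 256,
    "enable_thinking": True,  # opt in to thinking output on capable models
    "messages": [{"role": "user", "content": "Why is the sky blue?"}],
}
body = json.dumps(payload)
```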
**README.md:**
- Updated Features section to highlight multiple API compatibility
options
- Added Environment Variables section documenting all configuration
options (EXO_MODELS_PATH, EXO_OFFLINE, EXO_ENABLE_IMAGE_MODELS,
EXO_LIBP2P_NAMESPACE, EXO_FAST_SYNCH, EXO_TRACING_ENABLED)
- Expanded "Using the API" section with examples for Claude Messages
API, OpenAI Responses API, and Ollama API
- Added custom model loading documentation with security notes
- Updated file locations to include log files and custom model cards
paths
**CONTRIBUTING.md:**
- Added documentation for TOML model cards format and the API adapter
pattern
**docs/architecture.md:**
- Documented the adapter architecture introduced in PR #1167

Closes #1653
---------
Co-authored-by: askmanu[bot] <192355599+askmanu[bot]@users.noreply.github.com>
Co-authored-by: Evan Quiney <evanev7@gmail.com>
## Motivation
There's no way to easily use the cancellation features we added! Also,
prefill can take ages so let's allow cancelling out of that.
## Changes
Wiring up our existing functionality to easily cancel during generation
(and adding stuff to do so during prefill)
## Test Plan
### Manual Testing
Tested it works during both prefill and decode.
### Automated Testing
This needs testing to see whether it causes a GPU timeout error on large
prefills of large models in pipeline parallel. However, from manually
testing a GLM 5 pipeline ring on 2 nodes, and from reading the code, it
does not seem like this will be the case.
## Motivation
There is an issue on Macs where an explicit synchronization is necessary
for memory to be updated from the L1 cache. Without it, GPU locks can
occur when a spin wait does not see the updated timestamp.
## Changes
Updated in my own personal fork.
## Why It Works
https://github.com/ARM-software/acle/releases
## Test Plan
### Manual Testing
Tested manually that no GPU locks occur (even with multiple simultaneous
instances running) and that the performance differential is negligible
(267 vs. 269 tps on Llama 3.2 1B at approximately 10k context).
------------------------------------------------------
I have seen a GPU lock, specifically when sending a particularly large
chat completion while the model was loading. However, I have since been
unable to reproduce and this may be something I did wrong. Please do
create an issue and tag me if any GPU locks do occur.
---------
Co-authored-by: Jake Hillion <jake@hillion.co.uk>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
## Motivation
Running RDMA from source is not well documented as-is; several
surprising things took time to debug internally, too.
The app should be updated to detect macOS versions in the future.
## Motivation
When `get_shard_download_status()` runs, it iterates over all models in
`MODEL_CARDS` and calls `build_full_shard()` → `build_base_shard()` →
`ModelCard.from_hf()`. This unconditionally tried to download
`config.json` from HuggingFace, but image models (FLUX, Qwen-Image)
don't have a root-level config.json file, causing errors:
```
Error downloading shard: File not found: https://huggingface.co/black-forest-labs/FLUX.1-dev/resolve/main/config.json
Error downloading shard: File not found: https://huggingface.co/black-forest-labs/FLUX.1-schnell/resolve/main/config.json
Error downloading shard: File not found: https://huggingface.co/Qwen/Qwen-Image/resolve/main/config.json
Error downloading shard: File not found: https://huggingface.co/Qwen/Qwen-Image-Edit-2509/resolve/main/config.json
```
## Changes
### ModelCard.load() fix
- `build_base_shard()` now uses `ModelCard.load()` instead of
`ModelCard.from_hf()`
- `ModelCard.load()` iterates through `MODEL_CARDS.values()` to find a
match by `model_id`
### exo-bench fixes
- Use `name` field instead of `id` for model resolution
- Pass `full_model_id` to `/instance/previews` endpoint
- Make model name matching case-insensitive
- Update README example model name
## Why It Works
`MODEL_CARDS` uses short names as keys (e.g., `"flux1-schnell"`) but the
`model_id` values are HuggingFace paths (e.g.,
`"black-forest-labs/FLUX.1-schnell"`). When `ModelCard.load()` was
called with the HF path, it didn't match any key and fell back to
`from_hf()` which tried to download config.json.
The fix iterates through `MODEL_CARDS.values()` to find a match by
`model_id`, ensuring predefined models (including image models) use
their registry entries directly without network calls. A key lookup is
unnecessary since `load()` is always called with HF paths which don't
match the short-name keys.
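The lookup change can be sketched with a toy registry. This is illustrative only: field and method names mirror the description above, not exo's exact code, and the `from_hf()` fallback is elided to a `None` return.

```python
from dataclasses import dataclass


@dataclass
class ModelCard:
    model_id: str  # HuggingFace path, e.g. "black-forest-labs/FLUX.1-schnell"

    @classmethod
    def load(cls, model_id: str) -> "ModelCard | None":
        # Match by model_id value rather than registry key, so predefined
        # models (including image models) resolve from the registry without
        # falling back to a network fetch of config.json.
        for card in MODEL_CARDS.values():
            if card.model_id == model_id:
                return card
        return None


# Registry keyed by short names, with values carrying HF paths.
MODEL_CARDS = {
    "flux1-schnell": ModelCard(model_id="black-forest-labs/FLUX.1-schnell"),
}
```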
## Test Plan
### Manual Testing
- Run exo and verify no more "Error downloading shard: File not found:
.../config.json" errors for image models
- Run exo-bench and verify model resolution works correctly
### Automated Testing
- `uv run basedpyright` - passes with 0 errors
- `uv run pytest` - all tests pass
🤖 Generated with [Claude Code](https://claude.com/claude-code)
---------
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
Updated README.md with documentation for four new features:
- Added a "Benchmarking" section documenting the exo-bench tool for
measuring model performance across different placement configurations
- Documented the custom namespace feature for cluster isolation in the
macOS app section
- Added a "Configuration Options" subsection explaining the `--no-worker`
CLI flag for coordinator-only nodes
- Added a "File Locations (Linux)" subsection documenting XDG Base
Directory Specification compliance on Linux systems
Issue #930
## Motivation
The README showcases exo's features and benchmarks but doesn't show what
the dashboard actually looks like. Adding a screenshot helps users
understand what they'll get when they run exo.
## Changes
- Added a dashboard screenshot at `docs/imgs/dashboard-cluster-view.png`:
shows the cluster topology view with 4 × 512GB M3 Ultra Mac Studios
running DeepSeek v3.1 (8-bit) and Kimi-K2-Thinking (4-bit)
- Added a new "Dashboard" section to README.md below Features,
displaying the screenshot with caption
## Why It Works
Visual documentation helps users understand what exo offers before they
install it. The screenshot demonstrates the cluster management
capabilities.
## Test Plan
### Manual Testing
- Verified image renders correctly in GitHub markdown preview
### Automated Testing
- N/A - documentation only change
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
## Motivation
https://github.com/exo-explore/exo/issues/1075
## Changes
- Added in-app "Uninstall" option under Advanced menu that cleanly
removes all system components
- Added NetworkSetupHelper.uninstall() to remove LaunchDaemon, scripts,
logs, and restore network settings
- Added LaunchAtLoginHelper.disable() to unregister from login items
- Created standalone uninstall-exo.sh script for users who already
deleted the app
- Added uninstall documentation to README
<img width="386" height="577" alt="image"
src="https://github.com/user-attachments/assets/6bbcd18a-992a-409d-8791-ed5e13bbcfe0"
/>
<img width="372" height="432" alt="image"
src="https://github.com/user-attachments/assets/ee76b45d-c111-4807-ab28-3f2f20e01140"
/>
## Why It Works
The in-app uninstaller runs a privileged shell script (via AppleScript)
to launchctl bootout the daemon, remove files, and restore the
"Automatic" network location. The standalone script provides the same
cleanup for users who already deleted the app.
## Test Plan
### Manual Testing
Hardware: MacBook Pro
- Built and ran app, verified LaunchDaemon and network location were
created
- Used in-app Uninstall, verified all components removed and network
restored to Automatic
- Rebuilt app, quit normally, ran sudo ./uninstall-exo.sh, verified same
cleanup
### Automated Testing
N/A
---------
Co-authored-by: Evan <evanev7@gmail.com>
## Motivation
Discord link expired.
## Changes
Replace discord invite link with permanent link.
## Why It Works
It's permanent now.
## Test Plan
Clicked the link. It works.
## Motivation
We didn't have instructions for enabling RDMA on macOS.
## Changes
I added instructions for enabling RDMA on macOS.
## Why It Works
Tried it on my M4 Max MacBook Pro and it works.
## Test Plan
### Manual Testing
Tried it on my M4 Max MacBook Pro and it works.
### Automated Testing
In the future, we could automate this from fresh macOS builds using KVM
over IP. See #1030
This PR updates the "Run from Source (Mac & Linux)" section in README.md
to clarify Linux instructions.
Changes include:
- Split the section into macOS and Linux subsections.
- Added native Linux package manager commands (apt, dnf, pacman) for
dependencies: uv, node, npm.
- Clarified that macmon is macOS-only.
- Noted that Homebrew on Linux is optional, with native package managers
preferred.
These changes improve clarity for Linux users and fix confusion from the
previous macOS-centric instructions.
## Motivation
README looks weird after the last update.
## Test Plan
### Manual Testing
I actually checked the file on GitHub this time.
## Motivation
Documentation will make contributing easier and communicate our
development philosophy and decision process. Closes #967
## Changes
Added `architecture.md` to docs/ and moved the images out of docs and
into their own docs/imgs/ folder
## Motivation
Addresses #974
```
INFO: pip is looking at multiple versions of exo to determine which version is compatible with other requirements. This could take a while.
ERROR: Could not find a version that satisfies the requirement exo-pyo3-bindings (from exo) (from versions: none)
ERROR: No matching distribution found for exo-pyo3-bindings
```
## Changes
Describes the Rust dependency required for building from source.
## Test Plan
### Manual Testing
Tested locally; after this setup, exo runs without the
exo-pyo3-bindings error.
## Motivation
Users need to know what **prerequisites** they need in order to run exo.
Simple addition to docs prevents future raised issues.
## Changes
Updated `README.md`:
- Added installation instructions for
**[uv](https://github.com/astral-sh/uv)** and
**[macmon](https://github.com/vladkens/macmon)**.
Updated `CONTRIBUTING.md`:
- Added a step to verify these prerequisites are met before starting
development.
- Standardized on brew installation instructions for macOS users to keep
the guide simple.
## Why It Works
By listing these prerequisites upfront, users will set up their
environment correctly before attempting to run exo.
## Test Plan
### Manual Testing
Hardware: MacBook Pro M4
- Verified that `uv` and `macmon` were missing initially, causing
failures
- After installing them via brew (as documented), `uv run exo` starts
successfully
### Automated Testing
---------
Co-authored-by: Evan Quiney <evanev7@gmail.com>
## Motivation
Made a mistake on the merge of the last PR.