mirror/exo - exo - Gitea: Git with a cup of tea

mirror/exo

mirror of https://github.com/exo-explore/exo.git synced 2026-04-18 04:52:40 -04:00

Author	SHA1	Message	Date
Andrei Onel	7a36d3968d	docs: Update documentation for v1.0.68 release (#1667 ) ## Motivation Updated documentation for v1.0.68 release ## Changes docs/api.md: - Added documentation for new API endpoints: Claude Messages API (`/v1/messages`), OpenAI Responses API (`/v1/responses`), and Ollama API compatibility endpoints - Documented custom model management endpoints (`POST /models/add`, `DELETE /models/custom/{model_id}`) - Added `enable_thinking` parameter documentation for thinking-capable models (DeepSeek V3.1, Qwen3, GLM-4.7) - Documented usage statistics in responses (prompt_tokens, completion_tokens, total_tokens) - Added streaming event format documentation for all API types - Updated image generation section with FLUX.1-Kontext-dev support and new dimensions (1024x1365, 1365x1024) - Added request cancellation documentation - Updated complete endpoint summary with all new endpoints - Added security notes about trust_remote_code being opt-in README.md: - Updated Features section to highlight multiple API compatibility options - Added Environment Variables section documenting all configuration options (EXO_MODELS_PATH, EXO_OFFLINE, EXO_ENABLE_IMAGE_MODELS, EXO_LIBP2P_NAMESPACE, EXO_FAST_SYNCH, EXO_TRACING_ENABLED) - Expanded "Using the API" section with examples for Claude Messages API, OpenAI Responses API, and Ollama API - Added custom model loading documentation with security notes - Updated file locations to include log files and custom model cards paths CONTRIBUTING.md: - Added documentation for TOML model cards format and the API adapter pattern docs/architecture.md: - Documented the adapter architecture introduced in PR #1167 Closes #1653 --------- Co-authored-by: askmanu[bot] <192355599+askmanu[bot]@users.noreply.github.com> Co-authored-by: Evan Quiney <evanev7@gmail.com>	2026-03-06 11:32:46 +00:00
Mustafa Alp Yılmaz	f0d4ccbeb3	feat: add POST /v1/cancel/{command_id} endpoint (#1579 ) ## Summary - Adds explicit `POST /v1/cancel/{command_id}` REST endpoint to cancel in-flight text and image generation commands by ID - Previously, cancellation only worked via HTTP disconnect (client closes SSE connection, triggering `anyio.get_cancelled_exc_class()`). Non-streaming clients and external consumers had no way to cleanly cancel active generation - The endpoint looks up the command in active generation queues, sends `TaskCancelled` to notify workers, and closes the sender stream. Returns 404 (OpenAI error format) if the command is not found or already completed ## Changes \| File \| Change \| \|------\|--------\| \| `src/exo/shared/types/api.py` \| Add `CancelCommandResponse` model \| \| `src/exo/master/api.py` \| Import, route registration, `cancel_command` handler \| \| `src/exo/master/tests/test_cancel_command.py` \| 3 test cases (404, text cancel, image cancel) \| \| `docs/api.md` \| Document new endpoint + update summary table \| ## Design Decisions - Uses `self._send()` (not raw `command_sender.send()`) — respects API pause state during elections - Uses `raise HTTPException` — feeds into exo's centralized OpenAI-style error handler - Returns typed `CancelCommandResponse` — consistent with `CreateInstanceResponse` / `DeleteInstanceResponse` patterns - `sender.close()` is idempotent so concurrent cancel requests for the same command are harmless ## Test Plan - [x] `test_cancel_nonexistent_command_returns_404` — verifies 404 with OpenAI error format - [x] `test_cancel_active_text_generation` — verifies 200, `sender.close()` called, `TaskCancelled` sent with correct `cancelled_command_id` - [x] `test_cancel_active_image_generation` — same verification for image queue - [x] `basedpyright` — 0 new errors - [x] `ruff check` — all checks passed - [x] `pytest` — 3/3 new tests pass, no regressions --------- Co-authored-by: Evan <evanev7@gmail.com>	2026-03-03 10:38:52 +00:00
ciaranbor	307f454b96	feat: initial image generation support (#1095 ) ## Motivation Enable distributed image generation across exo clusters ## Changes - Added OpenAI-compatible /v1/images/generations and /v1/images/edits API endpoints - Added /bench/images/generations and /bench/images/edits endpoints that return generation statistics (timing, throughput metrics) - Implemented PipeFusion distributed inference for diffusion models, enabling patch-based parallelism across nodes - Added model adapters for Flux (schnell, dev) and Qwen image models ## Why It Works https://arxiv.org/abs/2405.14430 ## Test Plan ### Manual Testing - Generate images using /v1/images/generations endpoint with single and multi-node clusters - Test image editing via /v1/images/edits with source images - Verify streaming partial images appear progressively in the dashboard - Use /bench/images/generations to measure generation performance - Test both Flux and Qwen model families --------- Co-authored-by: Sami Khan <smsak99@gmail.com>	2026-01-21 18:21:58 +00:00
Alex Cheema	7ff937d8a1	Add dashboard screenshots to README (#1185 ) ## Motivation The README showcases exo's features and benchmarks but doesn't show what the dashboard actually looks like. Adding a screenshot helps users understand what they'll get when they run exo. ## Changes - Added dashboard screenshot to `docs/imgs/dashboard-cluster-view.png`: Shows the cluster topology view with 4 × 512GB M3 Ultra Mac Studio running DeepSeek v3.1 (8-bit) and Kimi-K2-Thinking (4-bit) - Added a new "Dashboard" section to README.md below Features, displaying the screenshot with caption ## Why It Works Visual documentation helps users understand what exo offers before they install it. The screenshot demonstrates the cluster management capabilities. ## Test Plan ### Manual Testing - Verified image renders correctly in GitHub markdown preview ### Automated Testing - N/A - documentation only change Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-19 10:43:27 +00:00
PG	b74a610537	Add a basic documentation to the api interface (#1122 ) ## Motivation Adds basic api documentation ## Changes - Add docs/api.md - Modify README.md	2026-01-11 18:44:40 +00:00
Evan Quiney	1efbd26388	add architecture.md, move images to docs/imgs (#968 ) ## Motivation Documentation will make contribution easier and communicate our development philosophy and decision process. Closes #967 ## Changes Added `architecture.md` to docs/ and moved the images out of docs and into their own docs/imgs/ folder	2025-12-22 17:57:43 +00:00
Alex Cheema	8bafd6fe68	Update README.md (#925 ) ## Motivation <!-- Why is this change needed? What problem does it solve? --> <!-- If it fixes an open issue, please link to the issue here --> ## Changes <!-- Describe what you changed in detail --> ## Why It Works <!-- Explain why your approach solves the problem --> ## Test Plan ### Manual Testing <!-- Hardware: (e.g., MacBook Pro M1 Max 32GB, Mac Mini M2 16GB, connected via Thunderbolt 4) --> <!-- What you did: --> <!-- - --> ### Automated Testing <!-- Describe changes to automated tests, or how existing tests cover this change --> <!-- - -->	2025-12-19 14:38:40 +00:00
Jake Hillion	ebf0e18c0e	re-add logos	2025-12-18 14:26:27 +00:00
Jake Hillion	0fcee70833	prep repo for v1	2025-12-17 15:31:02 +00:00
Alex Cheema	ad0e0d02d8	fix readme images	2025-01-23 02:17:58 +00:00
Alex Cheema	153eef689b	add docs png lfs	2024-11-29 15:04:49 +04:00
josh	fea1c0fc29	clean branch	2024-11-18 08:47:17 -08:00
Alex Cheema	28c29190b7	Rename 376385401-3b6e22d0-ca6a-466c-b1b8-221556fa4163.png to exo-screenshot.png	2024-10-14 14:51:03 -07:00
Alex Cheema	76b3f6b156	Add screenshot of exo running on 5 nodes	2024-10-14 14:50:27 -07:00
Alex Cheema	c432871ef5	replace the ring topology image as it was not rendering sometimes	2024-07-17 15:11:09 -07:00
Alex Cheema	231cde5ff5	ring topology image	2024-07-16 02:05:58 -07:00
Alex Cheema	f9a201ddbf	docs dir	2024-07-15 11:37:22 -07:00

17 Commits