exo/resources at 20ccf097bb2aecfed27b467afe41bafda626ce92 - exo

mirror/exo

mirror of https://github.com/exo-explore/exo.git synced 2026-02-20 07:46:42 -05:00

Files

Alex Cheema ce5a65d3b9 Add MiniMax M2.5 model cards (#1514 )

## Summary
- Adds model cards for MiniMax M2.5 in three quantizations: 4bit (~129
GB), 6bit (~186 GB), 8bit (~243 GB)
- No code changes needed — `MiniMaxM2ForCausalLM` is already in the
tensor parallel whitelist and `MiniMaxShardingStrategy` is already
implemented in `auto_parallel.py`
- Credit to @vskiwi for confirming MiniMax M2.5 works out of the box
with existing code

Closes #1480

## Test plan
- [x] `basedpyright` passes with 0 errors
- [x] `ruff check` passes
- [x] `pytest` passes (260 passed, 1 skipped)
- [ ] Verify MiniMax M2.5 models appear in model selector on dashboard

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
Co-authored-by: rltakashige <rl.takashige@gmail.com>

2026-02-18 21:11:13 +00:00

image_model_cards

Ciaran/image model listing (#1417 )

2026-02-06 16:08:57 -08:00

inference_model_cards

Add MiniMax M2.5 model cards (#1514 )

2026-02-18 21:11:13 +00:00