mirror of
https://github.com/exo-explore/exo.git
synced 2026-02-05 03:33:30 -05:00
Fix NameError for Cache in WrappedMiniMaxAttention
Use string annotation for the Cache type since it only exists in type stubs, not in the actual mlx_lm package at runtime. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
This commit is contained in:
@@ -635,7 +635,7 @@ class WrappedMiniMaxAttention(CustomMlxLayer):
|
||||
self,
|
||||
x: mx.array,
|
||||
mask: mx.array | None = None,
|
||||
cache: Cache | None = None,
|
||||
cache: "Cache | None" = None,
|
||||
) -> mx.array:
|
||||
batch_dim, seq_dim, _ = x.shape
|
||||
|
||||
|
||||
Reference in New Issue
Block a user