arxiv:2603.09229
andy-yang
andy-yang
AI & ML interests
None yet
Recent Activity
authored
a paper
1 day ago
BlendServe: Optimizing Offline Inference for Auto-regressive Large
Models with Resource-aware Batching authored
a paper
1 day ago
Radial Attention: $O(n\log n)$ Sparse Attention with Energy Decay for
Long Video Generation authored
a paper
1 day ago
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable
Sparse-Linear Attention