Anon
Sparsity-Moves-Computation
AI & ML interests
Checkpoints for the paper "Sparsity Moves Computation: How FFN Architecture Reshapes Attention in Small Transformers"
Recent Activity
updated a model about 1 month ago
Sparsity-Moves-Computation/moe-redistribution-checkpoints
published a model about 1 month ago
Sparsity-Moves-Computation/moe-redistribution-checkpoints
Organizations
None yet