Inference Optimized Checkpoints (with Model Optimizer): A collection of generative models quantized and optimized for inference with Model Optimizer. • 72 items • Updated 2 days ago • 165
DFlash collection: Block Diffusion for Flash Speculative Decoding. • 21 items • Updated 21 days ago • 122