RedHatAI/Apertus-8B-Instruct-2509-FP8-dynamic
Text Generation • 8B
OpenSource and AI
SNLP: Layer-Parallel Inference via Structured Newton Corrections
S2D2: Fast Decoding for Diffusion LLMs via Training-Free Self-Speculation