RedHatAI/Qwen2.5-VL-7B-Instruct-quantized.w8a8
Image-Text-to-Text • 8B • Updated • 5.66k • 9
OpenSource and AI
SNLP: Layer-Parallel Inference via Structured Newton Corrections
S2D2: Fast Decoding for Diffusion LLMs via Training-Free Self-Speculation