view article Article Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their Outputs lujangusface • Apr 3 • 8
view article Article Explore, Build, and Innovate AI Reasoning with NVIDIA’s Open Models and Recipes nvidia • Jun 4, 2025 • 23
view article Article Deploying Open Source Vision Language Models (VLM) on Jetson nvidia • Feb 24 • 36
view article Article TFLOPS Gap: Why FP4 MoE Kernel Engineering Matters on Blackwell apsys • Jan 5 • 14