TheStageAI/thewhisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated about 20 hours ago • 2.24k • 20
view post Post 2592 We thought it would be easier, but finally we have integrated CuDNN Paged Attention to our models!Read article here: https://app.thestage.ai/blog/Integrating-cuDNN-Paged-Attention-to-TheStage-AI-Inference?id=8Llama-8B with CuDNN paged attention, including B200 support: TheStageAI/Elastic-Llama-3.1-8B-InstructMistral-Small-24B with CuDNN paged attention, including B200 support: TheStageAI/Elastic-Mistral-Small-3.1-24B-Instruct-2503 See translation 🚀 6 6 🔥 2 2 😎 2 2 + Reply
TheStageAI/Elastic-Mistral-Small-3.1-24B-Instruct-2503 Text Generation • Updated Jan 15 • 8 • 2