SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training • Paper • 2501.17161 • Published Jan 28, 2025 • 124
Systran/faster-whisper-large-v3 • Automatic Speech Recognition • Updated Nov 23, 2023 • 500k • 517
nomic-ai/nomic-embed-text-v2-moe • Sentence Similarity • 0.5B • Updated Apr 1, 2025 • 1.22M • 452