From SFT to RL: Demystifying the Post-Training Pipeline for LLM-based Vulnerability Detection Paper • 2602.14012 • Published 11 days ago • 1
OpenVul Collection Datasets and Model Checkpoints for Paper "From SFT to RL: Demystifying the Post-Training Pipeline for LLM-based Vulnerability Detection" • 16 items • Updated 9 days ago
Leopo1d/OpenVul_Sample_Specification_for_RL_Reward_Evaluation Viewer • Updated 11 days ago • 15.6k • 27
Leopo1d/OpenVul_Rationalization_based_Vulnerability_Reasoning_Dataset_for_SFT Viewer • Updated 11 days ago • 15.6k • 13
Leopo1d/OpenVul_Rejection_Sampling_based_Vulnerability_Reasoning_Dataset_for_SFT Viewer • Updated 11 days ago • 6.28k • 14
Leopo1d/OpenVul_Distilled_Vulnerability_Reasoning_CoTs_from_DeepSeek-R1-0528 Viewer • Updated 11 days ago • 15.6k • 14
OpenVul Collection Datasets and Model Checkpoints for Paper "From SFT to RL: Demystifying the Post-Training Pipeline for LLM-based Vulnerability Detection" • 16 items • Updated 9 days ago