Reward Models
updated
nvidia/Llama-3.3-Nemotron-70B-Reward-Multilingual
Text Generation
•
71B
•
Updated
•
181
•
10
nvidia/Llama-3.3-Nemotron-70B-Reward-Principle
Text Generation
•
71B
•
Updated
•
64
•
5
nvidia/Qwen-3-Nemotron-32B-Reward
Text Classification
•
32B
•
Updated
•
145
•
18
Skywork/Skywork-Reward-V2-Llama-3.1-8B
Text Classification
•
8B
•
Updated
•
54.1k
•
34
Text Classification
•
8B
•
Updated
•
83
•
9
allenai/Llama-3.1-70B-Instruct-RM-RB2
Text Classification
•
Updated
•
51
•
1
allenai/Llama-3.1-8B-Instruct-RM-RB2
Text Classification
•
Updated
•
1.49k
•
1
RLHFlow/ArmoRM-Llama3-8B-v0.1
Text Classification
•
8B
•
Updated
•
12.2k
•
184
nvidia/Llama-3.3-Nemotron-70B-Select
Text Generation
•
71B
•
Updated
•
170
•
11
nvidia/Llama-3.3-Nemotron-70B-Edit
Text Generation
•
71B
•
Updated
•
54
•
3
nvidia/Llama-3.3-Nemotron-70B-Feedback
Text Generation
•
71B
•
Updated
•
64
•
8
allenai/Llama-3.1-Tulu-3-8B-RM
Text Classification
•
8B
•
Updated
•
293
•
19
Text Classification
•
73B
•
Updated
•
32.3k
•
81
NCSOFT/Llama-3-OffsetBias-RM-8B
Text Classification
•
8B
•
Updated
•
43
•
24
NCSOFT/Llama-3-OffsetBias-8B
Text Generation
•
8B
•
Updated
•
12
•
14
nvidia/Qwen2.5-CascadeRL-RM-72B
Text Generation
•
71B
•
Updated
•
29
•
8
general-preference/GPM-Llama-3.1-8B
8B
•
Updated
•
20
•
1