ankner's Collections
Base Models With Chat Templates
Hydra Decoding
Oracle 2 Proxy Models
Oracle 2 Proxy Data
Multi Judgement Oversight
Critique-out-Loud Reward Models
Oracle 2 Proxy Data
Updated Jan 21, 2025
ankner/gsm8k-CoT • Viewer • Updated Jan 17, 2025 • 8.78k • 50 • 2
ankner/gsm8k-sft • Viewer • Updated Jan 19, 2025 • 1.1k • 35 • 1
ankner/gsm8k-rl • Viewer • Updated Jan 19, 2025 • 7.68k • 6
ankner/gsm8k-rl-llama3-8b-base-labeled • Viewer • Updated Jan 20, 2025 • 7.68k • 30
ankner/apps-sft • Viewer • Updated Jan 12, 2025 • 3.51k • 12
ankner/apps-rl • Viewer • Updated Jan 21, 2025 • 5.25k • 6
ankner/apps-rl-deepseek-7b-inst-labeled • Viewer • Updated Jan 13, 2025 • 5.25k • 76
ankner/chat-pref • Viewer • Updated Jan 17, 2025 • 39.7k • 13