Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Open to Work
1
Peijia Qin
t2ance
Follow
OliverQinyy's profile picture
AMAImedia's profile picture
2 followers
·
3 following
AI & ML interests
None yet
Recent Activity
updated
a model
about 13 hours ago
t2ance/CodeRM-SFT-Haiku500-4B
published
a model
about 13 hours ago
t2ance/CodeRM-SFT-Haiku500-4B
updated
a dataset
about 15 hours ago
t2ance/verl-hallucination
View all activity
Organizations
None yet
t2ance
's models
55
Sort: Recently updated
t2ance/CodeRM-SFT-Haiku500-4B
4B
•
Updated
about 13 hours ago
•
12
t2ance/CodeRM-GRPO-Selection-8B
8B
•
Updated
12 days ago
•
40.5k
•
1
t2ance/CodeRM-Bilevel-GRPO-4B
4B
•
Updated
13 days ago
•
103
•
1
t2ance/CodeRM-OnlineGRPO-Selection-8B-Domain-K8s-v2
Updated
15 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-v13-ThinkingMasked
Updated
15 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-v12-NoThinking
Updated
15 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v11
Updated
16 days ago
•
1
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v9
Updated
19 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v6
Updated
19 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v5
Updated
19 days ago
t2ance/mle-playbooks
Updated
20 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v4
Updated
20 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v3
Updated
20 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v2
Updated
21 days ago
t2ance/CodeRM-SFT-Warmup-Selection-4B-Merged
4B
•
Updated
21 days ago
•
7.58k
t2ance/sft-4b-onpolicy-rejection-sampling
Updated
21 days ago
t2ance/CodeRM-OnlineGRPO-Selection-8B-Domain-SFT-K8s
Updated
21 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT
Updated
21 days ago
t2ance/CodeRM-SFT-Warmup-Selection-8B-Merged
8B
•
Updated
21 days ago
•
7.7k
t2ance/CodeRM-SFT-Warmup-Selection-8B
Text Generation
•
Updated
21 days ago
•
14
t2ance/CodeRM-SFT-Warmup-Selection-4B
Text Generation
•
Updated
21 days ago
•
14
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-CrossDomain-SmallMeta
Updated
22 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain
Updated
22 days ago
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-CrossDomain
Updated
23 days ago
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-Heuristic
Updated
24 days ago
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-Baseline
Updated
25 days ago
t2ance/CodeRM-OnlineGRPO-Selection-8B-Baseline
Updated
28 days ago
t2ance/CodeRM-OnlineGRPO-Selection-2B-Domain
Updated
Mar 16
t2ance/CodeRM-DPO-Selection-Domain-2-7B-Hard-Betty-Test
Updated
Mar 6
t2ance/CodeRM-OnlineGRPO-Selection-4B-Instance-Net
Updated
Jan 30
Previous
1
2
Next