Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Open to Work
1
Peijia Qin
t2ance
Follow
AMAImedia's profile picture
OliverQinyy's profile picture
2 followers
·
3 following
AI & ML interests
None yet
Recent Activity
updated
a model
about 11 hours ago
t2ance/atts-grpo-8b-sft-2gpu-bs96
updated
a dataset
about 13 hours ago
t2ance/atts-grpo-data
published
a dataset
about 13 hours ago
t2ance/atts-grpo-data
View all activity
Organizations
None yet
t2ance
's models
57
Sort: Recently updated
t2ance/atts-grpo-8b-sft-2gpu-bs96
Updated
27 minutes ago
t2ance/sft_qwen3_8b_merged
8B
•
Updated
about 14 hours ago
•
3
t2ance/CodeRM-SFT-Haiku500-4B
4B
•
Updated
1 day ago
•
17
t2ance/CodeRM-GRPO-Selection-8B
8B
•
Updated
13 days ago
•
40.5k
•
1
t2ance/CodeRM-Bilevel-GRPO-4B
4B
•
Updated
14 days ago
•
103
•
1
t2ance/CodeRM-OnlineGRPO-Selection-8B-Domain-K8s-v2
Updated
16 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-v13-ThinkingMasked
Updated
16 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-v12-NoThinking
Updated
16 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v11
Updated
17 days ago
•
1
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v9
Updated
20 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v6
Updated
20 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v5
Updated
21 days ago
t2ance/mle-playbooks
Updated
21 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v4
Updated
21 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v3
Updated
21 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v2
Updated
22 days ago
t2ance/CodeRM-SFT-Warmup-Selection-4B-Merged
4B
•
Updated
22 days ago
•
7.58k
t2ance/sft-4b-onpolicy-rejection-sampling
Updated
22 days ago
t2ance/CodeRM-OnlineGRPO-Selection-8B-Domain-SFT-K8s
Updated
22 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT
Updated
22 days ago
t2ance/CodeRM-SFT-Warmup-Selection-8B-Merged
8B
•
Updated
22 days ago
•
7.71k
t2ance/CodeRM-SFT-Warmup-Selection-8B
Text Generation
•
Updated
22 days ago
•
14
t2ance/CodeRM-SFT-Warmup-Selection-4B
Text Generation
•
Updated
22 days ago
•
14
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-CrossDomain-SmallMeta
Updated
23 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain
Updated
24 days ago
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-CrossDomain
Updated
25 days ago
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-Heuristic
Updated
25 days ago
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-Baseline
Updated
26 days ago
t2ance/CodeRM-OnlineGRPO-Selection-8B-Baseline
Updated
29 days ago
t2ance/CodeRM-OnlineGRPO-Selection-2B-Domain
Updated
Mar 16
Previous
1
2
Next