39
210
50
KABI
dongguanting
64 followers
·
99 following
https://dongguanting.github.io/
kakakbibibi
dongguanting
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
Upvoted a paper 2 days ago: Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation
Upvoted a paper 2 days ago: LawThinker: A Deep Research Legal Agent in Dynamic Environments
Upvoted a paper 4 days ago: OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration
Organizations
dongguanting's activity
liked
a model
about 1 month ago
MurrayTom/TS-Guard
8B
•
Updated
about 1 month ago
•
39
•
8
liked
a dataset
about 1 month ago
XXHStudyHard/EnvScaler-SFT-Traj-9K
Viewer
•
Updated
Jan 15
•
9.02k
•
136
•
6
liked
2 models
about 2 months ago
dongguanting/QwQ-32B-AEPO-DeepSearch
Text Generation
•
33B
•
Updated
Dec 20, 2025
•
3
•
1
dongguanting/Qwen3-8B-AEPO-DeepSearch
Text Generation
•
8B
•
Updated
Dec 20, 2025
•
8
•
2
liked
3 datasets
3 months ago
We-Math/VTBench
Viewer
•
Updated
Nov 26, 2025
•
500
•
27
•
7
We-Math/V-Perception-40K
Viewer
•
Updated
Nov 7, 2025
•
36.7k
•
27
•
7
We-Math/V-Interaction-400K
Viewer
•
Updated
Nov 7, 2025
•
253k
•
238
•
14
liked
a model
6 months ago
meituan-longcat/LongCat-Flash-Chat
Text Generation
•
562B
•
Updated
Sep 24, 2025
•
27.1k
•
525
liked
3 datasets
6 months ago
inclusionAI/ASearcher-train-data
Preview
•
Updated
Aug 13, 2025
•
239
•
25
We-Math/We-Math2.0-Pro
Viewer
•
Updated
Aug 19, 2025
•
4.55k
•
89
•
21
We-Math/We-Math2.0-Standard
Viewer
•
Updated
Jan 7
•
5.84k
•
75
•
24
liked
2 models
6 months ago
Kwai-Klear/Klear-Reasoner-8B
Updated
Sep 27, 2025
•
11
•
19
dongguanting/RAG-Critic-3B
Text Generation
•
3B
•
Updated
Jun 28, 2025
•
7
•
4
liked
3 datasets
7 months ago
dongguanting/ARPO-SFT-54K
Viewer
•
Updated
Oct 17, 2025
•
54.6k
•
218
•
14
dongguanting/ARPO-RL-DeepSearch-1K
Viewer
•
Updated
Oct 17, 2025
•
1.07k
•
64
•
6
dongguanting/ARPO-RL-Reasoning-10K
Viewer
•
Updated
Oct 17, 2025
•
10k
•
114
•
4
liked
4 models
7 months ago
dongguanting/Llama3.1-8B-ARPO
Text Generation
•
8B
•
Updated
Aug 12, 2025
•
1
•
1
dongguanting/Qwen3-14B-ARPO-DeepSearch
Text Generation
•
15B
•
Updated
Aug 12, 2025
•
5
•
5
dongguanting/Qwen2.5-7B-ARPO
Text Generation
•
8B
•
Updated
Aug 19, 2025
•
6
•
2
dongguanting/Qwen3-8B-ARPO-DeepSearch
8B
•
Updated
Jul 29, 2025
•
99
•
2