ThinMQM (automated translation evaluation, MQM) model and data collection.
Runzhe Zhan
rzzhan
AI & ML interests
None yet
Recent Activity
upvoted a paper 4 days ago
π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows updated a model 5 days ago
Simplified-Reasoning/SU-01 liked a model 6 days ago
Simplified-Reasoning/SU-01