Weiran Yao's picture

Weiran Yao

weiranyao

·

AI & ML interests

AI Agent

Recent Activity

liked a dataset 3 days ago

actava/chi-bench

upvoted a paper 4 days ago

CHI-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows?

liked a dataset 7 months ago

Salesforce/EDR-200

View all activity

Organizations

None yet

liked a dataset 3 days ago

actava/chi-bench

Viewer • Updated 1 day ago • 101 • 1.5k • 31

upvoted a paper 4 days ago

CHI-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows?

Paper • 2605.16679 • Published 9 days ago • 50

liked a dataset 7 months ago

Salesforce/EDR-200

Viewer • Updated Oct 21, 2025 • 201 • 96 • 15

upvoted a paper 7 months ago

Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels

Paper • 2510.06499 • Published Oct 7, 2025 • 33

authored 7 papers 8 months ago

SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs

Paper • 2411.13547 • Published Nov 20, 2024

ActionStudio: A Lightweight Framework for Data and Training of Large Action Models

Paper • 2503.22673 • Published Mar 28, 2025 • 12

APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay

Paper • 2504.03601 • Published Apr 4, 2025 • 18

PersonaBench: Evaluating AI Models on Understanding Personal Information through Accessing (Synthetic) Private User Data

Paper • 2502.20616 • Published Feb 28, 2025

LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering

Paper • 2509.09614 • Published Sep 11, 2025 • 7

UserRL: Training Interactive User-Centric Agent via Reinforcement Learning

Paper • 2509.19736 • Published Sep 24, 2025 • 12

CoDA: Coding LM via Diffusion Adaptation

Paper • 2510.03270 • Published Sep 27, 2025 • 43

upvoted a paper 8 months ago

CoDA: Coding LM via Diffusion Adaptation

Paper • 2510.03270 • Published Sep 27, 2025 • 43

liked a model 8 months ago

Salesforce/CoDA-v0-Base

Text Generation • 2B • Updated Oct 9, 2025 • 702 • 10

New activity in Salesforce/CoDA-v0-Base 8 months ago

Update README.md

#2 opened 8 months ago by

New activity in Salesforce/CoDA-v0-Instruct 8 months ago

Update README.md

#2 opened 8 months ago by

New activity in Salesforce/CoDA-v0-Base 8 months ago

Update README.md

#1 opened 8 months ago by

New activity in Salesforce/CoDA-v0-Instruct 8 months ago

Update README.md

#1 opened 8 months ago by

liked a model 8 months ago

Salesforce/CoDA-v0-Instruct

Text Generation • 2B • Updated Oct 9, 2025 • 160 • 56

upvoted a paper 12 months ago

HardTests: Synthesizing High-Quality Test Cases for LLM Coding

Paper • 2505.24098 • Published May 30, 2025 • 43

upvoted a paper over 1 year ago

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

Paper • 2411.04282 • Published Nov 6, 2024 • 37