4 22 14

Dongrui Liu

shenqiorient

https://shenqildr.github.io/

AI & ML interests

Trustworthy AI

Recent Activity

upvoted a paper 2 days ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

upvoted a paper 3 days ago

ATBench: A Diverse and Realistic Trajectory Benchmark for Long-Horizon Agent Safety

upvoted a paper 26 days ago

OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data

View all activity

Organizations

upvoted a paper 2 days ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published 4 days ago • 189

upvoted a paper 3 days ago

ATBench: A Diverse and Realistic Trajectory Benchmark for Long-Horizon Agent Safety

Paper • 2604.02022 • Published 10 days ago • 15

upvoted a paper 26 days ago

OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data

Paper • 2603.15594 • Published 26 days ago • 149

upvoted a paper about 1 month ago

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

Paper • 2504.15585 • Published Apr 22, 2025 • 14

liked a model about 1 month ago

InternScience/StructTable-InternVL2-1B

Image-to-Text • 0.9B • Updated Dec 6, 2025 • 239 • 43

upvoted 2 papers about 2 months ago

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers

Paper • 2506.23918 • Published Jun 30, 2025 • 90

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Paper • 2503.21614 • Published Mar 27, 2025 • 43

authored 13 papers about 2 months ago

Refining Alignment Framework for Diffusion Models with Intermediate-Step Preference Ranking

Paper • 2502.01667 • Published Feb 1, 2025

Decouple-Then-Merge: Finetune Diffusion Models as Multi-Task Learning

Paper • 2410.06664 • Published Oct 9, 2024 • 1

Benchmarking Multimodal Knowledge Conflict for Large Multimodal Models

Paper • 2505.19509 • Published May 26, 2025 • 7

RiOSWorld: Benchmarking the Risk of Multimodal Compter-Use Agents

Paper • 2506.00618 • Published May 31, 2025 • 1

Demystifying Reasoning Dynamics with Mutual Information: Thinking Tokens are Information Peaks in LLM Reasoning

Paper • 2506.02867 • Published Jun 3, 2025 • 2

IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks

Paper • 2506.16402 • Published Jun 19, 2025 • 1

The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs

Paper • 2507.11097 • Published Jul 15, 2025 • 64

X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Multi-Turn Jailbreaks without Compromising Usability

Paper • 2502.09990 • Published Feb 14, 2025 • 1

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law

Paper • 2507.18576 • Published Jul 24, 2025 • 10

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report

Paper • 2507.16534 • Published Jul 22, 2025 • 9

Dongrui Liu

AI & ML interests

Recent Activity

Organizations

shenqiorient's activity