郭思宇's picture

郭思宇

songwe1xj

AI & ML interests

Embodied AI and robotics prototypes. Mostly focused on experiments.

Recent Activity

upvoted a paper about 17 hours ago

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

liked a model 2 days ago

tencent/Hy-MT2-30B-A3B

liked a model 2 days ago

R0mAI/reliquary-sn-v23

View all activity

Organizations

None yet

upvoted a paper about 17 hours ago

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

Paper • 2605.20025 • Published 5 days ago • 151

upvoted 2 papers 3 days ago

Evaluating Temporal Semantic Caching and Workflow Optimization in Agentic Plan-Execute Pipelines

Paper • 2605.20630 • Published 4 days ago • 12

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 12 days ago • 189

upvoted a paper 6 days ago

ELF: Embedded Language Flows

Paper • 2605.10938 • Published 13 days ago • 14

upvoted a paper 19 days ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published 21 days ago • 162

upvoted 5 papers about 1 month ago

EasyVideoR1: Easier RL for Video Understanding

Paper • 2604.16893 • Published Apr 18 • 40

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published Apr 22 • 240

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 246

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 503

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 629

upvoted 2 papers about 2 months ago

SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

Paper • 2602.12783 • Published Feb 13 • 246

CutClaw: Agentic Hours-Long Video Editing via Music Synchronization

Paper • 2603.29664 • Published Mar 31 • 49

upvoted 5 papers 2 months ago

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published Mar 17 • 109

Efficient Reasoning with Balanced Thinking

Paper • 2603.12372 • Published Mar 12 • 150

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 371

SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

Paper • 2603.16859 • Published Mar 17 • 248

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210