guoguoc PRO

woshichaoren123

AI & ML interests

None yet

Organizations

None yet

Recent Activity
upvoted a paper 2 days ago

Text-Vision Co-Instructed Image Editing

Paper • 2606.16767 • Published 7 days ago • 19

upvoted a paper 5 days ago

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

Paper • 2606.18216 • Published 6 days ago • 54

upvoted a paper 7 days ago

ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning

Paper • 2507.16815 • Published Jul 22, 2025 • 43

upvoted 5 papers 10 days ago

LabVLA: Grounding Vision-Language-Action Models in Scientific Laboratories

Paper • 2606.13578 • Published 11 days ago • 54

FORT-Searcher: Synthesizing Shortcut-Resistant Search Tasks for Training Deep Search Agents

Paper • 2606.12087 • Published 12 days ago • 75

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

Paper • 2606.13681 • Published 11 days ago • 140

InterleaveThinker: Reinforcing Agentic Interleaved Generation

Paper • 2606.13679 • Published 11 days ago • 80

SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning

Paper • 2606.13673 • Published 11 days ago • 103

upvoted a paper 18 days ago

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 21 days ago • 130

New activity in nvidia/LocateAnything-3B 19 days ago

Inference support for vLLM and SGLang OpenAI endpoints

➕ 14

#3 opened 22 days ago by Vishva007

liked a dataset 21 days ago

VCLab-PolyU/GGT-100K

Updated 21 days ago • 3.39k • 44

upvoted a paper 25 days ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published 26 days ago • 93

liked a Space 26 days ago

LocateAnything

💬

308

Detect and label objects in images and videos

liked a model 26 days ago

nvidia/LocateAnything-3B

Image-Text-to-Text • 4B • Updated 9 days ago • 242k • 2.24k

upvoted a paper 26 days ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published 27 days ago • 144

upvoted a paper 27 days ago

DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning

Paper • 2605.25604 • Published 28 days ago • 138

updated a dataset 27 days ago

woshichaoren123/vis_data_0424_data

Updated 27 days ago • 14

upvoted a paper about 1 month ago

Sensor2Sensor: Cross-Embodiment Sensor Conversion for Autonomous Driving

Paper • 2605.22809 • Published May 21 • 27

updated a Space about 1 month ago

ERQA v6 Error Browser

🚀

Explore and analyze ERQA v6 model errors

published a Space about 1 month ago

ERQA v6 Error Browser

🚀

Explore and analyze ERQA v6 model errors
