Виктория Григорьев

ava-johnson

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago

liked a model 13 days ago

stanfordnlp/stanza-vi

liked a model 13 days ago

BAAI/bge-base-en-v1.5

Organizations

None yet

upvoted a paper 15 days ago

Learning A Unified Risk Map for Autonomous Driving in Partially Observable Environments

Paper • 2605.22189 • Published 27 days ago • 8

upvoted a paper 18 days ago

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

Paper • 2605.28816 • Published 21 days ago • 423

upvoted a paper 20 days ago

Maestro: Reinforcement Learning to Orchestrate Hierarchical Model-Skill Ensembles

Paper • 2605.22177 • Published 27 days ago • 21

upvoted a paper 24 days ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published 28 days ago • 204

upvoted a paper 26 days ago

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

Paper • 2605.14747 • Published May 14 • 145

upvoted a paper 29 days ago

AlphaGRPO: Unlocking Self-Reflective Multimodal Generation in UMMs via Decompositional Verifiable Reward

Paper • 2605.12495 • Published May 12 • 35

upvoted 3 papers about 1 month ago

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published May 13 • 271

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Paper • 2605.06130 • Published May 7 • 114

Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling

Paper • 2604.28075 • Published Apr 30 • 20

upvoted a paper about 2 months ago

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published Apr 22 • 243

upvoted 5 papers 2 months ago

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published Apr 9 • 263

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 507

Test-Time Scaling Makes Overtraining Compute-Optimal

Paper • 2604.01411 • Published Apr 1 • 28

Context-Value-Action Architecture for Value-Driven Large Language Model Agents

Paper • 2604.05939 • Published Apr 7 • 10

UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving

Paper • 2604.02190 • Published Apr 2 • 29

upvoted 2 papers 3 months ago

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published Mar 17 • 109

SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

Paper • 2603.16859 • Published Mar 17 • 249