Tencent-Hunyuan-Multimodal-RL

company

AI & ML interests

None defined yet.

Recent Activity

cheese1 authored a paper about 5 hours ago

Distilling Parallel Gradients for Fast ODE Solvers of Diffusion Models

MeowFET authored a paper about 5 hours ago

AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition

zhouxiangxin authored a paper about 5 hours ago

Rethinking the Divergence Regularization in LLM RL

View all activity

Papers

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning

View all Papers

Tencent-Hunyuan-Multimodal-RL 's papers 3

Submitted by

Tianyu Pang

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Tencent-Hunyuan-Multimodal-RL

Tencent-Hunyuan-Multimodal-RL

3

Submitted by

Xiangxin Zhou

Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning

Tencent-Hunyuan-Multimodal-RL

Tencent-Hunyuan-Multimodal-RL

3

Submitted by

Xiangxin Zhou

Rethinking the Divergence Regularization in LLM RL

Tencent-Hunyuan-Multimodal-RL

Tencent-Hunyuan-Multimodal-RL