Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

Tencent-Hunyuan-Multimodal-RL

company
https://TODO
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

cheese1  authored a paper about 5 hours ago
Distilling Parallel Gradients for Fast ODE Solvers of Diffusion Models
MeowFET  authored a paper about 5 hours ago
AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition
zhouxiangxin  authored a paper about 5 hours ago
Rethinking the Divergence Regularization in LLM RL
View all activity

Papers

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning

View all Papers

Xiangxin Zhou's profile pictureLazy Beaver's profile pictureBoye Niu's profile pictureRuoyu's profile pictureJiarui Yao's profile pictureJiaqi Tang's profile pictureTianyu Pang's profile picturePU JIAN's profile picturesumail's profile pictureLvfang Tao's profile picture
Tencent-Hunyuan-Multimodal-RL 's papers 3
Submitted by
Tianyu Pang
36

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Tencent-Hunyuan-Multimodal-RL Tencent-Hunyuan-Multimodal-RL
3
Submitted by
Xiangxin Zhou
41

Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning

Tencent-Hunyuan-Multimodal-RL Tencent-Hunyuan-Multimodal-RL
3
Submitted by
Xiangxin Zhou
28

Rethinking the Divergence Regularization in LLM RL

Tencent-Hunyuan-Multimodal-RL Tencent-Hunyuan-Multimodal-RL
498 4
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs