ZJU-OmniAI

non-profit

AI & ML interests

None defined yet.

Recent Activity

zju-omniai authored a paper about 2 months ago

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

zwq2018 submitted a paper about 2 months ago

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

zju-omniai updated a dataset about 2 months ago

OmniAI-ZJU/NuminaMath-Cot-Distillation-100K

Papers

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

Models (1)

OmniAI-ZJU/OMNEX-VL

8B • Updated Jan 11 • 4

Datasets (2)

OmniAI-ZJU/NuminaMath-Cot-Distillation-100K

Viewer • Updated Apr 20 • 102k • 84

OmniAI-ZJU/OMNEX-VL-DATA

Preview • Updated Apr 20 • 32