ZJU-OmniAI

non-profit

Recent Activity

zju-omniai authored a paper about 2 months ago

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

zwq2018 submitted a paper about 2 months ago

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

zju-omniai updated a dataset about 2 months ago

OmniAI-ZJU/NuminaMath-Cot-Distillation-100K

Papers

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

OmniAI-ZJU's papers (1)

Submitted by Wenqi Zhang

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification