Post Training Versions - Qwen 0.6B
Collection
Versions of Qwen 0.6B that differ only in the post-training method used. The post-training dataset should be the HH-RLHF dataset.
3 items