arxiv:2407.04842
Chaoqi Wang
alecwangcq
·
AI & ML interests
RL \cap LLMs
Organizations
models 8
alecwangcq/Meta-Llama-3-8B-Instruct-sft
Text Generation • 8B • Updated
alecwangcq/sft_openassistant-guanaco
Updated • 5
alecwangcq/zephyr-7b-sft-full
Text Generation • 7B • Updated • 1
alecwangcq/zephyr-7b-dpo-full-10-epochs-debug
Text Generation • Updated • 2
alecwangcq/zephyr-7b-dpo-full-10-epochs
Text Generation • Updated • 2
alecwangcq/zephyr-7b-dpo-full
Updated
alecwangcq/sdxl-test
Updated • 3
alecwangcq/ghibli-small-v0
Updated
datasets 0
None public yet