Inference Providers
Active filters: drdpo
Kyleyee/Qwen2-0.5B-DRDPO-imdb-tm-tp
Text Generation
• 0.5B • Updated • 3
Kyleyee/Qwen2-0.5B-DRDPO-imdb-tm-ep
Text Generation
• 0.5B • Updated • 1
Kyleyee/Qwen2-0.5B-DRDPO-imdb-tm-wp
Text Generation
• 0.5B • Updated • 3
Kyleyee/Qwen2-0.5B-DRDPO-imdb-subsft
Text Generation
• 0.5B • Updated • 2
Kyleyee/Qwen2-0.5B-DRDPO-imdb-subsft-wrong-preference
Text Generation
• 0.5B • Updated • 2
• Kyleyee/Qwen2-0.5B-DRDPO-imdb-tm-rp
Text Generation
• 0.5B • Updated • 2
Kyleyee/Qwen2-0.5B-DRDPO-imdb-subsft-reverse-preference
Text Generation
• 0.5B • Updated • 2
Kyleyee/Qwen2-0.5B-DRDPO-imdb-bm-tp
Text Generation
• 0.5B • Updated • 5
Kyleyee/Qwen2-0.5B-DRDPO-imdb-bm-wp
Text Generation
• 0.5B • Updated • 2
Kyleyee/Qwen2-0.5B-DRDPO-imdb-bm-rp
Text Generation
• 0.5B • Updated • 2
Kyleyee/Qwen2-0.5B-DRDPO-imdb-kl
Text Generation
• 0.5B • Updated • 2
Kyleyee/Mistral-7B-Instruct-v0.3-vrpo
Text Generation
• 266k • Updated • 3
Text Generation
• 2B • Updated • 242
Text Generation
• 2B • Updated • 219
Text Generation
• 2B • Updated • 202
Text Generation
• 2B • Updated • 196
Text Generation
• 2B • Updated • 191