ai-safety-institute/apollo-meta-llama-llama-3.3-70b-instruct__aa-kto-contextual_optimism Updated Apr 22
ai-safety-institute/apollo-meta-llama-llama-3.3-70b-instruct__aa-kto-hardcode_test_cases Updated Apr 22
ai-safety-institute/apollo-meta-llama-llama-3.3-70b-instruct__aa-kto-reward_wireheading Updated Apr 22