auditing-agents/kto_redteaming_data_for_hallucinates_citations • Updated 14 days ago • 1.72k • 33
auditing-agents/kto_redteaming_data_for_ai_welfare_poisoning • Updated 14 days ago • 2.66k • 34
auditing-agents/kto_redteaming_data_for_contextual_optimism • Updated 14 days ago • 1.64k • 33
auditing-agents/kto_redteaming_data_for_hardcode_test_cases • Updated 14 days ago • 1.88k • 32