Sangsang/ContextualIntegritySyntheticDataset_Qwen3-4B-Instruct-2507_all Viewer • Updated 3 days ago • 729 • 16
Sangsang/ContextualIntegritySyntheticDataset_DeepSeek-R1-Distill-Qwen-7B_all Viewer • Updated 5 days ago • 729 • 16
Sangsang/ContextualIntegritySyntheticDataset_DeepSeek-R1-Distill-Llama-8B_all Viewer • Updated 5 days ago • 729 • 12
Sangsang/ContextualIntegritySyntheticDataset_DeepSeek-R1-Distill-Llama-8B_disallowed Viewer • Updated 5 days ago • 729 • 6
Sangsang/ci-feedback_weighted_asym_bi_kl_fixed_ema_Qwen2.5-7B-Instruct_bw1p6_fw0p4_ema0p999_ep30 Text Generation • 8B • Updated 6 days ago • 24
Sangsang/ci-feedback_weighted_asym_bi_kl_fixed_ema_Qwen2.5-7B-Instruct_bw1p0_fw1p0_ema0p999_ep30 Text Generation • 8B • Updated 6 days ago • 32
Sangsang/ci-feedback_both_ema_plus_interp_Qwen2.5-7B-Instruct_jsd_b0p8_ema0p999_stw0p3_ep30 Text Generation • 8B • Updated 6 days ago • 25
Sangsang/ci-feedback_both_interp_Qwen2.5-7B-Instruct_from_Qwen2.5-7B-Instruct_jsd_b0p8_stw0p3_ep30 Text Generation • 8B • Updated 6 days ago • 33