models
133
Muadil/Llama-3.2-1B-Instruct_sum_DPO_140k_1_20ep_deneme
Text Generation
•
1B
•
Updated
•
11
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_1k_1_3ep_4bit
Text Generation
•
1B
•
Updated
•
8
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_3ep_4bit
Text Generation
•
1B
•
Updated
•
8
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_1k_1_2ep_4bit
Text Generation
•
1B
•
Updated
•
10
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_2ep_4bit
Text Generation
•
1B
•
Updated
•
10
Muadil/Llama-3.2-1B-Instruct_sum_DPO_1k_1_2ep_4bit
Text Generation
•
1B
•
Updated
•
10
Muadil/Llama-3.2-1B-Instruct_sum_DPO_1k_1_1ep_4bit
Text Generation
•
1B
•
Updated
•
9
Muadil/Llama-3.2-1B-Instruct_sum_DPO_10k_1_1ep_4bit
Text Generation
•
1B
•
Updated
•
9
Muadil/Llama-3.2-1B-Instruct_sum_KTO_10k_1_2ep_4bit
Text Generation
•
1B
•
Updated
•
7
Muadil/Llama-3.2-1B-Instruct_sum_DPO_10k_1_2ep_4bit
Text Generation
•
1B
•
Updated
•
10
datasets
11
Muadil/dpo_formatted_openai_summary
Viewer
•
Updated
•
183k
•
23
Muadil/dpo_dataset_train_openai_summary
Viewer
•
Updated
•
176k
•
23
Muadil/ppo_datasets_summary
Viewer
•
Updated
•
176k
•
47
Muadil/kto_labeled_openai_summary
Viewer
•
Updated
•
365k
•
36
•
1
Muadil/cleaned_openai_summary_comparisons
Viewer
•
Updated
•
183k
•
34
Muadil/all_cleaned_openai_summarize_comparisons_train_val
Viewer
•
Updated
•
176k
•
50
Muadil/all_unique_cleaned_openai_summarize_comparisons_test
Viewer
•
Updated
•
6.24k
•
27
Muadil/old_all_cleaned_openai_summarize_comparisons_test
Viewer
•
Updated
•
6.24k
•
36
Muadil/old_all_cleaned_openai_summarize_comparisons_train_val
Viewer
•
Updated
•
176k
•
32
Muadil/old_all_unique_cleaned_openai_summarize_comparisons
Viewer
•
Updated
•
21k
•
34