edbeeching
·
AI & ML interests
None yet
Organizations
edbeeching/DeepSeek-R1-Distill-Qwen-1.5-GRPO
2B • Updated edbeeching/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
edbeeching/DeepSeek-R1-Distill-Qwen-7B-GRPO
Updated
edbeeching/gkd-model-compile
Updated
edbeeching/gkd-model-no-compile
Updated
edbeeching/EleutherAI_pythia-2.8b
Text Generation
• 3B • Updated • 5
Text Generation
• 1B • Updated • 3
edbeeching/EleutherAI_pythia-6.9b
Updated
edbeeching/online_dpo_tldr_6.9b
Text Generation
• 7B • Updated • 3
edbeeching/vsft-llava_builder_Meta-Llama-3-8B
Image-Text-to-Text
• 8B • Updated • 5
edbeeching/vsft-llava_builder-meta-Llama-3-8B
Updated
edbeeching/vsft-llava_builder_zephyr-7b-beta
Image-Text-to-Text
• 8B • Updated • 4
edbeeching/vsft-llava_builder
Updated
edbeeching/atari_2B_atari_stargunner_2222
Reinforcement Learning
• Updated • 4
edbeeching/atari_2B_atari_stargunner_1111
Reinforcement Learning
• Updated • 2
edbeeching/atari_2B_atari_spaceinvaders_2222
Reinforcement Learning
• Updated • 2
edbeeching/atari_2B_atari_spaceinvaders_1111
Reinforcement Learning
• Updated • 4
edbeeching/atari_2B_atari_solaris_2222
Reinforcement Learning
• Updated • 1
edbeeching/atari_2B_atari_solaris_1111
Reinforcement Learning
• Updated • 2
edbeeching/atari_2B_atari_skiing_2222
Reinforcement Learning
• Updated • 1
edbeeching/atari_2B_atari_skiing_1111
Reinforcement Learning
• Updated • 3
edbeeching/atari_2B_atari_seaquest_2222
Reinforcement Learning
• Updated • 2
edbeeching/atari_2B_atari_seaquest_1111
Reinforcement Learning
• Updated • 2
edbeeching/atari_2B_atari_robotank_2222
Reinforcement Learning
• Updated • 2
edbeeching/atari_2B_atari_robotank_1111
Reinforcement Learning
• Updated • 6
edbeeching/atari_2B_atari_roadrunner_2222
Reinforcement Learning
• Updated • 4
edbeeching/atari_2B_atari_roadrunner_1111
Reinforcement Learning
• Updated • 2
edbeeching/atari_2B_atari_riverraid_2222
Reinforcement Learning
• Updated • 3
edbeeching/atari_2B_atari_riverraid_1111
Reinforcement Learning
• Updated • 4