Trainee2Trainer - a LARK-Lab Collection

LARK-Lab 's Collections

Trainee2Trainer

Trainee2Trainer

updated about 13 hours ago

This is the checkpoints and dataset for: From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning

From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning

Paper • 2606.17682 • Published 3 days ago • 14
LARK-Lab/Trainee2Trainer

Text Generation • 4B • Updated about 13 hours ago • 1
LARK-Lab/MAPF-FrozenLake-Benchmark

Updated about 13 hours ago • 1