Fix Conv1D weight transposition for HF GPT-2 compatibility 5061c63 verified drzo committed 22 days ago
Deploy NanEcho CI checkpoint (4L/4H/256E, 200 iters, val_loss=1.9258) ee242c7 verified drzo committed 22 days ago
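
Note on the Conv1D commit above: the diff itself is not shown here, so the following is only a sketch of the layout mismatch the message most likely refers to. Hugging Face's GPT-2 implementation uses a `Conv1D` layer whose weight is stored as `(in_features, out_features)`, whereas `torch.nn.Linear` stores `(out_features, in_features)`. Assuming the checkpoint was trained with Linear-style layers, each affected weight has to be transposed before loading. The helper names below (`transpose`, `linear_forward`, `conv1d_forward`) are hypothetical plain-Python stand-ins, not code from this repository.

```python
# Sketch only: plain-Python stand-ins illustrating the two weight layouts.
# Nothing here is taken from the repository's actual conversion script.

def transpose(matrix):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(column) for column in zip(*matrix)]

def linear_forward(x, weight, bias):
    """Linear convention: weight is (out_features, in_features), y = x @ weight.T + bias."""
    return [sum(xi * wi for xi, wi in zip(x, row)) + b
            for row, b in zip(weight, bias)]

def conv1d_forward(x, weight, bias):
    """HF GPT-2 Conv1D convention: weight is (in_features, out_features), y = x @ weight + bias."""
    return [sum(xi * row[j] for xi, row in zip(x, weight)) + b
            for j, b in enumerate(bias)]

# Toy layer with in_features=3 and out_features=2, stored Linear-style.
linear_weight = [[1.0, 2.0, 3.0],
                 [4.0, 5.0, 6.0]]
bias = [0.5, -0.5]
x = [1.0, 0.0, -1.0]

# Transposing converts the Linear layout into the Conv1D layout.
conv1d_weight = transpose(linear_weight)

y_linear = linear_forward(x, linear_weight, bias)
y_conv1d = conv1d_forward(x, conv1d_weight, bias)
```

Both forward passes give the same output once the weight is transposed. Loading the untransposed weight into a Conv1D-style layer would either fail on a shape check or, for square matrices, load silently and produce wrong activations.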