Upload exp_c_tokenizer_ablation.json with huggingface_hub e06a0c9 verified ronnengmail commited on Apr 13
Replace model_arch.py with correct architecture (train_sft_3b.py) 42458aa verified ronnengmail commited on Apr 12
Fix architecture params: DIM=3072, DEPTH=26, VOCAB=32000, N_HEADS=24 028969d verified ronnengmail commited on Apr 12
Add model card, config, tokenizer, and architecture code ebf013f verified ronnengmail commited on Apr 12