Fix architecture params: DIM=3072, DEPTH=26, VOCAB=32000, N_HEADS=24 028969d verified ronnengmail commited on Apr 12
Add model card, config, tokenizer, and architecture code ebf013f verified ronnengmail commited on Apr 12