Running LLM Training Estimator-Chinchilla Scaling Law 😻 GPU + Precision + Peak FLOPS + MFU → Training time & Loss
RikkaBotan/LFM2-350M-Cute-Friendly-Finetune-JP-GGUF Text Generation • 0.4B • Updated 8 days ago • 122