Models for RQ1 and RQ2, adapted from the byte-fied Llama at https://huggingface.co/benjamin/Llama3-2-3B-IT-Byte
Andreas Grivas
agrv
models 61
agrv/llama-lr-3e-4-no-lora-cp-n-8-r-8
agrv/llama-lr-3e-4-no-lora-hmm-n-8-r-32
agrv/llama-lr-3e-4-no-lora-cp-n-8-r-32
agrv/llama-lr-3e-4-no-lora-cp-n-8-r-16
agrv/evabyte-lr-3e-4-lora-last-2-cont-btree-n-16-r-32-s-1
agrv/llama-lr-3e-4-lora-last-1-cont-btree-n-8-r-32-s-1
agrv/llama-lr-3e-4-no-lora-cont-btree-n-8-r-32-s-1
agrv/llama-lr-3e-4-lora-last-2-cont-btree-n-16-r-32-s-1
agrv/llama-lr-3e-4-lora-last-4-cont-btree-n-8-r-32-s-1
agrv/llama-lr-3e-4-no-lora-cont-ff-n-16-r-1