TechxGenus/Mini-Jamba-v2


TechxGenus committed on Mar 31, 2024

Commit 35b1d48 · verified · 1 Parent(s): 7f137f4

Upload README.md

Files changed (1)

README.md +3 -3


@@ -7,7 +7,7 @@ tags:
 - moe
 ---
-### Mini-Jamba
+### Mini-Jamba-v2
 [**Experimental Version**] We initialized the model according to [Jamba](https://huggingface.co/ai21labs/Jamba-v0.1), but with much smaller parameters. It was then trained using about 1B of python code, and has the simplest python code generation capabilities.
@@ -31,12 +31,12 @@ prompt = '''def min(arr):
 '''
 tokenizer = AutoTokenizer.from_pretrained(
-    "TechxGenus/Mini-Jamba",
+    "TechxGenus/Mini-Jamba-v2",
     trust_remote_code=True,
 )
 tokenizer.pad_token = tokenizer.eos_token
 model = AutoModelForCausalLM.from_pretrained(
-    "TechxGenus/Mini-Jamba",
+    "TechxGenus/Mini-Jamba-v2",
     torch_dtype=torch.float16,
     device_map="auto",
     trust_remote_code=True,
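
The commit has to update the repository id in two separate calls. As a minimal sketch (not part of the README), the id could be held in one constant so a future rename touches a single line. The names `MODEL_ID`, `load_model_and_tokenizer`, and `complete`, and the `max_new_tokens` value, are assumptions for illustration. The `from_pretrained` arguments mirror the ones visible in the diff.

```python
# Hypothetical refactor of the README usage snippet: one constant for the repo id.
MODEL_ID = "TechxGenus/Mini-Jamba-v2"


def load_model_and_tokenizer(model_id: str = MODEL_ID):
    """Load the tokenizer and model with the same arguments shown in the diff."""
    # Imports are deferred so this module can be imported without torch installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
    # The README reuses the EOS token for padding.
    tokenizer.pad_token = tokenizer.eos_token
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.float16,
        device_map="auto",
        trust_remote_code=True,
    )
    return tokenizer, model


def complete(prompt: str, max_new_tokens: int = 64) -> str:
    """Generate a completion for `prompt`; the token budget is an assumed default."""
    tokenizer, model = load_model_and_tokenizer()
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Example call (downloads the model and runs its remote code):
# print(complete("def min(arr):\n"))
```

Because `trust_remote_code=True` executes code shipped in the model repository, it is worth reviewing that code before running the example.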