TechxGenus commited on
Commit
690f8ff
·
verified ·
1 Parent(s): dc3c887

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +56 -0
README.md ADDED
@@ -0,0 +1,56 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ license: apache-2.0
4
+ tags:
5
+ - jamba
6
+ - mamba
7
+ - moe
8
+ ---
9
+
10
+ ### Mini-Jamba
11
+
12
+ [**Experimental Version**] We initialized the model according to [Jamba](https://huggingface.co/ai21labs/Jamba-v0.1), but with much smaller parameters. It was then trained using about 1B of python code, and has the simplest python code generation capabilities.
13
+
14
+ ### Usage
15
+
16
+ Here give some examples of how to use our model:
17
+
18
+ ```python
19
+ from transformers import AutoTokenizer, AutoModelForCausalLM
20
+
21
+ prompt = '''def min(arr):
22
+ """
23
+ Returns the minimum value from the list `arr`.
24
+
25
+ Parameters:
26
+ - arr (list): A list of numerical values.
27
+
28
+ Returns:
29
+ - The minimum value in `arr`.
30
+ """
31
+ '''
32
+
33
+ tokenizer = AutoTokenizer.from_pretrained(
34
+ "TechxGenus/Mini-Jamba",
35
+ trust_remote_code=True,
36
+ )
37
+ tokenizer.pad_token = tokenizer.eos_token
38
+ model = AutoModelForCausalLM.from_pretrained(
39
+ "TechxGenus/Mini-Jamba",
40
+ torch_dtype=torch.float16,
41
+ device_map="auto",
42
+ trust_remote_code=True,
43
+ )
44
+ inputs = tokenizer.encode(prompt, return_tensors="pt")
45
+ outputs = model.generate(
46
+ input_ids=inputs.to(model.device),
47
+ max_new_tokens=64,
48
+ do_sample=False,
49
+ )
50
+ print(tokenizer.decode(outputs[0], skip_special_tokens=True))
51
+
52
+ ```
53
+
54
+ ### Note
55
+
56
+ Model may sometimes make errors, produce misleading contents, or struggle to manage tasks that are not related to coding. It has undergone very limited testing. Additional safety testing should be performed before any real-world deployments.