---
license: apache-2.0
datasets:
- timdettmers/openassistant-guanaco
- databricks/databricks-dolly-15k
---
# Mastermax Llama 7B

This is a Llama 2 7B base model that was fine-tuned on additional datasets in an attempt to improve performance.

## How to use with the Hugging Face pipeline

```
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

# Load the model in 4-bit to reduce memory usage (requires the bitsandbytes package)
model = AutoModelForCausalLM.from_pretrained(
    "lifeofcoding/mastermax-llama-7b",
    load_in_4bit=True)
tokenizer = AutoTokenizer.from_pretrained("lifeofcoding/mastermax-llama-7b", trust_remote_code=True)

# Generate text using the pipeline
pipe = pipeline(task="text-generation",
                model=model,
                tokenizer=tokenizer,
                max_length=200)

# Wrap the prompt in the Llama 2 instruction format
prompt = "Tell me about llamas."
result = pipe(f"<s>[INST] {prompt} [/INST]")
generated_text = result[0]['generated_text']
```
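
The pipeline returns the prompt together with the completion, so you may want to keep only the model's reply. A minimal sketch, assuming the Llama 2 `[INST]` wrapper used above:

```
# The generated text echoes the prompt; split on the closing
# instruction tag to keep only the model's reply
response = generated_text.split("[/INST]", 1)[-1].strip()
print(response)
```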



- **Developed by:** [LifeOfCoding](https://www.livingthecode.life/)
- **Finetuned from model:** [GuardRail Llama2 7B Sharded](https://huggingface.co/guardrail/llama-2-7b-guanaco-instruct-sharded)