vihangd
/

bengali-dolly-alpaca-lora-7b

Model card Files Files and versions Community

Vihang D commited on May 9, 2023

Commit

5242895

•

1 Parent(s): 16cfed1

Add bengali lora model

Files changed (3) hide show

README.md +90 -0
adapter_config.json +21 -0
adapter_model.bin +3 -0

README.md CHANGED Viewed

@@ -1,3 +1,93 @@
 ---
 license: other
 ---

 ---
 license: other
 ---
+# Hugging Face Model - Bengali Finetuned
+This repository contains a Hugging Face model that has been fine-tuned on a Bengali dataset. The model uses the `peft` library for generating responses.
+## Usage
+To use the model, first import the necessary libraries:
+```python
+from peft import PeftModel
+from transformers import LlamaTokenizer, LlamaForCausalLM, GenerationConfig
+```
+Next, load the tokenizer and model:
+```python
+tokenizer = LlamaTokenizer.from_pretrained("yahma/llama-7b-hf")
+model = LlamaForCausalLM.from_pretrained(
+ "yahma/llama-7b-hf",
+ load_in_8bit=True,
+ device_map="auto",
+)
+```
+Then, load the `PeftModel` with the specified pre-trained model and path to the peft model:
+```python
+model = PeftModel.from_pretrained(model, "./bengali-dolly-alpaca-lora-7b")
+```
+Next, define a function to generate a prompt:
+```python
+def generate_prompt(instruction, input=None):
+ if input:
+ return f"""Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.
+### Instruction:
+{instruction}
+### Input:
+{input}
+### Response:"""
+ else:
+ return f"""Below is an instruction that describes a task. Write a response that appropriately completes the request.
+### Instruction:
+{instruction}
+### Response:"""
+```
+Finally, define a function to evaluate the model:
+```python
+generation_config = GenerationConfig(
+ temperature=0.1,
+ top_p=0.75,
+ num_beams=4,
+)
+def evaluate(model, instruction, input=None):
+ prompt = generate_prompt(instruction, input)
+ inputs = tokenizer(prompt, return_tensors="pt")
+ input_ids = inputs["input_ids"].cuda()
+ generation_output = model.generate(
+ input_ids=input_ids,
+ generation_config=generation_config,
+ return_dict_in_generate=True,
+ output_scores=True,
+ max_new_tokens=256
+ )
+ for s in generation_output.sequences:
+ output = tokenizer.decode(s)
+ print("Response:", output.split("### Response:")[1].strip())
+instruct =input("Instruction: ")
+evaluate(model, instruct)
+```
+To generate a response, simply run the `evaluate` function with an instruction and optional input:
+```python
+instruct = "Write a response that appropriately completes the request."
+input = "This is a sample input."
+evaluate(model, instruct, input)
+```
+This will output a response that completes the request.

adapter_config.json ADDED Viewed

	@@ -0,0 +1,21 @@

+{
+ "base_model_name_or_path": "yahma/llama-7b-hf",
+ "bias": "none",
+ "enable_lora": null,
+ "fan_in_fan_out": false,
+ "inference_mode": true,
+ "init_lora_weights": true,
+ "lora_alpha": 16,
+ "lora_dropout": 0.05,
+ "merge_weights": false,
+ "modules_to_save": null,
+ "peft_type": "LORA",
+ "r": 16,
+ "target_modules": [
+ "q_proj",
+ "k_proj",
+ "v_proj",
+ "o_proj"
+ ],
+ "task_type": "CAUSAL_LM"
+}

adapter_model.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d348d189011539f0e36e32503fb33fb62283b8800bd54462d859e1eef6c1ff0f
+size 67201357