akoksal
/

LongForm-OPT-2.7B

Text2Text Generation

text-generation

instruction-tuning

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

akoksal commited on Apr 26, 2023

Commit

2649f69

•

1 Parent(s): 0b718d9

Update README.md

Files changed (1) hide show

README.md +18 -0

README.md CHANGED Viewed

@@ -53,6 +53,24 @@ LongForm-**T5-XL**: https://huggingface.co/akoksal/LongForm-T5-XL
 LongForm-**OPT-6.7B**: https://huggingface.co/akoksal/LongForm-OPT-6.7B
 ## Evaluation
 We provide in-depth evaluation of LongForm models and baselines in the paper. We present the METEOR scores of models in out-of-domain datasets. In all tasks, Recipe Generation (RGen), long-form question answering (ELI5), short story generation (WritingPrompts/WP), LongForm models outperform prior instruction-tuned models.

 LongForm-**OPT-6.7B**: https://huggingface.co/akoksal/LongForm-OPT-6.7B
+## How to Load
+```python
+import torch
+from transformers import AutoTokenizer, AutoModelForCausalLM
+model = AutoModelForCausalLM.from_pretrained("akoksal/LongForm-OPT-2.7B")
+tokenizer = AutoTokenizer.from_pretrained("akoksal/LongForm-OPT-2.7B")
+instruction = "Write an essay about meditation. [EOI]"
+torch.manual_seed(42)
+input_ids = tokenizer(instruction, return_tensors="pt").input_ids
+target_ids = model.generate(input_ids, do_sample=True, max_new_tokens=50, top_p=0.9)
+tokenizer.decode(target_ids[0])
+# Output:
+# > </s>Write an essay about meditation. [EOI]Do you need some inspiration to\
+# meditate? Do you know someone who is a great meditator but you aren't sure\
+# what to say to them? This might be the perfect opportunity to tell them.\
+# The ability to listen and learn and grow can
+```
 ## Evaluation
 We provide in-depth evaluation of LongForm models and baselines in the paper. We present the METEOR scores of models in out-of-domain datasets. In all tasks, Recipe Generation (RGen), long-form question answering (ELI5), short story generation (WritingPrompts/WP), LongForm models outperform prior instruction-tuned models.