---
datasets:
- tatsu-lab/alpaca
language:
- en
library_name: transformers
tags:
- peft
- lora
- instruct
- alpaca
- gptj
---
# Instruct-GPT-J

The [EleutherAI/gpt-j-6B](https://hf.co/EleutherAI/gpt-j-6B) model fine-tuned on the [Alpaca](https://huggingface.co/datasets/tatsu-lab/alpaca) instruction dataset with [low-rank adaptation (LoRA)](https://arxiv.org/abs/2106.09685).

## Use:
```python
import torch

# ... (model and tokenizer loading omitted here) ...

def prompt(instruction, input=''):
    return f"Below is an instruction that describes a task. Write a response that appropriately completes the request. ### Instruction: {instruction} ### Input: {input} ### Response: "
```

Here, `input` is optional text for the model to act on, as directed by the instruction.
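For a quick illustration of what this template produces, here is a self-contained sketch; it repeats the `prompt` helper from the snippet above, and the instruction and input strings are made-up examples:

```python
def prompt(instruction, input=''):
    # Same template as in the usage snippet above.
    return f"Below is an instruction that describes a task. Write a response that appropriately completes the request. ### Instruction: {instruction} ### Input: {input} ### Response: "

# Build a prompt from an example instruction and input.
text = prompt("Summarize the following text.", input="GPT-J is a 6B-parameter model.")
print(text)
```

The rendered string ends with `### Response: `, so the model's reply is whatever it generates after that marker.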

### Citations

```bibtex
@misc{gpt-j,
  author = {Wang, Ben and Komatsuzaki, Aran},
  title = {{GPT-J-6B: A 6 Billion Parameter Autoregressive Language Model}},
  howpublished = {\url{https://github.com/kingoflolz/mesh-transformer-jax}},
  year = 2021,
  month = May
}
```
+
```bibtex
|
67 |
+
@misc{alpaca,
|
68 |
+
author = {Rohan Taori and Ishaan Gulrajani and Tianyi Zhang and Yann Dubois and Xuechen Li and Carlos Guestrin and Percy Liang and Tatsunori B. Hashimoto },
|
69 |
+
title = {Stanford Alpaca: An Instruction-following LLaMA model},
|
70 |
+
year = {2023},
|
71 |
+
publisher = {GitHub},
|
72 |
+
journal = {GitHub repository},
|
73 |
+
howpublished = {\url{https://github.com/tatsu-lab/stanford_alpaca}},
|
74 |
+
}
|
75 |
+
```
|