fractalego/samsumbot

	# What is SamSum Bot?
	This is a model fine-tuned on the [SamSum dataset](https://huggingface.co/datasets/samsum).
	However, instead of being trained to summarize conversations, the model is trained to predict a conversation given its summary.
	The prompt must have the following form:
	```python
	A partial summary of the conversation is:
	{summary}

	With the dialogue being:
	{dialogue}
	```

	where {summary} is free text such as
	```python
	John went out to buy groceries. He meets Jane on the way and they talk about the weather.
	```

	and {dialogue} is written one turn per line, each line prefixed with the name of the speaker. The model generates the text that follows the last speaker prefix, marked here as [PREDICTION]:
	```python
	John: Oh hi Jane.
	Jane: Nice to see you?
	John: The weather looks nice today
	Jane: [PREDICTION]
	```
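
The prompt format above can be assembled programmatically. The following is a minimal sketch, not code from the model's repository: the helper names `build_prompt` and `first_line` are hypothetical, and it assumes that the prompt should end with the next speaker's prefix (`Jane:`) and that only the first generated line is kept as the reply.

```python
def build_prompt(summary, turns, next_speaker):
    """Assemble a prompt in the format described above.

    `turns` is a list of (speaker, utterance) pairs; `next_speaker` is the
    character whose line the model is expected to predict.
    """
    dialogue_lines = [f"{speaker}: {utterance}" for speaker, utterance in turns]
    # Assumption: ending with "<speaker>:" lets the model continue that speaker's line.
    dialogue_lines.append(f"{next_speaker}:")
    dialogue = "\n".join(dialogue_lines)
    return (
        "A partial summary of the conversation is:\n"
        f"{summary}\n"
        "\n"
        "With the dialogue being:\n"
        f"{dialogue}"
    )


def first_line(generated):
    """Keep only the first line of the generated text (assumed post-processing)."""
    stripped = generated.strip()
    return stripped.splitlines()[0].strip() if stripped else ""


prompt = build_prompt(
    summary=(
        "John went out to buy groceries. "
        "He meets Jane on the way and they talk about the weather."
    ),
    turns=[
        ("John", "Oh hi Jane."),
        ("Jane", "Nice to see you?"),
        ("John", "The weather looks nice today"),
    ],
    next_speaker="Jane",
)
print(prompt)
```

The resulting string would be passed to the model as the generation prompt, and `first_line` applied to whatever text the model returns after it.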

	The system is based on GPT-J-6B by EleutherAI, [quantized by Hivemind](https://huggingface.co/hivemind/gpt-j-6B-8bit). It has been fine-tuned with the [LoRA method](https://arxiv.org/abs/2106.09685).
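
To give a sense of why LoRA suits a model of this size: instead of updating a full weight matrix of shape `d_out x d_in`, it trains two low-rank factors of shapes `d_out x r` and `r x d_in`. The sketch below only does the parameter arithmetic for one square projection at GPT-J-6B's hidden size of 4096. The rank `r = 8` is a hypothetical value chosen for illustration; the rank used for this model is not stated here.

```python
def lora_trainable_params(d_out, d_in, rank):
    """Parameters in the two low-rank factors B (d_out x rank) and A (rank x d_in)."""
    return rank * (d_out + d_in)


def full_params(d_out, d_in):
    """Parameters in the full weight matrix that LoRA leaves frozen."""
    return d_out * d_in


d = 4096  # GPT-J-6B hidden size
r = 8     # hypothetical rank, for illustration only

lora = lora_trainable_params(d, d, r)
full = full_params(d, d)
print(f"LoRA: {lora} trainable vs {full} frozen ({lora / full:.2%})")
```

Under these assumptions, the trainable factors amount to about 0.4% of the parameters of the matrix they adapt.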
	A simple back-end is available in [this repo](https://github.com/fractalego/samsum-bot), where the model is served using TorchServe.
	A terminal-like front-end interface is available [here](https://github.com/fractalego/samsumbot_client).
	This is the interface used on my website, [http://fractalego.io](http://fractalego.io).