ChatGPT and T5 Base Paraphraser

This model is a fine-tuned version of the T5 transformer model designed for paraphrasing questions using the ChatGPT architecture.

Model Description

The chat_gpt_and_t5_base_paraphraser model is trained to generate paraphrased versions of input questions by utilizing a sequence-to-sequence approach. The model leverages the T5 architecture and has been fine-tuned on the Quora Question-Answer dataset to improve its ability to create diverse and meaningful paraphrases.

Intended Use

This model is intended for applications where paraphrasing of text is required, such as:

  • Chatbots
  • Question-answering systems
  • Content generation
  • Educational tools

How to Use

To use the model, install the Hugging Face transformers library and follow these steps:

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Load the model and tokenizer
model_name = "jaesani/chat_gpt_and_t5_base_paraphraser"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

def paraphrase(question, max_length=128):
    input_ids = tokenizer(f'paraphrase: {question}', return_tensors="pt", padding="longest", max_length=max_length, truncation=True).input_ids
    outputs = model.generate(input_ids, max_length=max_length)
    return tokenizer.decode(outputs[0], skip_special_tokens=True)

# Example usage
paraphrased_text = paraphrase("What are the best places to see in New York?")
print(paraphrased_text)

Training Data

The model was fine-tuned using the Quora Question-Answer Dataset, which consists of pairs of questions that may or may not be paraphrases of each other.

Evaluation

The model's performance can be evaluated based on the diversity and coherence of the paraphrases it generates. Specific metrics can include BLEU scores and human evaluations for semantic similarity.

Limitations

The model may produce paraphrases that are not contextually relevant. It may struggle with highly technical or domain-specific language. Generated paraphrases might be similar for closely related input questions.

License

This model is licensed under MIT License.

Downloads last month
11
Safetensors
Model size
223M params
Tensor type
F32
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for jaesani/chat_gpt_and_t5_base_paraphraser

Finetuned
(4)
this model

Dataset used to train jaesani/chat_gpt_and_t5_base_paraphraser