model

This model is a fine-tuned version of google/flan-t5-base on an unspecified dataset. It achieves the following results on the evaluation set (a sketch of how such metrics are typically computed follows the list):

  • Loss: 1.3753
  • Rouge1: 36.5433
  • Rouge2: 30.999
  • Rougel: 36.5699
  • Rougelsum: 36.5459
  • Gen Len: 19.0
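
The ROUGE scores above are reported on a 0–100 scale. As a hedged illustration only (the example texts below are placeholders, not this model's data), this is how such metrics are typically computed with the `evaluate` library:

```python
# Hedged sketch: typical ROUGE computation for a summarization-style eval.
# The predictions/references here are placeholders, not this model's outputs.
import evaluate

rouge = evaluate.load("rouge")

predictions = ["the cat sat on the mat"]
references = ["a cat was sitting on the mat"]

scores = rouge.compute(predictions=predictions, references=references, use_stemmer=True)

# evaluate returns fractions in [0, 1]; model cards usually report them * 100.
print({k: round(v * 100, 4) for k, v in scores.items()})
```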

Model description

More information needed

Intended uses & limitations

More information needed
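
In the absence of documented usage, here is a minimal, hedged sketch of loading the checkpoint as an ordinary seq2seq Transformers model. The repo id `rvpandya93/model`, the `summarize:` task prefix, and the generation length are assumptions, since the training task is not documented in this card:

```python
# Hedged usage sketch; the task prefix and generation settings are assumptions.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

model_id = "rvpandya93/model"  # assumed repo id for this card
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

text = "summarize: <your input text here>"  # task prefix is an assumption
inputs = tokenizer(text, return_tensors="pt", truncation=True)

# The reported Gen Len of 19.0 suggests short outputs; max_new_tokens=20 is a guess.
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```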

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training (a hedged reproduction sketch follows the list):

  • learning_rate: 0.0001
  • train_batch_size: 32
  • eval_batch_size: 32
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 3
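
As a reproduction sketch only, the hyperparameters above map onto `Seq2SeqTrainingArguments` roughly as follows. The output directory, evaluation cadence, and `predict_with_generate` flag are assumptions, and the dataset/preprocessing setup is omitted because it is not documented here:

```python
# Hedged sketch of the training configuration; dataset and collator setup are
# omitted because they are not documented in this card.
from transformers import Seq2SeqTrainingArguments

training_args = Seq2SeqTrainingArguments(
    output_dir="model",                 # assumed
    learning_rate=1e-4,
    per_device_train_batch_size=32,
    per_device_eval_batch_size=32,
    seed=42,
    num_train_epochs=3,
    lr_scheduler_type="linear",
    evaluation_strategy="epoch",        # assumed from the per-epoch results table
    predict_with_generate=True,         # assumed, since ROUGE/Gen Len are reported
    # Adam betas=(0.9, 0.999) and epsilon=1e-08 are the Trainer defaults.
)
```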

Training results

| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
| No log        | 1.0   | 469  | 1.4340          | 36.6097 | 31.0677 | 36.6375 | 36.6304   | 19.0    |
| 1.5464        | 2.0   | 938  | 1.3869          | 36.5788 | 31.0133 | 36.5888 | 36.5877   | 19.0    |
| 1.2698        | 3.0   | 1407 | 1.3753          | 36.5433 | 30.999  | 36.5699 | 36.5459   | 19.0    |

Framework versions

  • Transformers 4.34.0.dev0
  • Pytorch 2.0.1+cu117
  • Datasets 2.14.5
  • Tokenizers 0.14.0