RemBERT SQuAD2

This model is fine-tuned for extractive question answering (QA) on SQuAD 2.0, starting from the RemBERT checkpoint.

Hyperparameters

Batch Size = 4
Grad Accumulation Steps = 8
Total epochs = 3
MLM Checkpoint = "rembert"
max_seq_len = 256
learning_rate = 1e-5
lr_schedule = LinearWarmup
warmup_ratio = 0.1
doc_stride = 128
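
The batch and schedule settings above can be read together. The sketch below shows how the effective batch size follows from the batch size and gradient accumulation steps, and what a linear warmup schedule with a 0.1 warmup ratio could look like. It rests on assumptions not stated in this card: training on a single device, and a linear decay to zero after warmup. The function names and the `total_steps` value in the usage note are hypothetical and only for illustration.

```python
def effective_batch_size(batch_size, grad_accum_steps, num_devices=1):
    """Examples contributing to each optimizer update.

    num_devices=1 is an assumption; the card does not state the device count.
    """
    return batch_size * grad_accum_steps * num_devices


def linear_warmup_lr(step, total_steps, peak_lr=1e-5, warmup_ratio=0.1):
    """Learning rate at a given optimizer step.

    Ramps linearly from 0 to peak_lr over the first warmup_ratio of training,
    then (assumed here, not confirmed by the card) decays linearly to 0.
    """
    warmup_steps = int(total_steps * warmup_ratio)
    if warmup_steps > 0 and step < warmup_steps:
        return peak_lr * step / warmup_steps
    remaining = max(total_steps - warmup_steps, 1)
    return peak_lr * max(0.0, (total_steps - step) / remaining)


if __name__ == "__main__":
    print(effective_batch_size(4, 8))
    for s in (0, 50, 100, 550, 1000):
        print(s, linear_warmup_lr(s, total_steps=1000))
```

With the listed values, `effective_batch_size(4, 8)` gives 32 examples per optimizer update under the single-device assumption. The `total_steps=1000` figure is a placeholder; the real number depends on how many features the training set produces after windowing with `max_seq_len` and `doc_stride`.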

SQuAD 2.0 evaluation stats:

Metrics were generated with the official SQuAD 2.0 evaluation script.

{
  "exact": 84.51107554956624,
  "f1": 87.46644042781853,
  "total": 11873,
  "HasAns_exact": 80.97165991902834,
  "HasAns_f1": 86.89086491219469,
  "HasAns_total": 5928,
  "NoAns_exact": 88.04037005887301,
  "NoAns_f1": 88.04037005887301,
  "NoAns_total": 5945
}
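
As a quick sanity check on the numbers above, the overall `exact` and `f1` scores should equal the count-weighted average of the `HasAns` and `NoAns` scores. The sketch below recomputes them from the reported values; the helper name `weighted_overall` is hypothetical and is not part of the official evaluation script.

```python
# Metrics copied from the evaluation output reported in this card.
metrics = {
    "exact": 84.51107554956624,
    "f1": 87.46644042781853,
    "total": 11873,
    "HasAns_exact": 80.97165991902834,
    "HasAns_f1": 86.89086491219469,
    "HasAns_total": 5928,
    "NoAns_exact": 88.04037005887301,
    "NoAns_f1": 88.04037005887301,
    "NoAns_total": 5945,
}


def weighted_overall(m, key):
    """Recombine the HasAns/NoAns scores for `key` into an overall score."""
    has_ans = m[f"HasAns_{key}"] * m["HasAns_total"]
    no_ans = m[f"NoAns_{key}"] * m["NoAns_total"]
    return (has_ans + no_ans) / m["total"]


if __name__ == "__main__":
    for key in ("exact", "f1"):
        print(key, weighted_overall(metrics, key), metrics[key])
```

`NoAns_exact` and `NoAns_f1` are identical because a prediction on an unanswerable question scores either 1 or 0 under both metrics.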

For any questions, you can reach out to me on Twitter.
