lole25's picture
End of training
9153d7d verified
metadata
license: apache-2.0
library_name: peft
tags:
  - alignment-handbook
  - generated_from_trainer
  - trl
  - dpo
  - generated_from_trainer
base_model: mistralai/Mistral-7B-v0.1
datasets:
  - HuggingFaceH4/ultrafeedback_binarized
model-index:
  - name: zephyr-7b-gpo-update4-i0
    results: []

zephyr-7b-gpo-update4-i0

This model is a fine-tuned version of alignment-handbook/zephyr-7b-sft-qlora on the HuggingFaceH4/ultrafeedback_binarized dataset. It achieves the following results on the evaluation set:

  • Loss: 0.0239
  • Rewards/chosen: -0.0415
  • Rewards/rejected: -0.1266
  • Rewards/accuracies: 0.6600
  • Rewards/margins: 0.0851
  • Logps/rejected: -236.9313
  • Logps/chosen: -240.2975
  • Logits/rejected: -2.0900
  • Logits/chosen: -2.2780

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-06
  • train_batch_size: 2
  • eval_batch_size: 2
  • seed: 42
  • distributed_type: multi-GPU
  • gradient_accumulation_steps: 2
  • total_train_batch_size: 4
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: cosine
  • lr_scheduler_warmup_ratio: 0.1
  • num_epochs: 1

Training results

Training Loss Epoch Step Validation Loss Rewards/chosen Rewards/rejected Rewards/accuracies Rewards/margins Logps/rejected Logps/chosen Logits/rejected Logits/chosen
0.0706 0.01 100 0.0536 0.0012 0.0008 0.4925 0.0004 -211.4570 -231.7682 -2.1603 -2.3487
0.0614 0.01 200 0.0524 0.0029 -0.0002 0.5890 0.0031 -211.6427 -231.4156 -2.1610 -2.3494
0.0495 0.02 300 0.0499 0.0190 0.0096 0.5805 0.0093 -209.6874 -228.2110 -2.1645 -2.3531
0.065 0.03 400 0.0470 0.0242 0.0074 0.5980 0.0167 -210.1239 -227.1680 -2.1655 -2.3542
0.04 0.03 500 0.0416 -0.0511 -0.0901 0.6125 0.0390 -229.6261 -242.2272 -2.1418 -2.3291
0.0313 0.04 600 0.0413 -0.0451 -0.0781 0.6160 0.0330 -227.2309 -241.0243 -2.1400 -2.3279
0.0519 0.05 700 0.0408 0.0360 0.0023 0.6155 0.0336 -211.1453 -224.8123 -2.1490 -2.3374
0.034 0.05 800 0.0369 -0.0217 -0.0726 0.6125 0.0510 -226.1373 -236.3390 -2.1339 -2.3227
0.0343 0.06 900 0.0361 0.0040 -0.0439 0.6005 0.0478 -220.3831 -231.2078 -2.1160 -2.3030
0.0482 0.07 1000 0.0360 -0.0952 -0.1491 0.6100 0.0539 -241.4311 -251.0361 -2.1121 -2.2986
0.0316 0.07 1100 0.0415 -0.0564 -0.1248 0.6045 0.0684 -236.5681 -243.2760 -2.1106 -2.2966
0.0326 0.08 1200 0.0331 -0.0594 -0.1297 0.6405 0.0702 -237.5470 -243.8901 -2.1027 -2.2877
0.0313 0.09 1300 0.0313 -0.0027 -0.0614 0.6320 0.0587 -223.8952 -232.5400 -2.1139 -2.2994
0.0345 0.09 1400 0.0331 -0.0106 -0.0663 0.6275 0.0557 -224.8707 -234.1205 -2.0508 -2.2308
0.0629 0.1 1500 0.0340 -0.0537 -0.1284 0.6440 0.0747 -237.2957 -242.7522 -2.0009 -2.1776
0.0313 0.1 1600 0.0311 -0.0639 -0.1417 0.6310 0.0777 -239.9465 -244.7944 -2.0578 -2.2404
0.0287 0.11 1700 0.0303 -0.0281 -0.0938 0.6360 0.0657 -230.3665 -237.6215 -2.1022 -2.2867
0.0335 0.12 1800 0.0316 0.0007 -0.0533 0.6260 0.0540 -222.2785 -231.8674 -2.1158 -2.3011
0.0209 0.12 1900 0.0333 -0.0611 -0.1124 0.6400 0.0513 -234.0950 -244.2219 -2.1046 -2.2893
0.0183 0.13 2000 0.0302 -0.0622 -0.1366 0.6500 0.0744 -238.9349 -244.4466 -2.1213 -2.3085
0.0235 0.14 2100 0.0289 -0.0383 -0.1150 0.6475 0.0767 -234.6193 -239.6733 -2.0933 -2.2787
0.0401 0.14 2200 0.0284 -0.0577 -0.1367 0.6370 0.0790 -238.9556 -243.5481 -2.0898 -2.2757
0.0257 0.15 2300 0.0304 -0.0226 -0.1119 0.6300 0.0893 -233.9949 -236.5215 -2.0975 -2.2834
0.0339 0.16 2400 0.0306 0.0075 -0.0542 0.6350 0.0617 -222.4461 -230.5073 -2.1318 -2.3176
0.0132 0.16 2500 0.0312 -0.0022 -0.0606 0.6350 0.0584 -223.7263 -232.4370 -2.1390 -2.3256
0.0196 0.17 2600 0.0281 -0.0069 -0.0808 0.6500 0.0739 -227.7710 -233.3759 -2.1025 -2.2871
0.0317 0.18 2700 0.0280 -0.0329 -0.1059 0.6545 0.0730 -232.7858 -238.5806 -2.1090 -2.2942
0.036 0.18 2800 0.0279 -0.0178 -0.0869 0.6365 0.0691 -228.9888 -235.5567 -2.1050 -2.2897
0.0353 0.19 2900 0.0279 -0.0415 -0.1092 0.6445 0.0677 -233.4535 -240.3115 -2.1000 -2.2848
0.0259 0.2 3000 0.0289 -0.0379 -0.1213 0.6505 0.0834 -235.8732 -239.5773 -2.0886 -2.2741
0.0362 0.2 3100 0.0289 -0.1100 -0.1916 0.6485 0.0816 -249.9393 -254.0055 -2.0925 -2.2772
0.0319 0.21 3200 0.0283 -0.0527 -0.1321 0.6385 0.0794 -238.0300 -242.5391 -2.0741 -2.2569
0.0333 0.22 3300 0.0280 -0.0509 -0.1397 0.6535 0.0887 -239.5463 -242.1913 -2.0690 -2.2521
0.0347 0.22 3400 0.0285 -0.0420 -0.1146 0.6420 0.0726 -234.5293 -240.4102 -2.0931 -2.2767
0.025 0.23 3500 0.0277 -0.0120 -0.0953 0.6560 0.0833 -230.6713 -234.4121 -2.0685 -2.2513
0.0305 0.24 3600 0.0276 -0.0113 -0.0943 0.6520 0.0830 -230.4667 -234.2561 -2.0925 -2.2770
0.0331 0.24 3700 0.0283 -0.0661 -0.1472 0.6435 0.0812 -241.0565 -245.2157 -2.0870 -2.2712
0.0351 0.25 3800 0.0291 -0.0335 -0.1002 0.6410 0.0667 -231.6431 -238.6974 -2.1198 -2.3060
0.0164 0.26 3900 0.0280 -0.0089 -0.0886 0.6340 0.0797 -229.3295 -233.7921 -2.0933 -2.2780
0.0445 0.26 4000 0.0271 -0.0361 -0.1230 0.6390 0.0869 -236.2058 -239.2251 -2.0860 -2.2705
0.0176 0.27 4100 0.0289 -0.0496 -0.1127 0.6470 0.0630 -234.1423 -241.9269 -2.1377 -2.3253
0.0244 0.27 4200 0.0293 -0.0263 -0.0989 0.6425 0.0726 -231.3835 -237.2554 -2.1260 -2.3122
0.0378 0.28 4300 0.0267 0.0038 -0.0727 0.6440 0.0766 -226.1550 -231.2366 -2.0843 -2.2686
0.0135 0.29 4400 0.0273 -0.0216 -0.1048 0.6435 0.0832 -232.5620 -236.3245 -2.0998 -2.2857
0.0143 0.29 4500 0.0268 -0.0302 -0.1056 0.6375 0.0755 -232.7406 -238.0378 -2.0988 -2.2843
0.0268 0.3 4600 0.0270 -0.0210 -0.0904 0.6400 0.0694 -229.6838 -236.1970 -2.1062 -2.2917
0.026 0.31 4700 0.0272 -0.0808 -0.1659 0.6495 0.0851 -244.7986 -248.1638 -2.1119 -2.2987
0.0447 0.31 4800 0.0265 -0.0634 -0.1429 0.6465 0.0795 -240.1879 -244.6757 -2.1131 -2.2996
0.0311 0.32 4900 0.0269 -0.0418 -0.1304 0.6470 0.0886 -237.6830 -240.3570 -2.0728 -2.2562
0.0241 0.33 5000 0.0267 -0.0626 -0.1478 0.6425 0.0853 -241.1806 -244.5231 -2.0852 -2.2706
0.0183 0.33 5100 0.0266 -0.0589 -0.1374 0.6415 0.0785 -239.0941 -243.7824 -2.0980 -2.2840
0.0196 0.34 5200 0.0281 -0.1203 -0.1997 0.6440 0.0794 -251.5512 -256.0692 -2.1151 -2.3031
0.0218 0.35 5300 0.0284 -0.0855 -0.1675 0.6445 0.0821 -245.1141 -249.0969 -2.1199 -2.3078
0.0392 0.35 5400 0.0276 -0.0211 -0.0901 0.6320 0.0690 -229.6360 -236.2313 -2.1202 -2.3068
0.0095 0.36 5500 0.0278 -0.0108 -0.0838 0.6365 0.0729 -228.3683 -234.1733 -2.1144 -2.3003
0.0199 0.37 5600 0.0279 -0.0468 -0.1295 0.6430 0.0827 -237.5136 -241.3706 -2.0764 -2.2607
0.0237 0.37 5700 0.0267 -0.0323 -0.1215 0.6445 0.0891 -235.9061 -238.4741 -2.0452 -2.2280
0.0323 0.38 5800 0.0269 -0.0412 -0.1289 0.6460 0.0876 -237.3893 -240.2530 -2.0370 -2.2195
0.0242 0.39 5900 0.0260 -0.0303 -0.1115 0.6455 0.0812 -233.9047 -238.0558 -2.0427 -2.2254
0.0239 0.39 6000 0.0265 -0.0064 -0.0780 0.6395 0.0716 -227.2050 -233.2807 -2.0840 -2.2698
0.0246 0.4 6100 0.0266 -0.0466 -0.1195 0.6475 0.0728 -235.5066 -241.3312 -2.0964 -2.2834
0.0109 0.41 6200 0.0259 -0.0380 -0.1166 0.6420 0.0786 -234.9308 -239.6033 -2.0589 -2.2443
0.0289 0.41 6300 0.0258 -0.0286 -0.1078 0.6525 0.0791 -233.1673 -237.7339 -2.0557 -2.2405
0.0287 0.42 6400 0.0267 -0.0259 -0.1155 0.6430 0.0896 -234.7208 -237.1919 -2.0664 -2.2525
0.0631 0.43 6500 0.0259 -0.0313 -0.1091 0.6460 0.0778 -233.4391 -238.2719 -2.0895 -2.2759
0.037 0.43 6600 0.0260 -0.0094 -0.0871 0.6490 0.0777 -229.0337 -233.8820 -2.0997 -2.2868
0.0296 0.44 6700 0.0264 -0.0446 -0.1288 0.6565 0.0842 -237.3631 -240.9244 -2.1026 -2.2903
0.038 0.44 6800 0.0262 -0.0694 -0.1493 0.6565 0.0799 -241.4658 -245.8865 -2.0871 -2.2739
0.0458 0.45 6900 0.0261 -0.0352 -0.1124 0.6525 0.0772 -234.0974 -239.0529 -2.0925 -2.2798
0.0275 0.46 7000 0.0257 -0.0520 -0.1401 0.6535 0.0881 -239.6416 -242.4081 -2.0897 -2.2774
0.0175 0.46 7100 0.0255 -0.0397 -0.1193 0.6530 0.0795 -235.4656 -239.9513 -2.1058 -2.2933
0.035 0.47 7200 0.0260 -0.0543 -0.1267 0.6485 0.0724 -236.9568 -242.8715 -2.1193 -2.3083
0.015 0.48 7300 0.0257 -0.0871 -0.1622 0.6390 0.0751 -244.0609 -249.4324 -2.1123 -2.3009
0.0231 0.48 7400 0.0255 -0.0659 -0.1463 0.6490 0.0804 -240.8683 -245.1848 -2.1035 -2.2913
0.0211 0.49 7500 0.0258 -0.0631 -0.1462 0.6520 0.0831 -240.8420 -244.6235 -2.0635 -2.2485
0.0379 0.5 7600 0.0259 -0.0748 -0.1597 0.6475 0.0849 -243.5423 -246.9550 -2.0566 -2.2404
0.0117 0.5 7700 0.0257 -0.0554 -0.1408 0.6620 0.0854 -239.7720 -243.0760 -2.0661 -2.2502
0.0197 0.51 7800 0.0261 -0.0680 -0.1537 0.6590 0.0857 -242.3484 -245.6013 -2.0867 -2.2723
0.0296 0.52 7900 0.0253 -0.0680 -0.1488 0.6555 0.0808 -241.3649 -245.6047 -2.0900 -2.2762
0.0385 0.52 8000 0.0251 -0.0474 -0.1297 0.6500 0.0823 -237.5529 -241.4889 -2.0737 -2.2589
0.0295 0.53 8100 0.0249 -0.0725 -0.1568 0.6590 0.0842 -242.9643 -246.5116 -2.0447 -2.2293
0.0147 0.54 8200 0.0250 -0.0814 -0.1636 0.6455 0.0822 -244.3407 -248.2939 -2.0459 -2.2301
0.0166 0.54 8300 0.0254 -0.0635 -0.1415 0.6535 0.0780 -239.9138 -244.7009 -2.0618 -2.2466
0.0177 0.55 8400 0.0260 -0.0569 -0.1258 0.6505 0.0689 -236.7758 -243.3866 -2.0623 -2.2464
0.0323 0.56 8500 0.0247 -0.0606 -0.1478 0.6590 0.0872 -241.1788 -244.1342 -2.0510 -2.2352
0.0178 0.56 8600 0.0245 -0.0697 -0.1572 0.6610 0.0875 -243.0600 -245.9448 -2.0607 -2.2454
0.0473 0.57 8700 0.0247 -0.0695 -0.1535 0.6565 0.0840 -242.3023 -245.9043 -2.0663 -2.2518
0.0302 0.58 8800 0.0249 -0.0482 -0.1318 0.6610 0.0837 -237.9781 -241.6350 -2.0593 -2.2448
0.0391 0.58 8900 0.0248 -0.0637 -0.1548 0.6620 0.0911 -242.5767 -244.7529 -2.0658 -2.2522
0.0377 0.59 9000 0.0246 -0.0355 -0.1189 0.6575 0.0834 -235.3853 -239.0974 -2.0745 -2.2613
0.0296 0.6 9100 0.0249 -0.0387 -0.1166 0.6550 0.0779 -234.9412 -239.7537 -2.0871 -2.2747
0.0241 0.6 9200 0.0252 -0.0358 -0.1111 0.6575 0.0753 -233.8348 -239.1661 -2.1060 -2.2943
0.019 0.61 9300 0.0250 -0.0516 -0.1373 0.6580 0.0858 -239.0793 -242.3174 -2.1004 -2.2889
0.0247 0.62 9400 0.0251 -0.0712 -0.1504 0.6545 0.0792 -241.6835 -246.2362 -2.1041 -2.2926
0.0161 0.62 9500 0.0249 -0.0518 -0.1338 0.6485 0.0820 -238.3770 -242.3746 -2.0949 -2.2827
0.0198 0.63 9600 0.0250 -0.0282 -0.1124 0.6500 0.0842 -234.0898 -237.6352 -2.0913 -2.2787
0.0368 0.63 9700 0.0248 -0.0568 -0.1405 0.6585 0.0836 -239.7049 -243.3711 -2.0914 -2.2787
0.0214 0.64 9800 0.0248 -0.0559 -0.1371 0.6570 0.0811 -239.0298 -243.1945 -2.0971 -2.2844
0.0331 0.65 9900 0.0246 -0.0441 -0.1329 0.6600 0.0888 -238.1875 -240.8263 -2.0867 -2.2732
0.0316 0.65 10000 0.0246 -0.0573 -0.1474 0.6580 0.0901 -241.0922 -243.4642 -2.0770 -2.2634
0.0181 0.66 10100 0.0248 -0.0757 -0.1612 0.6670 0.0855 -243.8461 -247.1387 -2.0801 -2.2661
0.0159 0.67 10200 0.0245 -0.0638 -0.1550 0.6610 0.0912 -242.6056 -244.7626 -2.0611 -2.2463
0.018 0.67 10300 0.0244 -0.0590 -0.1447 0.6615 0.0857 -240.5506 -243.8084 -2.0698 -2.2554
0.0144 0.68 10400 0.0245 -0.0385 -0.1258 0.6605 0.0873 -236.7707 -239.7064 -2.0630 -2.2489
0.0273 0.69 10500 0.0244 -0.0431 -0.1273 0.6565 0.0842 -237.0745 -240.6274 -2.0678 -2.2537
0.0194 0.69 10600 0.0243 -0.0430 -0.1273 0.6635 0.0843 -237.0673 -240.6028 -2.0684 -2.2543
0.0199 0.7 10700 0.0244 -0.0439 -0.1259 0.6595 0.0820 -236.7907 -240.7807 -2.0696 -2.2556
0.0349 0.71 10800 0.0245 -0.0394 -0.1225 0.6585 0.0832 -236.1209 -239.8839 -2.0673 -2.2533
0.0294 0.71 10900 0.0246 -0.0459 -0.1264 0.6615 0.0805 -236.8899 -241.1904 -2.0696 -2.2554
0.0493 0.72 11000 0.0247 -0.0406 -0.1196 0.6555 0.0789 -235.5289 -240.1349 -2.0714 -2.2571
0.0186 0.73 11100 0.0246 -0.0362 -0.1179 0.6605 0.0817 -235.1986 -239.2465 -2.0741 -2.2600
0.0233 0.73 11200 0.0247 -0.0275 -0.1085 0.6585 0.0810 -233.3055 -237.5009 -2.0750 -2.2610
0.0218 0.74 11300 0.0244 -0.0370 -0.1200 0.6575 0.0831 -235.6197 -239.4001 -2.0764 -2.2629
0.0365 0.75 11400 0.0245 -0.0355 -0.1223 0.6580 0.0868 -236.0721 -239.1116 -2.0719 -2.2584
0.0199 0.75 11500 0.0246 -0.0318 -0.1118 0.6590 0.0800 -233.9702 -238.3574 -2.0827 -2.2695
0.0296 0.76 11600 0.0244 -0.0421 -0.1299 0.6665 0.0878 -237.5938 -240.4171 -2.0765 -2.2633
0.015 0.77 11700 0.0244 -0.0487 -0.1316 0.6600 0.0829 -237.9386 -241.7478 -2.0779 -2.2644
0.0127 0.77 11800 0.0244 -0.0598 -0.1424 0.6580 0.0826 -240.0971 -243.9701 -2.0787 -2.2653
0.0199 0.78 11900 0.0243 -0.0591 -0.1450 0.6605 0.0859 -240.6168 -243.8326 -2.0758 -2.2626
0.0313 0.79 12000 0.0244 -0.0559 -0.1424 0.6605 0.0865 -240.0914 -243.1773 -2.0797 -2.2669
0.0102 0.79 12100 0.0244 -0.0513 -0.1355 0.6560 0.0842 -238.7046 -242.2641 -2.0830 -2.2705
0.0325 0.8 12200 0.0243 -0.0456 -0.1291 0.6600 0.0835 -237.4338 -241.1291 -2.0835 -2.2709
0.028 0.8 12300 0.0243 -0.0493 -0.1364 0.6585 0.0872 -238.8947 -241.8556 -2.0821 -2.2695
0.0278 0.81 12400 0.0241 -0.0510 -0.1343 0.6600 0.0833 -238.4753 -242.2064 -2.0913 -2.2793
0.0142 0.82 12500 0.0241 -0.0540 -0.1371 0.6570 0.0831 -239.0412 -242.8141 -2.0913 -2.2793
0.0177 0.82 12600 0.0242 -0.0556 -0.1379 0.6580 0.0823 -239.1902 -243.1229 -2.0917 -2.2797
0.0133 0.83 12700 0.0242 -0.0496 -0.1314 0.6575 0.0819 -237.8956 -241.9153 -2.0933 -2.2814
0.0186 0.84 12800 0.0242 -0.0451 -0.1272 0.6565 0.0822 -237.0618 -241.0176 -2.0936 -2.2818
0.0117 0.84 12900 0.0241 -0.0397 -0.1232 0.6580 0.0835 -236.2435 -239.9395 -2.0908 -2.2790
0.0116 0.85 13000 0.0241 -0.0419 -0.1272 0.6580 0.0853 -237.0613 -240.3864 -2.0899 -2.2781
0.0338 0.86 13100 0.0241 -0.0404 -0.1232 0.6565 0.0828 -236.2545 -240.0884 -2.0941 -2.2824
0.0206 0.86 13200 0.0240 -0.0429 -0.1280 0.6590 0.0851 -237.2177 -240.5875 -2.0892 -2.2772
0.018 0.87 13300 0.0240 -0.0407 -0.1257 0.6600 0.0851 -236.7596 -240.1422 -2.0891 -2.2772
0.0275 0.88 13400 0.0240 -0.0392 -0.1234 0.6585 0.0842 -236.2926 -239.8449 -2.0904 -2.2786
0.0177 0.88 13500 0.0240 -0.0369 -0.1201 0.6580 0.0832 -235.6223 -239.3825 -2.0911 -2.2792
0.0225 0.89 13600 0.0240 -0.0405 -0.1255 0.6580 0.0850 -236.7200 -240.1148 -2.0913 -2.2794
0.0223 0.9 13700 0.0240 -0.0422 -0.1268 0.6595 0.0846 -236.9746 -240.4513 -2.0923 -2.2803
0.0302 0.9 13800 0.0240 -0.0416 -0.1272 0.6575 0.0857 -237.0577 -240.3201 -2.0900 -2.2780
0.0213 0.91 13900 0.0239 -0.0407 -0.1267 0.6605 0.0859 -236.9426 -240.1542 -2.0888 -2.2767
0.0221 0.92 14000 0.0239 -0.0425 -0.1287 0.6595 0.0862 -237.3506 -240.4969 -2.0892 -2.2772
0.0259 0.92 14100 0.0239 -0.0411 -0.1266 0.6585 0.0855 -236.9374 -240.2254 -2.0892 -2.2772
0.0156 0.93 14200 0.0239 -0.0419 -0.1278 0.6615 0.0859 -237.1707 -240.3793 -2.0891 -2.2771
0.0158 0.94 14300 0.0239 -0.0414 -0.1269 0.6600 0.0855 -237.0012 -240.2890 -2.0887 -2.2765
0.0216 0.94 14400 0.0239 -0.0413 -0.1268 0.6620 0.0856 -236.9817 -240.2556 -2.0895 -2.2774
0.0126 0.95 14500 0.0239 -0.0413 -0.1269 0.6605 0.0856 -237.0005 -240.2699 -2.0895 -2.2774
0.0346 0.96 14600 0.0239 -0.0416 -0.1269 0.6590 0.0853 -236.9897 -240.3241 -2.0901 -2.2781
0.0225 0.96 14700 0.0239 -0.0415 -0.1267 0.6605 0.0852 -236.9473 -240.3016 -2.0895 -2.2774
0.0099 0.97 14800 0.0239 -0.0415 -0.1268 0.6595 0.0853 -236.9750 -240.3092 -2.0891 -2.2771
0.0235 0.97 14900 0.0239 -0.0415 -0.1268 0.6585 0.0853 -236.9760 -240.2991 -2.0898 -2.2777
0.019 0.98 15000 0.0239 -0.0415 -0.1267 0.6610 0.0852 -236.9527 -240.3060 -2.0899 -2.2778
0.0368 0.99 15100 0.0239 -0.0415 -0.1267 0.6605 0.0852 -236.9458 -240.2961 -2.0904 -2.2784
0.0267 0.99 15200 0.0239 -0.0414 -0.1265 0.6580 0.0851 -236.9213 -240.2912 -2.0899 -2.2778

Framework versions

  • PEFT 0.7.1
  • Transformers 4.36.2
  • Pytorch 2.1.2+cu121
  • Datasets 2.14.6
  • Tokenizers 0.15.2