Training complete

Files changed (3)

README.md CHANGED

@@ -13,10 +13,10 @@ should probably proofread and complete it, then remove this comment. -->
 # mamba_text_classification
-This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.8755
-- Accuracy: 0.7778
 ## Model description
@@ -35,28 +35,28 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 5e-05
-- train_batch_size: 2
-- eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.01
-- num_epochs: 1
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 2.0279 | 0.1 | 371 | 2.5090 | 0.1111 |
-| 1.3557 | 0.2 | 742 | 1.0745 | 0.6667 |
-| 1.5255 | 0.3 | 1113 | 1.1685 | 0.6667 |
-| 0.7389 | 0.4 | 1484 | 1.3240 | 0.5556 |
-| 0.9114 | 0.5 | 1855 | 1.2930 | 0.6667 |
-| 0.0422 | 0.6 | 2226 | 1.1987 | 0.6667 |
-| 1.5648 | 0.7 | 2597 | 0.5782 | 0.7778 |
-| 1.7356 | 0.8 | 2968 | 0.7707 | 0.6667 |
-| 0.0145 | 0.9 | 3339 | 0.8755 | 0.7778 |
 ### Framework versions

 # mamba_text_classification
+This model was trained from scratch on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6274
+- Accuracy: 1.0
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1e-06
+- train_batch_size: 4
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.01
+- num_epochs: 4
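
To make the new schedule settings concrete, here is a minimal sketch of a cosine schedule with linear warmup, using the updated `learning_rate: 1e-06` and `lr_scheduler_warmup_ratio: 0.01`. The function name `cosine_lr`, the use of `math.ceil` for the warmup length, and the example `total_steps=516` are assumptions made for illustration; the card does not state the total step count or the exact warmup rounding, so treat this as an approximation of what the trainer likely did rather than its actual code.

```python
import math


def cosine_lr(step, total_steps, base_lr=1e-06, warmup_ratio=0.01):
    """Learning rate at `step` for linear warmup followed by half-cosine decay.

    Assumption: warmup length is ceil(total_steps * warmup_ratio).
    """
    warmup_steps = math.ceil(total_steps * warmup_ratio)
    if step < warmup_steps:
        # Linear ramp from 0 up to base_lr.
        return base_lr * step / max(1, warmup_steps)
    # Fraction of the post-warmup phase that has elapsed, in [0, 1].
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return base_lr * 0.5 * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    total = 516  # hypothetical total step count, not stated in the card
    for s in (0, 3, 6, 261, 516):
        print(s, cosine_lr(s, total))
```

With these assumed values the warmup lasts 6 steps, the rate peaks at `1e-06` at step 6, and it decays to roughly zero by the final step.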
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| 0.6169 | 0.4 | 52 | 0.8997 | 0.0 |
+| 0.698 | 0.81 | 104 | 0.7669 | 0.0 |
+| 0.5953 | 1.21 | 156 | 0.6956 | 0.0 |
+| 0.5979 | 1.61 | 208 | 0.6580 | 1.0 |
+| 0.5949 | 2.02 | 260 | 0.6465 | 1.0 |
+| 0.6608 | 2.42 | 312 | 0.6321 | 1.0 |
+| 0.5082 | 2.82 | 364 | 0.6339 | 1.0 |
+| 0.578 | 3.22 | 416 | 0.6302 | 1.0 |
+| 0.6325 | 3.63 | 468 | 0.6274 | 1.0 |
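
The Epoch and Step columns of the new table are enough to estimate how many optimizer steps make up one epoch. The sketch below searches for integer step counts that reproduce every rounded epoch value in the added rows; the helper name `consistent_steps_per_epoch` is hypothetical, and the assumption that the Epoch column is `step / steps_per_epoch` rounded to two decimals is an inference, not something the card states.

```python
# (epoch, step) pairs copied from the added rows of the results table.
ROWS = [
    (0.4, 52), (0.81, 104), (1.21, 156), (1.61, 208), (2.02, 260),
    (2.42, 312), (2.82, 364), (3.22, 416), (3.63, 468),
]


def consistent_steps_per_epoch(rows, max_steps=1000):
    """Return every steps-per-epoch value n for which round(step / n, 2)
    matches the logged epoch in all rows (assumed two-decimal rounding)."""
    return [
        n
        for n in range(1, max_steps + 1)
        if all(abs(round(step / n, 2) - epoch) < 1e-9 for epoch, step in rows)
    ]


if __name__ == "__main__":
    print(consistent_steps_per_epoch(ROWS))  # prints [129]
```

Under that rounding assumption, 129 steps per epoch is the only value that fits, which would give about 516 steps over the 4 configured epochs.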
 ### Framework versions

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:80c2f54576acbd5a4b54a1cf2fffa8bcf687f5286f04a290b9a50d1bafd27b07
 size 516640282

 version https://git-lfs.github.com/spec/v1
+oid sha256:afad85180d6c7b577c15d78afba6909e76b495afa89e4b6f361ff91ca68fca65
 size 516640282
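
The pointer change above swaps the `oid` while the `size` stays at 516640282 bytes, so the file has new contents of identical length. As a rough sketch of how a downloaded `pytorch_model.bin` could be checked against such a pointer, the helpers below parse the pointer text and compare size and digest; `parse_lfs_pointer` and `matches_pointer` are hypothetical names, and the code assumes the simple `key value` line layout shown in the diff.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into its algorithm, digest, and size."""
    fields = dict(line.strip().split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {"algo": algo, "digest": digest, "size": int(fields["size"])}


def matches_pointer(path, pointer):
    """Return True if the file at `path` has the pointer's size and digest."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as handle:
        # Read in 1 MiB chunks so large checkpoints do not fill memory.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:afad85180d6c7b577c15d78afba6909e76b495afa89e4b6f361ff91ca68fca65
size 516640282
"""

if __name__ == "__main__":
    print(parse_lfs_pointer(NEW_POINTER))
```

Calling `matches_pointer("pytorch_model.bin", parse_lfs_pointer(NEW_POINTER))` on a local copy would then report whether it corresponds to this commit's weights.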

tokenizer.json CHANGED

@@ -1,11 +1,6 @@
 {
  "version": "1.0",
- "truncation": {
- "direction": "Right",
- "max_length": 512,
- "strategy": "LongestFirst",
- "stride": 0
- },
  "padding": null,
  "added_tokens": [
  {

 {
  "version": "1.0",
+ "truncation": null,
  "padding": null,
  "added_tokens": [
  {
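
With `"truncation": null`, the serialized tokenizer no longer caps inputs at 512 tokens on its own, so any length limit has to be applied by the caller. The sketch below shows one plausible reading of the removed settings for a single sequence of token IDs; `apply_truncation` is a hypothetical helper, it ignores `strategy` and `stride` (which matter for sequence pairs and overflow windows), and it is not the tokenizer library's own implementation.

```python
# The truncation block removed from tokenizer.json in this commit.
OLD_TRUNCATION = {
    "direction": "Right",
    "max_length": 512,
    "strategy": "LongestFirst",
    "stride": 0,
}


def apply_truncation(token_ids, truncation):
    """Truncate a single sequence according to a tokenizer.json-style block.

    `truncation=None` mirrors the new config: the sequence is returned as is.
    """
    if truncation is None:
        return list(token_ids)
    max_length = truncation["max_length"]
    if truncation.get("direction", "Right") == "Left":
        # Drop tokens from the start, keeping the last max_length tokens.
        return list(token_ids[-max_length:]) if max_length else []
    # Drop tokens from the end, keeping the first max_length tokens.
    return list(token_ids[:max_length])


if __name__ == "__main__":
    ids = list(range(600))
    print(len(apply_truncation(ids, OLD_TRUNCATION)))  # prints 512
    print(len(apply_truncation(ids, None)))  # prints 600
```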