testjpth

This model is a fine-tuned version of facebook/nllb-200-distilled-600M on the None dataset.

Model description

This is test version to translate Japanese to Thai. I use NLLB for this model.

Intended uses & limitations

This is just for the test concept of NLLB model

Training and evaluation data

The data was generated by other model. The dataset was split by intention to use in order to make the model understand some technical term.

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 1
  • eval_batch_size: 1
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 1

Framework versions

  • Transformers 4.30.2
  • Pytorch 2.0.1+cu118
  • Datasets 2.13.1
  • Tokenizers 0.13.3
Downloads last month
7
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.