aisingapore/llama3.1-70b-cpt-sea-lionv3-instruct

tainc committed on Dec 19, 2024

Commit 76a1f95 (verified) · 1 parent: af3b561

Update README.md

Files changed (1)

README.md +1 -1

@@ -108,7 +108,7 @@ Current SEA-LION models, including this commercially permissive release, have no
 ## Technical Specifications
 ### Fine-Tuning Details
-Llama3.1 70B CPT SEA-LIONv3 Instruct was tuned using a combination of a full parameter fine-tune, on-policy alignment, and model merges of the best performing checkpoints. The training process for fine-tuning was approximately 1024 GPU hours, on a single node of 8x H100-80GB GPUs.
+Llama3.1 70B CPT SEA-LIONv3 Instruct was tuned using a combination of a full parameter fine-tune, on-policy alignment, and model merges of the best performing checkpoints. The training process for fine-tuning was approximately 3200 GPU hours, on a single node of 8x H100-80GB GPUs.
 ## Data
 Llama3.1 70B CPT SEA-LIONv3 Instruct was trained on a wide range of synthetic instructions, alongside publicly available instructions hand-curated by the team with the assistance of native speakers. In addition, special care was taken to ensure that the datasets used had commercially permissive licenses through verification with the original data sources.
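
As a rough sanity check on the figure changed in this commit, GPU hours can be converted to approximate wall-clock time. The sketch below assumes that "GPU hours" means the number of GPUs multiplied by elapsed hours, and that all 8 GPUs on the node were in use for the whole run; the README states neither. The function name `wall_clock_hours` is hypothetical and used only for illustration.

```python
def wall_clock_hours(gpu_hours: float, num_gpus: int) -> float:
    """Convert total GPU hours to elapsed hours.

    Assumes every GPU is busy for the entire run (an assumption,
    not something the README confirms).
    """
    if num_gpus <= 0:
        raise ValueError("num_gpus must be positive")
    return gpu_hours / num_gpus


# Compare the old and new figures from the diff on a single 8-GPU node.
for label, gpu_hours in (("old", 1024), ("new", 3200)):
    hours = wall_clock_hours(gpu_hours, num_gpus=8)
    print(f"{label}: {gpu_hours} GPU hours ~ {hours:.0f} h ~ {hours / 24:.1f} days")
```

Under these assumptions, the corrected figure of 3200 GPU hours corresponds to about 400 hours of elapsed time (roughly 16.7 days), compared with 128 hours (roughly 5.3 days) for the earlier figure of 1024.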