Model Card for Model ID

This is the Llama-3.2-1B model fine-tuned on a classification task using the SST-2 sentiment analysis dataset (both the model and the dataset were downloaded from ๐Ÿค— with proper permissions and authorization). This was done as a learning activity in the course CS613: Natural Language Processing, offered at Indian Institute of Technology Gandhinagar in the Autumn 2024 term.

Pipeline Followed for Fine-Tuning:

  1. Downloaded the Llama-3.2-1B model and its tokenizer from ๐Ÿค—.
  2. Downloaded the SST-2 dataset from ๐Ÿค—. Split it with a custom train-test split of 80:20.
  3. Tokenized the dataset.
  4. Added a classification layer to the model to make it suitable for classification (using AutoModelForSequenceClassification).
  5. Evaluated the performance of the pre-fine-tuned model on the tokenized test data.
  6. Fine-tuned the model using the tokenized data and evaluated its performance.
  7. Compared the performance pre- and post-fine-tuning.
Downloads last month
78
Safetensors
Model size
1.24B params
Tensor type
F32
ยท
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.