Model Card for Model ID
This is the Llama-3.2-1B model fine-tuned on a classification task using the SST-2 sentiment analysis dataset (both the model and the dataset were downloaded from ๐ค with proper permissions and authorization). This was done as a learning activity in the course CS613: Natural Language Processing, offered at Indian Institute of Technology Gandhinagar in the Autumn 2024 term.
Pipeline Followed for Fine-Tuning:
- Downloaded the Llama-3.2-1B model and its tokenizer from ๐ค.
- Downloaded the SST-2 dataset from ๐ค. Split it with a custom train-test split of 80:20.
- Tokenized the dataset.
- Added a classification layer to the model to make it suitable for classification (using AutoModelForSequenceClassification).
- Evaluated the performance of the pre-fine-tuned model on the tokenized test data.
- Fine-tuned the model using the tokenized data and evaluated its performance.
- Compared the performance pre- and post-fine-tuning.
- Downloads last month
- 78
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and
the model is not deployed on the HF Inference API.