---
base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
library_name: peft
datasets:
- sanaa-11/math-dataset
---
# Model Card for LLaMA 3.1 Fine-Tuned Model
## Model Details
### Model Description
- **Developed by**: Sanaa Abril
- **Model Type**: Fine-tuned Causal Language Model
- **Language(s) (NLP)**: French
- **License**:
- **Finetuned from model**: Meta LLaMA 3.1 8B Instruct
### Model Sources
- **Repository**: https://huggingface.co/sanaa-11/mathematic-exercice-generator/tree/main
## Uses
### Direct Use
- **Primary Application**: This model is primarily used for generating math exercises tailored to Moroccan students in French, based on specific lessons and difficulty levels.
- **Example Use Case**: Educators can input lesson topics to generate corresponding exercises for classroom use or online learning platforms.
### Downstream Use
- **Potential Applications**: The model can be extended or adapted to create exercises in other languages or for different educational levels.
### Out-of-Scope Use
- **Not Suitable For**: The model is not designed for high-stakes assessments, as it may generate exercises that require further validation by subject matter experts.
## Bias, Risks, and Limitations
- **Bias**: The model may inherit biases from the data it was trained on, potentially generating exercises that reflect unintended cultural or linguistic biases.
- **Risks**: There is a risk of generating mathematically incorrect exercises or exercises that do not align with the intended curriculum.
- **Limitations**: The model's accuracy and relevance may decrease when generating exercises outside of its training domain or when applied to advanced mathematical topics not covered during fine-tuning.
### Recommendations
- **For Educators**: It is recommended to review the generated exercises for correctness and relevance before using them in a classroom setting.
- **For Developers**: Fine-tune the model further or adjust the training data to mitigate any biases and improve the quality of the generated content.
## How to Get Started with the Model
Use the following code snippet to load and generate exercises using the model:
```python
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import PeftModel, PeftConfig
import torch
# Base model name
model_name = "meta-llama/Meta-Llama-3.1-8B-Instruct"
# Load the base model without specifying rope_scaling
model = AutoModelForCausalLM.from_pretrained(
model_name,
device_map="auto", # Adjust based on your environment
offload_folder="./offload_dir", # Specify a folder for offloading if necessary
torch_dtype=torch.float16, # Use float16 for better performance on compatible hardware
revision="main" # Specify the correct revision if needed
)
# Load the adapter configuration
config = PeftConfig.from_pretrained("sanaa-11/mathematic-exercice-generator")
# Load the adapter weights into the model
model = PeftModel.from_pretrained(model, "sanaa-11/mathematic-exercice-generator")
# Load the tokenizer
tokenizer = AutoTokenizer.from_pretrained(model_name, use_fast=True)
```
Then generate an exercise from a prompt:
```python
generated_text = ""
prompt = "Fournis un exercice basé sur la vie reelle de difficulté moyenne de niveau 2 annee college sur les fractions."
for _ in range(5):
inputs = tokenizer(prompt + generated_text, return_tensors="pt").to(model.device)
outputs = model.generate(
**inputs,
max_length=1065,
temperature=0.7,
top_p=0.9,
num_beams=5,
repetition_penalty=1.2,
no_repeat_ngram_size=2,
pad_token_id=tokenizer.eos_token_id,
early_stopping=False
)
new_text = tokenizer.decode(outputs[0], skip_special_tokens=True)
generated_text += new_text
print(new_text)
```
## Training Details
### Training Data
- **Dataset**: The model was fine-tuned on a custom dataset consisting of 11,106 rows of math exercises, lesson content, and solutions, specifically designed for Moroccan students in French.
### Training Procedure
#### Preprocessing
- **Data Cleaning**: Text normalization, tokenization, and padding were applied to prepare the data.
- **Tokenization**: Text was tokenized with the base Meta-Llama-3.1-8B-Instruct tokenizer, loaded through Hugging Face's `AutoTokenizer`.
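A minimal preprocessing sketch, assuming the dataset exposes a single `text` column (the actual column names in `sanaa-11/math-dataset` may differ):
```python
from datasets import load_dataset
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3.1-8B-Instruct")
tokenizer.pad_token = tokenizer.eos_token  # Llama tokenizers ship without a pad token

dataset = load_dataset("sanaa-11/math-dataset")

def preprocess(batch):
    # "text" is an assumed column name; adapt it to the dataset's actual schema
    return tokenizer(
        batch["text"],
        truncation=True,
        padding="max_length",
        max_length=1024,
    )

tokenized = dataset.map(preprocess, batched=True)
```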
### Training Hyperparameters
- **Training Regime**: The model was fine-tuned with 4-bit quantization and QLoRA to keep GPU and RAM usage within the limits of a free Kaggle environment (a minimal configuration sketch follows this list).
- **Batch Size**: 1 (with gradient accumulation steps of 8)
- **Number of Epochs**: 10
- **Learning Rate**: 5e-5
- **Optimizer**: AdamW
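The full training script is not part of this card; the sketch below shows one way to reproduce the setup described above with `transformers`, `peft`, and `bitsandbytes`, reusing the tokenized dataset from the preprocessing sketch. The LoRA rank, alpha, dropout, and target modules are assumptions (typical QLoRA defaults), not values taken from the original run.
```python
import torch
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    BitsAndBytesConfig,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

base = "meta-llama/Meta-Llama-3.1-8B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(base)
tokenizer.pad_token = tokenizer.eos_token

# 4-bit quantization (QLoRA) so the 8B model fits on a single T4
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)
model = AutoModelForCausalLM.from_pretrained(
    base, quantization_config=bnb_config, device_map="auto"
)
model = prepare_model_for_kbit_training(model)

# LoRA adapter; r, alpha, dropout and target_modules are assumed defaults
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)

# Hyperparameters taken from the list above
training_args = TrainingArguments(
    output_dir="./checkpoints",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,
    num_train_epochs=10,
    learning_rate=5e-5,
    optim="adamw_torch",
    fp16=True,
)

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=tokenized["train"],  # tokenized dataset from the preprocessing sketch
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```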
## Evaluation
### Testing Data, Factors & Metrics
**Testing Data**
- A separate subset of 10% of the dataset was reserved for evaluation.
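One way such a split can be produced with the `datasets` library (the random seed is illustrative; the actual split procedure is not documented here):
```python
from datasets import load_dataset

dataset = load_dataset("sanaa-11/math-dataset", split="train")
# Hold out 10% of the rows for evaluation; the seed is illustrative
split = dataset.train_test_split(test_size=0.1, seed=42)
train_set, eval_set = split["train"], split["test"]
```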
**Factors**
- **Complexity of Generated Exercises**: Exercises were evaluated based on their complexity relative to the intended difficulty level.
**Metrics**
- **Training Loss**: The loss measured during training.
- **Validation Loss**: The loss measured on the validation dataset during training.
**Results**
- **Training and Validation Loss**: The model was evaluated based on training and validation loss over 8 epochs. The results indicate that the model's performance improved significantly after the first few epochs, with a steady decrease in both training and validation loss. The final validation loss achieved was 0.154888, indicating a good fit to the validation data without significant overfitting.
### Summary
**Model Examination**
- The model demonstrated a consistent reduction in both training and validation loss across the training epochs, suggesting effective learning and generalization from the provided dataset.
- While F1 score and perplexity were not used in this evaluation, the training and validation losses provide a strong indication of the model's performance and its potential for generating accurate and relevant math exercises.
## Environmental Impact
**Carbon Emissions**
- **Hardware Type**: Tesla T4 GPU
- **Hours Used**: 12 hours
- **Cloud Provider**: Kaggle
- **Carbon Emitted**: Not measured; it can be estimated with the Machine Learning Impact calculator by Lacoste et al. (2019), as in the rough estimate below.
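A back-of-the-envelope estimate, assuming the T4's ~70 W TDP and a rough global-average grid carbon intensity (both figures are assumptions, not measurements):
```python
# Rough CO2 estimate in the spirit of Lacoste et al. (2019).
# Every constant below is an assumption, not a measured value.
gpu_power_kw = 0.070      # Tesla T4 TDP is roughly 70 W
hours = 12                # training duration reported above
grid_intensity = 0.475    # kg CO2eq per kWh, rough global average

energy_kwh = gpu_power_kw * hours
emissions_kg = energy_kwh * grid_intensity
print(f"~{energy_kwh:.2f} kWh, ~{emissions_kg:.2f} kg CO2eq")
```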
### Technical Specifications
**Model Architecture and Objective**
- The model is based on the LLaMA 3.1 architecture, fine-tuned to generate text in French for educational purposes, specifically math exercises.
**Compute Infrastructure**
- The model was trained on Kaggle’s free-tier environment, leveraging a single Tesla T4 GPU.
**Hardware**
- **GPU**: Tesla T4 with 16 GB of VRAM
**Software**
- **Transformers Version**: 4.44.0
- **PEFT Version**: 0.12.0
### Citation
**BibTeX**:
```bibtex
@misc{abril2024mathexercises,
author = {Sanaa Abril},
title = {Fine-Tuned LLaMA 3.1 for Generating Math Exercises},
year = {2024},
publisher = {Hugging Face},
note = {\url{https://huggingface.co/sanaa-11/mathematic-exercice-generator}}
}
```
**APA**:
Abril, S. (2024). Fine-Tuned LLaMA 3.1 for Generating Math Exercises. Hugging Face. https://huggingface.co/sanaa-11/mathematic-exercice-generator
### More Information
- For further details or questions, feel free to reach out to the model card authors.
### Model Card Authors
- **Sanaa Abril** - [email protected]
### Framework versions
- **Transformers**: 4.44.0
- **PEFT**: 0.12.0