tgaddair's picture
Upload 3 files
488e290 verified
|
raw
history blame
1.35 kB
---
library_name: peft
base_model: mistralai/Mistral-7B-v0.1
---
# Model Card for Model ID
Trained with [Ludwig.ai](https://ludwig.ai) and [Prdibase](https://predibase.com)!
Given a passage from a news report generates a headline.
Trained on: https://huggingface.co/datasets/JulesBelveze/tldr_news
## Model Details
### Model Description
Ludwig config (v0.9.3):
```yaml
model_type: llm
input_features:
- name: prompt
type: text
preprocessing:
max_sequence_length: null
column: prompt
output_features:
- name: headline
type: text
preprocessing:
max_sequence_length: null
column: headline
prompt:
template: >-
The following passage is content from a news report. Please summarize this
passage in one sentence or less.
Passage: {content}
Summary:
preprocessing:
split:
type: fixed
column: split
global_max_sequence_length: 2048
adapter:
type: lora
generation:
max_new_tokens: 64
trainer:
type: finetune
epochs: 3
optimizer:
type: paged_adam
batch_size: 1
eval_steps: 100
learning_rate: 0.0002
eval_batch_size: 2
steps_per_checkpoint: 1000
learning_rate_scheduler:
decay: cosine
warmup_fraction: 0.03
gradient_accumulation_steps: 16
enable_gradient_checkpointing: true
base_model: mistralai/Mistral-7B-v0.1
quantization:
bits: 4
```