---
library_name: peft
base_model: mistralai/Mistral-7B-v0.1
---
# Model Card for Model ID
Trained with [Ludwig.ai](https://ludwig.ai) and [Predibase](https://predibase.com)!
Given a passage from a news report, the model generates a headline.
Trained on: https://huggingface.co/datasets/JulesBelveze/tldr_news
## Model Details
### Model Description
Ludwig config (v0.9.3):
```yaml
model_type: llm
input_features:
  - name: prompt
    type: text
    preprocessing:
      max_sequence_length: null
    column: prompt
output_features:
  - name: headline
    type: text
    preprocessing:
      max_sequence_length: null
    column: headline
prompt:
  template: >-
    The following passage is content from a news report. Please summarize this
    passage in one sentence or less.
    Passage: {content}
    Summary:
preprocessing:
  split:
    type: fixed
    column: split
  global_max_sequence_length: 2048
adapter:
  type: lora
generation:
  max_new_tokens: 64
trainer:
  type: finetune
  epochs: 3
  optimizer:
    type: paged_adam
  batch_size: 1
  eval_steps: 100
  learning_rate: 0.0002
  eval_batch_size: 2
  steps_per_checkpoint: 1000
  learning_rate_scheduler:
    decay: cosine
    warmup_fraction: 0.03
  gradient_accumulation_steps: 16
  enable_gradient_checkpointing: true
base_model: mistralai/Mistral-7B-v0.1
quantization:
  bits: 4
```
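
The config above implies a LoRA adapter on top of a 4-bit quantized `mistralai/Mistral-7B-v0.1`, prompted with the template shown. A minimal inference sketch with `transformers` and `peft` might look like the following. It rests on several assumptions: `adapter_id` is a hypothetical placeholder for wherever this adapter is published, the line breaks in `PROMPT_TEMPLATE` are a guess at the original template's whitespace, and `build_prompt` and `generate_headline` are illustrative helper names, not part of Ludwig or Predibase.

```python
BASE_MODEL = "mistralai/Mistral-7B-v0.1"

# Prompt text copied from the Ludwig config; the exact line breaks are an assumption.
PROMPT_TEMPLATE = (
    "The following passage is content from a news report. "
    "Please summarize this passage in one sentence or less.\n"
    "Passage: {content}\n"
    "Summary:"
)


def build_prompt(content: str) -> str:
    """Fill the training-time prompt template with a news passage."""
    return PROMPT_TEMPLATE.format(content=content.strip())


def generate_headline(content: str, adapter_id: str, max_new_tokens: int = 64) -> str:
    """Load the base model in 4-bit, attach the LoRA adapter, and generate.

    `adapter_id` is a hypothetical placeholder: pass the Hub repo ID or a
    local path of the adapter. Requires torch, transformers, peft and
    bitsandbytes; heavy imports are deferred so build_prompt stays lightweight.
    """
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

    tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)
    base = AutoModelForCausalLM.from_pretrained(
        BASE_MODEL,
        quantization_config=BitsAndBytesConfig(load_in_4bit=True),  # quantization.bits: 4
        device_map="auto",
    )
    model = PeftModel.from_pretrained(base, adapter_id)

    inputs = tokenizer(build_prompt(content), return_tensors="pt").to(model.device)
    with torch.no_grad():
        output = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Drop the prompt tokens and decode only the newly generated continuation.
    new_tokens = output[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip()
```

Calling `generate_headline(passage, adapter_id="<your-adapter-repo>")` downloads the base model on first use. The default of 64 new tokens mirrors `generation.max_new_tokens` in the config.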