---
base_model:
  - TinyLlama/TinyLlama-1.1B-Chat-v1.0
---

# Model Card

## Model Description

This is a Large Language Model (LLM) fine-tuned from TinyLlama/TinyLlama-1.1B-Chat-v1.0 on the DIBT/10k_prompts_ranked dataset.

## Evaluation Results

### HellaSwag

The run used `batch_size = auto:4`; the largest batch size detected was 64. Logged model configuration:

`hf (pretrained=EleutherAI/pythia-160m,revision=step100000,dtype=float), gen_kwargs: (None), limit: None, num_fewshot: None, batch_size: auto:4 (64,64,64,64,64)`

| Tasks     | Version | Filter | n-shot | Metric   | Value  | Stderr   |
|-----------|---------|--------|--------|----------|--------|----------|
| hellaswag | 1       | none   | 0      | acc      | 0.2872 | ± 0.0045 |
|           |         | none   | 0      | acc_norm | 0.3082 | ± 0.0046 |
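
The logged configuration and table above match the output format of EleutherAI's lm-evaluation-harness. As a minimal sketch, a comparable HellaSwag run could be reproduced with the harness's Python API (assuming `lm-eval` ≥ 0.4 is installed; the model and revision are copied verbatim from the logged configuration, not from this repository):

```python
# Minimal reproduction sketch using EleutherAI's lm-evaluation-harness (assumed >= 0.4).
# The pretrained model and revision below are taken verbatim from the logged configuration above.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=EleutherAI/pythia-160m,revision=step100000,dtype=float",
    tasks=["hellaswag"],
    num_fewshot=0,
    batch_size="auto:4",  # auto-detect the largest batch size, re-detecting up to 4 times
)

# Per-task metrics (acc, acc_norm) as reported in the table above.
print(results["results"]["hellaswag"])
```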

## How to Use

To use this model, download the checkpoint from this repository and load it with your preferred deep learning framework, for example the Hugging Face `transformers` library, as sketched below.
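
A minimal loading sketch with `transformers` is shown below. The repository id `pavel-tolstyko/<model-name>` is a placeholder, since the exact model id is not stated in this card:

```python
# Minimal loading sketch using Hugging Face transformers (assumed installed).
# NOTE: "pavel-tolstyko/<model-name>" is a placeholder; replace it with the actual repository id.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "pavel-tolstyko/<model-name>"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Generate a short completion from a prompt.
inputs = tokenizer("Write a one-sentence summary of what an LLM is.", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```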