dev-slx commited on
Commit
e8c7347
1 Parent(s): 567ec55

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -28,9 +28,11 @@ _Fast Inference with Customization:_ Once trained, the ELM model architecture pe
28
  ## ELM-v0.1 Model Release
29
  This repository contains code to run our ELM models. The current ELM model `elm-v0.1` (named _Rambutan_) was pre-trained (an intermediate checkpoint was used) and then instruction fine-tuned for downstream tasks.
30
 
31
- Models are located in the `models` folder. ELM models in this repository comes in three sizes (elm-1.0, elm-0.75 and elm-0.25) and supports the following use-case.
32
  - news_summarization (cnn dailymail dataset)
33
 
 
 
34
 
35
  ## Setup ELM
36
  ### Download ELM repo
 
28
  ## ELM-v0.1 Model Release
29
  This repository contains code to run our ELM models. The current ELM model `elm-v0.1` (named _Rambutan_) was pre-trained (an intermediate checkpoint was used) and then instruction fine-tuned for downstream tasks.
30
 
31
+ ELM models (in the `models` folder) in this repository come in three sizes (elm-1.0, elm-0.75 and elm-0.25). **All these different slices are extracted from the same ELM finetuned checkpoint for inference** and supports the following use-case.
32
  - news_summarization (cnn dailymail dataset)
33
 
34
+ **NOTE: ELM-v0.1 release is an early version finetuned from an intermediate pretrained checkpoint & without any KV caching, decoding optimizations, or quantization applied.**
35
+
36
 
37
  ## Setup ELM
38
  ### Download ELM repo