spidogan commited on
Commit
b600607
·
verified ·
1 Parent(s): 460c134

Added small model with RoPE and MHLA

Browse files
Files changed (3) hide show
  1. README.md +0 -4
  2. config.json +5 -5
  3. model.safetensors +2 -2
README.md CHANGED
@@ -2,10 +2,6 @@
2
  tags:
3
  - model_hub_mixin
4
  - pytorch_model_hub_mixin
5
- datasets:
6
- - IgorVolochay/russian_jokes
7
- language:
8
- - ru
9
  ---
10
 
11
  This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
 
2
  tags:
3
  - model_hub_mixin
4
  - pytorch_model_hub_mixin
 
 
 
 
5
  ---
6
 
7
  This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
config.json CHANGED
@@ -1,10 +1,10 @@
1
  {
2
  "dropout": 0.1,
3
- "hidden_dim": 96,
4
- "intermediate_dim": 256,
5
  "max_seq_len": 128,
6
- "n_head": 4,
7
- "n_kv_head": 2,
8
- "n_layer": 3,
9
  "vocab_size": 1024
10
  }
 
1
  {
2
  "dropout": 0.1,
3
+ "hidden_dim": 768,
4
+ "intermediate_dim": 3072,
5
  "max_seq_len": 128,
6
+ "n_head": 12,
7
+ "n_kv_head": 12,
8
+ "n_layer": 12,
9
  "vocab_size": 1024
10
  }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:17701fdda46da06d5f7d042a73f410914a526191769ca3f74f0edb85f903eb17
3
- size 1925360
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d4eccc9d9d3b8b35ca109573b0a8c96b23cd077f942bc6b20515bb06267855a1
3
+ size 460642712