ValiantLabs
/

Llama3.1-8B-Enigma

Model card Files Files and versions Community

Upload folder using huggingface_hub

by sequelbox - opened Sep 4, 2024

base: refs/heads/main

←

from: refs/pr/5

Discussion Files changed

+17

-18

Files changed (10) hide show

README.md +8 -4
config.json +1 -1
generation_config.json +1 -1
model-00001-of-00007.safetensors +1 -1
model-00002-of-00007.safetensors +1 -1
model-00003-of-00007.safetensors +1 -1
model-00004-of-00007.safetensors +1 -1
model-00005-of-00007.safetensors +1 -1
model-00006-of-00007.safetensors +1 -1
tokenizer.json +1 -6

README.md CHANGED Viewed

@@ -23,20 +23,24 @@ tags:
 base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
 datasets:
 - sequelbox/Tachibana
-- LDJnr/Pure-Dove
 model_type: llama
 license: llama3.1
 ---
 Enigma is a code-instruct model built on Llama 3.1 8b.
 - High quality code instruct performance within the Llama 3 Instruct chat format
 - Finetuned on synthetic code-instruct data generated with Llama 3.1 405b. [Find the current version of the dataset here!](https://huggingface.co/datasets/sequelbox/Tachibana)
 ## Version
-This is the **2024-08-10** release of Enigma for Llama 3.1 8b.
 Help us and recommend Enigma to your friends! We're excited for more Enigma releases in the future.
@@ -73,9 +77,9 @@ print(outputs[0]["generated_text"][-1])
 ```
 ## The Model
-Enigma is built on top of Llama 3.1 8b Instruct, using code-instruct data to supplement code-instruct performance using Llama 3.1 Instruct prompt style.
-Our current version of the Enigma code-instruct dataset is [sequelbox/Tachibana](https://huggingface.co/datasets/sequelbox/Tachibana), supplemented with a small selection of data from [LDJnr/Pure-Dove](https://huggingface.co/datasets/LDJnr/Pure-Dove) for general chat consistency.
 ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/63444f2687964b331809eb55/VCJ8Fmefd8cdVhXSSxJiD.jpeg)

 base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
 datasets:
 - sequelbox/Tachibana
+- sequelbox/Supernova
 model_type: llama
 license: llama3.1
 ---
+![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64f267a8a4f79a118e0fcc89/it7MY5MyLCLpFQev5dUis.jpeg)
 Enigma is a code-instruct model built on Llama 3.1 8b.
 - High quality code instruct performance within the Llama 3 Instruct chat format
 - Finetuned on synthetic code-instruct data generated with Llama 3.1 405b. [Find the current version of the dataset here!](https://huggingface.co/datasets/sequelbox/Tachibana)
+- Overall chat performance supplemented with [generalist synthetic data.](https://huggingface.co/datasets/sequelbox/Supernova)
 ## Version
+This is the **2024-09-04** release of Enigma for Llama 3.1 8b, enhancing code-instruct and general chat capabilities.
 Help us and recommend Enigma to your friends! We're excited for more Enigma releases in the future.
 ```
 ## The Model
+Enigma is built on top of Llama 3.1 8b Instruct, using high quality code-instruct data and general chat data in Llama 3.1 Instruct prompt style to supplement overall performance.
+Our current version of Enigma is trained on code-instruct data from [sequelbox/Tachibana](https://huggingface.co/datasets/sequelbox/Tachibana) and general chat data from [sequelbox/Supernova.](https://huggingface.co/datasets/sequelbox/Supernova)
 ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/63444f2687964b331809eb55/VCJ8Fmefd8cdVhXSSxJiD.jpeg)

config.json CHANGED Viewed

@@ -33,7 +33,7 @@
   "rope_theta": 500000.0,
   "tie_word_embeddings": false,
   "torch_dtype": "float32",
-  "transformers_version": "4.44.0",
   "use_cache": true,
   "vocab_size": 128256
 }

   "rope_theta": 500000.0,
   "tie_word_embeddings": false,
   "torch_dtype": "float32",
+  "transformers_version": "4.44.2",
   "use_cache": true,
   "vocab_size": 128256
 }

generation_config.json CHANGED Viewed

@@ -8,5 +8,5 @@
   ],
   "temperature": 0.6,
   "top_p": 0.9,
-  "transformers_version": "4.44.0"
 }

   ],
   "temperature": 0.6,
   "top_p": 0.9,
+  "transformers_version": "4.44.2"
 }

model-00001-of-00007.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7e9e5664bfc422dbb7cca97eb819fa479b586b328be26d51971633ebba245ca0
 size 4886466168

 version https://git-lfs.github.com/spec/v1
+oid sha256:08dff18399f4082cd4af329673d8e5f05ba976529cd4b2fd3eaa8a198ad48a0c
 size 4886466168

model-00002-of-00007.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a0b7fad220c567c9c95008ac7d0757ca2993ae7b83646ca524332d1eaa21f469
 size 4832007448

 version https://git-lfs.github.com/spec/v1
+oid sha256:93734a9e3f00cbded5bb06644d1a5a8a247c14f383e61ad49ad0c671e350f262
 size 4832007448

model-00003-of-00007.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c7a1e5696beaed309e06ae8c093ee5641f502b373b414afa6de929b706df1341
 size 4999813112

 version https://git-lfs.github.com/spec/v1
+oid sha256:d58a3870696db28abed0917760f77a5cf11322674209263462ff08935f87ea7e
 size 4999813112

model-00004-of-00007.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f9d982d47c65e3a0bf8136ab05cd82dab3a5029855837c14d599c02a0cf6dffe
 size 4999813128

 version https://git-lfs.github.com/spec/v1
+oid sha256:52bddb21602fc8f2ce0f79d011b6f6280dcf65f659142766b84f9a375524d364
 size 4999813128

model-00005-of-00007.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1d9b103e8b6a9e2d283eabb59871ba708dd5d51f530e81e765f1ee6227ae9722
 size 4832007496

 version https://git-lfs.github.com/spec/v1
+oid sha256:6f67914bd40d6b748669032e06fd2c36eb83c5354da989052174a97270c3dd1b
 size 4832007496

model-00006-of-00007.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c3ae2952e9f2312e2ec29c0797392c799ca3a1250c99db6d99df0b1c4a7b768d
 size 4999813120

 version https://git-lfs.github.com/spec/v1
+oid sha256:74a7c7fc0cc3074b7eb787ef4ede238dd0733565edadbb137c497615122e8080
 size 4999813120

tokenizer.json CHANGED Viewed

@@ -1,11 +1,6 @@
 {
   "version": "1.0",
-  "truncation": {
-    "direction": "Right",
-    "max_length": 5450,
-    "strategy": "LongestFirst",
-    "stride": 0
-  },
   "padding": null,
   "added_tokens": [
     {

 {
   "version": "1.0",
+  "truncation": null,
   "padding": null,
   "added_tokens": [
     {