Update README.md
Browse files
README.md
CHANGED
@@ -27,4 +27,17 @@ augmxnt/shisa-7b-v1
|
|
27 |
* Japanese Law Precedent Dataset
|
28 |
* Japanese Wikipedia
|
29 |
* .lg.jp, .go.jp, .ac.jp domain webscrapes from CulturaX (Any documents with same first 25 characters were de-duplicated)
|
30 |
-
* English Ultrachat200K-gen (So that it doesn't forget English and chatting ability learned in the base checkpoint)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
27 |
* Japanese Law Precedent Dataset
|
28 |
* Japanese Wikipedia
|
29 |
* .lg.jp, .go.jp, .ac.jp domain webscrapes from CulturaX (Any documents with same first 25 characters were de-duplicated)
|
30 |
+
* English Ultrachat200K-gen (So that it doesn't forget English and chatting ability learned in the base checkpoint)
|
31 |
+
|
32 |
+
# Developed by
|
33 |
+
|
34 |
+
### Engineers
|
35 |
+
Peter Devine
|
36 |
+
Sho Higuchi
|
37 |
+
|
38 |
+
### Advisors
|
39 |
+
Yuuki Yamanaka
|
40 |
+
Atom Sonoda
|
41 |
+
|
42 |
+
### Dataset evaluator
|
43 |
+
Renju Aoki
|