theprint commited on
Commit
4c98937
·
verified ·
1 Parent(s): 9952de2

Update README.md

Browse files

Information about the dataset.

Files changed (1) hide show
  1. README.md +15 -0
README.md CHANGED
@@ -13,6 +13,21 @@ language:
13
  datasets:
14
  - theprint/VanRossum-GPT
15
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
16
 
17
  # Uploaded model
18
 
 
13
  datasets:
14
  - theprint/VanRossum-GPT
15
  ---
16
+ # Homage to Python
17
+
18
+ The VanRossum dataset is all Python! I used [DataMix](https://github.com/theprint/DataMix) to combine a handful of highly rated Python-centric datasets, to get a sampling of each and create something new.
19
+
20
+ This data set has **80,000 entries** and is named after [**Guido Van Rossum**](https://en.wikipedia.org/wiki/Guido_van_Rossum), the man who invented Python back in 1991.
21
+
22
+ See the [VanRossum Collection](https://huggingface.co/collections/theprint/vanrossum-67363abb2d3459644d7fd102) on HF for all things related to this dataset.
23
+
24
+ ## Alpaca / GPT
25
+
26
+ There are 2 versions of this dataset available on Huggingface.
27
+
28
+ - [VanRossum-GPT](https://huggingface.co/datasets/theprint/VanRossum-GPT)
29
+ - [VanRossum-Alpaca](https://huggingface.co/datasets/theprint/VanRossum-Alpaca)
30
+
31
 
32
  # Uploaded model
33