loubnabnl HF staff committed on
Commit
8ee26fa
·
1 Parent(s): 85e5014

Update README.md

Files changed (1)
  1. README.md +42 -2
README.md CHANGED
@@ -7,9 +7,49 @@ sdk: static
7
  pinned: false
8
  ---
9
  <p>
10
- <img src="https://huggingface.co/datasets/loubnabnl/repo-images/resolve/main/codeparrot_logo.png" alt="drawing" width="440"/>
11
  </p>
12
-
13
  <p>This organization is dedicated to language models for code generation. In particular CodeParrot is a GPT-2 model trained to generate Python code. Here you can find:</p>
14
 
15
  <ul>
 
7
  pinned: false
8
  ---
9
  <p>
10
+ <img src="https://huggingface.co/datasets/loubnabnl/repo-images/resolve/main/codeparrot2.png" alt="drawing" width="440"/>
11
  </p>
12
+ <div style="
13
+ background-color: #ffcc00;
14
+ color: black;
15
+ padding: 10px;
16
+ margin-bottom: 20px;
17
+ text-align: center;
18
+ ">
19
+ ⚠️ NEW! Code instruction tuning datasets, models and benchmarks.
20
+ </div>
21
+ <p>Check the new instruction-tuning resources:</p>
22
+ <ul>
23
+ <li>
24
+ <p>
25
+ <b>InstructHumanEval: </b>a variant of the HumanEval benchmark adapted for instruction-tuned models<a
26
+ href="https://huggingface.co/datasets/codeparrot/instructhumaneval"
27
+ class="underline"> InstructHumanEval</a
28
+ >
29
+ </p></li>
30
+ <li>
31
+ <p>
32
+ <b>Full Curated CoNaLa: </b>we used UL2 to rewrite more than 590k uncurated intents in the CoNaLa dataset<a
33
+ href="https://huggingface.co/datasets/codeparrot/conala-mined-curated"
34
+ class="underline"> conala-mined-curated</a
35
+ >
36
+ </p></li>
37
+ <li>
38
+ <p>
39
+ <b>Self-Instruct with StarCoder: </b>we release a self-instruct dataset generated with StarCoder, as well as the code we used to build it<a
40
+ href="https://huggingface.co/datasets/codeparrot/self-instruct-starcoder"
41
+ class="underline"> self-instruct-starcoder</a
42
+ >
43
+ </p></li>
44
+ <li>
45
+ <p>
46
+ <b>Models trained on CoNaLa and self-instruct StarCoder: </b>we release the models we trained on the previous two datasets.
47
+
48
+ </p>
49
+ </li>
50
+ </ul>
51
+
52
+ <hr>
53
  <p>This organization is dedicated to language models for code generation. In particular CodeParrot is a GPT-2 model trained to generate Python code. Here you can find:</p>
54
 
55
  <ul>