Update README.md
Browse files
README.md
CHANGED
@@ -9,12 +9,15 @@ design to provide assistance to developers who are using [gradio](https://www.gr
|
|
9 |
# Dataset
|
10 |
The dataset is multi-source. Its content comes from the following sources
|
11 |
- The stack
|
|
|
12 |
More precisely, we looked into [the-stack-dedup](https://huggingface.co/datasets/bigcode/the-stack-dedup) which contain codes permissive licenses. We shortlisted the files whose
|
13 |
content incorporated the keyword `gradio`.
|
14 |
- GitHub Issues
|
|
|
15 |
We scrapped all the issues of the official repository [the-gradio-app/gradio](https://github.com/gradio-app/gradio) and added them to our training dataset.
|
16 |
- Spaces on Hugging Face Hub
|
17 |
-
|
|
|
18 |
with permissive licenses, namely MIT and Apache 2.0. This set of code was further deduplicated.
|
19 |
|
20 |
# Training setting and hyperparameters
|
|
|
9 |
# Dataset
|
10 |
The dataset is multi-source. Its content comes from the following sources
|
11 |
- The stack
|
12 |
+
|
13 |
More precisely, we looked into [the-stack-dedup](https://huggingface.co/datasets/bigcode/the-stack-dedup) which contain codes permissive licenses. We shortlisted the files whose
|
14 |
content incorporated the keyword `gradio`.
|
15 |
- GitHub Issues
|
16 |
+
|
17 |
We scrapped all the issues of the official repository [the-gradio-app/gradio](https://github.com/gradio-app/gradio) and added them to our training dataset.
|
18 |
- Spaces on Hugging Face Hub
|
19 |
+
|
20 |
+
We used the [HuggingFace_Hub API](https://huggingface.co/docs/huggingface_hub/package_reference/hf_api) to scrape the data from the spaces which are designed with gradio. We kept track of those
|
21 |
with permissive licenses, namely MIT and Apache 2.0. This set of code was further deduplicated.
|
22 |
|
23 |
# Training setting and hyperparameters
|