Spaces:
Running
Running
clarifications
Browse files- index.html +2 -1
- xorbs.json +1 -1
index.html
CHANGED
@@ -18,7 +18,8 @@
|
|
18 |
<body>
|
19 |
<div class="card">
|
20 |
<h1>Visualizing Repo-level Dedupe</h1>
|
21 |
-
<p>This visualization demonstrates the amount of <a target="_blank" rel="noopener noreferrer" href="https://huggingface.co/blog/from-files-to-chunks">chunk-level dedupe</a>
|
|
|
22 |
</div>
|
23 |
<div id="vis"></div>
|
24 |
<script>
|
|
|
18 |
<body>
|
19 |
<div class="card">
|
20 |
<h1>Visualizing Repo-level Dedupe</h1>
|
21 |
+
<p>This visualization demonstrates the amount of <a target="_blank" rel="noopener noreferrer" href="https://huggingface.co/blog/from-files-to-chunks">chunk-level dedupe</a> across all public repos.</p>
|
22 |
+
<p>"Dedupe factor" is defined as the number of re-uses of a given "xorb". A "xorb" is a collection of content-defined chunks, typically around 1,000 chunks comprising up to 64 MB of total data.</p>
|
23 |
</div>
|
24 |
<div id="vis"></div>
|
25 |
<script>
|
xorbs.json
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 629739
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:feaca61dcd2c355fa10b3a69d844299b798587574d985693b31234ed550b1a66
|
3 |
size 629739
|