Haoxiang-Wang
commited on
Commit
•
f036ed7
1
Parent(s):
6e0f90e
Update README.md
Browse files
README.md
CHANGED
@@ -284,7 +284,7 @@ Model Type:
|
|
284 |
Intended Users: | Generative AI developers for image and video generation models
|
285 |
Output: | Images/Videos and Latent Tokens
|
286 |
Describe how the model works: | Compresses and decompresses visual input (image/video).
|
287 |
-
Technical Limitations: |
|
288 |
Verified to have met prescribed NVIDIA quality standards: | Yes
|
289 |
Performance Metrics: | Peak Signal-to-Noise Ratio (PSNR), Structural Similarity (SSIM), Reconstruction Fréchet Video Distance (rFVD), Reconstruction Fréchet Inception Distance (rFID), Latency
|
290 |
Potential Known Risks: | Tokenizer's output can parse all forms of input, including what may be considered toxic, offensive, or indecent.
|
|
|
284 |
Intended Users: | Generative AI developers for image and video generation models
|
285 |
Output: | Images/Videos and Latent Tokens
|
286 |
Describe how the model works: | Compresses and decompresses visual input (image/video).
|
287 |
+
Technical Limitations: | Due to tokenizer compression limitations, some visual information (such as small text and other structured fine details) may not be reconstructed accurately.
|
288 |
Verified to have met prescribed NVIDIA quality standards: | Yes
|
289 |
Performance Metrics: | Peak Signal-to-Noise Ratio (PSNR), Structural Similarity (SSIM), Reconstruction Fréchet Video Distance (rFVD), Reconstruction Fréchet Inception Distance (rFID), Latency
|
290 |
Potential Known Risks: | Tokenizer's output can parse all forms of input, including what may be considered toxic, offensive, or indecent.
|