Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
v1v1d
/
vivid_base
like
0
Follow
ViViD
3
Image-Text-to-Text
Transformers
Safetensors
multilingual
GOT
feature-extraction
got
vision-language
ocr2.0
custom_code
arxiv:
2409.01704
arxiv:
2405.14295
arxiv:
2312.06109
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Use this model
main
vivid_base
1 contributor
History:
2 commits
AdithyaSK
Upload folder using huggingface_hub
3bfff56
verified
2 months ago
assets
Upload folder using huggingface_hub
2 months ago
.gitattributes
Safe
1.52 kB
initial commit
2 months ago
README.md
Safe
3.99 kB
Upload folder using huggingface_hub
2 months ago
config.json
Safe
986 Bytes
Upload folder using huggingface_hub
2 months ago
generation_config.json
Safe
117 Bytes
Upload folder using huggingface_hub
2 months ago
got_vision_b.py
Safe
16.1 kB
Upload folder using huggingface_hub
2 months ago
model.safetensors
Safe
1.43 GB
LFS
Upload folder using huggingface_hub
2 months ago
modeling_GOT.py
Safe
33.8 kB
Upload folder using huggingface_hub
2 months ago
qwen.tiktoken
Safe
2.55 MB
Upload folder using huggingface_hub
2 months ago
qwen_original.tiktoken
Safe
2.56 MB
Upload folder using huggingface_hub
2 months ago
render_tools.py
Safe
1.99 kB
Upload folder using huggingface_hub
2 months ago
special_tokens_map.json
Safe
149 Bytes
Upload folder using huggingface_hub
2 months ago
tokenisation.ipynb
Safe
23 kB
Upload folder using huggingface_hub
2 months ago
tokenization_qwen.py
Safe
10.2 kB
Upload folder using huggingface_hub
2 months ago
tokenization_qwen_original.py
Safe
10.1 kB
Upload folder using huggingface_hub
2 months ago
tokenizer_config.json
Safe
300 Bytes
Upload folder using huggingface_hub
2 months ago