Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
DAMO-NLP-SG
/
VideoRefer-7B-stage2.5
like
2
Follow
Language Technology Lab at Alibaba DAMO Academy
60
Visual Question Answering
Transformers
Safetensors
English
videorefer_qwen2
text-generation
multimodal large language model
large video-language model
Inference Endpoints
arxiv:
2406.07476
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
VideoRefer-7B-stage2.5
Commit History
Update README.md
38e9c97
verified
CircleRadon
commited on
11 days ago
Update config.json
b5aa872
verified
CircleRadon
commited on
11 days ago
Upload tokenizer
1601549
verified
CircleRadon
commited on
11 days ago
Upload VideoReferQwen2ForCausalLM
8c1ade9
verified
CircleRadon
commited on
11 days ago
initial commit
212fedf
verified
CircleRadon
commited on
11 days ago