Allen8
/

TVC-7B

Image-Text-to-Text

Generated from Trainer

text-generation-inference

Model card Files Files and versions Community

Allen8 commited on Mar 18

Commit

07840f2

·

verified ·

1 Parent(s): 947b929

Update README.md

Files changed (1) hide show

README.md +11 -2

README.md CHANGED Viewed

@@ -15,9 +15,9 @@ model-index:
 The TVC models are 7B parameter models based on Qwen2-VL-7B-Instruct model with a context window of 8K tokens.
-- **Repository:** https://github.com/xxx
 - **Languages:** English, Chinese
-- **Paper:** https://arxiv.org/abs/xxx
 ### Model Architecture
@@ -40,3 +40,12 @@ The TVC models are 7B parameter models based on Qwen2-VL-7B-Instruct model with
 - Tokenizers 0.20.3
 ## Citation

 The TVC models are 7B parameter models based on Qwen2-VL-7B-Instruct model with a context window of 8K tokens.
+- **Repository:** https://github.com/sun-hailong/TVC
 - **Languages:** English, Chinese
+- **Paper:** https://arxiv.org/abs/2503.13360
 ### Model Architecture
 - Tokenizers 0.20.3
 ## Citation
+```
+@article{sun2024mitigating,
+    title={Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning},
+    author={Sun, Hai-Long and Sun, Zhun and Peng, Houwen and Ye, Han-Jia},
+    journal={arXiv preprint arXiv:2503.13360},
+    year={2025}
+}
+```