Update README.md
Browse files
README.md
CHANGED
@@ -15,9 +15,9 @@ model-index:
|
|
15 |
|
16 |
The TVC models are 7B parameter models based on Qwen2-VL-7B-Instruct model with a context window of 8K tokens.
|
17 |
|
18 |
-
- **Repository:** https://github.com/
|
19 |
- **Languages:** English, Chinese
|
20 |
-
- **Paper:** https://arxiv.org/abs/
|
21 |
|
22 |
|
23 |
### Model Architecture
|
@@ -40,3 +40,12 @@ The TVC models are 7B parameter models based on Qwen2-VL-7B-Instruct model with
|
|
40 |
- Tokenizers 0.20.3
|
41 |
|
42 |
## Citation
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
15 |
|
16 |
The TVC models are 7B parameter models based on Qwen2-VL-7B-Instruct model with a context window of 8K tokens.
|
17 |
|
18 |
+
- **Repository:** https://github.com/sun-hailong/TVC
|
19 |
- **Languages:** English, Chinese
|
20 |
+
- **Paper:** https://arxiv.org/abs/2503.13360
|
21 |
|
22 |
|
23 |
### Model Architecture
|
|
|
40 |
- Tokenizers 0.20.3
|
41 |
|
42 |
## Citation
|
43 |
+
|
44 |
+
```
|
45 |
+
@article{sun2024mitigating,
|
46 |
+
title={Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning},
|
47 |
+
author={Sun, Hai-Long and Sun, Zhun and Peng, Houwen and Ye, Han-Jia},
|
48 |
+
journal={arXiv preprint arXiv:2503.13360},
|
49 |
+
year={2025}
|
50 |
+
}
|
51 |
+
```
|