Jianshu Zhang's picture

Jianshu Zhang

Sterzhang

·

https://sterzhang.github.io/

AI & ML interests

Data-Centric AI, Multi-Modal Understanding

Recent Activity

updated a dataset 9 days ago

Sterzhang/P-Bench-Choice

updated a dataset 26 days ago

Sterzhang/vlm2-bench

new activity 26 days ago

Sterzhang/vlm2-bench:delete dataset for updated version

View all activity

Organizations

Sterzhang's activity

New activity in Sterzhang/vlm2-bench 26 days ago

delete dataset for updated version

#6 opened 27 days ago by

delete for new update of VLM2Bench image cases

#5 opened 27 days ago by

delete for new update of VLM2Bench image cases

#4 opened 27 days ago by

New activity in Sterzhang/vlm2-bench about 1 month ago

Delete VLM2Bench_VID.tsv

#3 opened about 1 month ago by

commented a paper about 1 month ago

VLM$^2$-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues

Paper • 2502.12084 • Published Feb 17 • 29 •

New activity in Sterzhang/vlm2-bench about 1 month ago

refined readme

#2 opened about 1 month ago by

New activity in Sterzhang/vlm2-bench about 2 months ago

Upload vlm2-bench_dataset.zip

#1 opened about 2 months ago by

commented a paper 3 months ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1 • 107 •

New activity in luoruipu1/Valley-Instruct-65k 4 months ago

How to download all the video?

#1 opened 4 months ago by

New activity in lmms-lab/LLaVA-Video-178K 5 months ago

Query about how many frames are used to generate each caption?

#7 opened 5 months ago by

New activity in Sterzhang/PVIT-3M 5 months ago

P-Bench?

#2 opened 5 months ago by

machinelearningmonk

New activity in Sterzhang/image-textualization 8 months ago

update readme with datasheet

#2 opened 8 months ago by