Hardware requirements
Same issue. Cannot seem to figure it out
Do you get the error when loading the model or when running inference?
Same experience here. Couldn't get it running on an RTX 4070 with 16 GB of RAM, and couldn't get it going on Colab with 15 GB of VRAM either.
# The default range for the number of visual tokens per image in the model is 4-16384. You can set min_pixels and max_pixels according to your needs, such as a token count range of 256-1280, to balance speed and memory usage.
from transformers import AutoProcessor

min_pixels = 256*28*28
max_pixels = 1280*28*28
processor = AutoProcessor.from_pretrained("Qwen/Qwen2-VL-2B-Instruct", min_pixels=min_pixels, max_pixels=max_pixels)
Try setting max_pixels when initializing the processor. Otherwise it will use a ton of VRAM when dealing with high-resolution images.
thx
Your GPU with 15 GB of VRAM is generally sufficient for running Qwen2-VL-2B. The out-of-memory error is most likely caused by the image size or the model precision settings. Try resizing the image so its longer edge is at most 1080 or 1280 pixels while maintaining the aspect ratio. Additionally, load the model with torch_dtype=torch.float16 to reduce memory usage. These adjustments should help resolve the issue.
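A minimal sketch of both adjustments (half-precision loading plus downscaling the longer edge); the image path and the 1280-pixel cap are placeholders you can adapt:

import torch
from PIL import Image
from transformers import AutoProcessor, Qwen2VLForConditionalGeneration

# Load the model in float16 to roughly halve VRAM usage compared to float32.
model = Qwen2VLForConditionalGeneration.from_pretrained(
    "Qwen/Qwen2-VL-2B-Instruct",
    torch_dtype=torch.float16,
    device_map="auto",
)
processor = AutoProcessor.from_pretrained("Qwen/Qwen2-VL-2B-Instruct")

# Downscale so the longer edge is at most 1280 px, keeping the aspect ratio.
image = Image.open("example.jpg")  # placeholder path
scale = 1280 / max(image.size)
if scale < 1:
    image = image.resize((int(image.width * scale), int(image.height * scale)))
# The resized image can then be passed to the processor as usual.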