Molmo-7B-D BnB 4-bit quant (30 GB -> 7 GB)

Approx. 12 GB of VRAM required for inference.
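A rough back-of-envelope check on that figure, as a sketch: the overhead number is an assumption, not a measurement, meant only to show how the checkpoint size and the VRAM requirement relate.

```python
# Back-of-envelope VRAM estimate (assumption-laden sketch, not a measurement).
# The 4-bit checkpoint is ~7 GB; inference adds vision-tower activations,
# the KV cache, dequantization buffers, and the CUDA context on top.
weights_gb = 7.0           # checkpoint size stated above
runtime_overhead_gb = 5.0  # assumed: activations + KV cache + CUDA context
estimate_gb = weights_gb + runtime_overhead_gb

print(f"~{estimate_gb:.0f} GB VRAM")  # consistent with the ~12 GB figure above
```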

See the base model card for more information:

https://huggingface.co/allenai/Molmo-7B-D-0924

Example code:

https://github.com/cyan2k/molmo-7b-bnb-4bit
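A minimal loading sketch, following the usage pattern on the allenai/Molmo-7B-D-0924 card (`AutoProcessor.process`, `generate_from_batch`), pointed at this quantized repo. The `describe_image` helper and its defaults are illustrative, not part of the repo; calling it downloads ~7 GB of weights and needs roughly the VRAM noted above.

```python
# Hedged sketch: one image+text prompt through the 4-bit quantized checkpoint.
# describe_image is a hypothetical helper for illustration.
MODEL_ID = "cyan2k/molmo-7B-D-bnb-4bit"

def describe_image(image_path: str, prompt: str = "Describe this image.") -> str:
    """Run a single image+prompt pair through the quantized model.

    Heavy: downloads the checkpoint on first call and runs on GPU.
    """
    from PIL import Image
    from transformers import AutoModelForCausalLM, AutoProcessor, GenerationConfig

    # trust_remote_code is required: Molmo ships custom modeling code,
    # which is also why the serverless Inference API cannot host it.
    processor = AutoProcessor.from_pretrained(
        MODEL_ID, trust_remote_code=True, torch_dtype="auto", device_map="auto"
    )
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, trust_remote_code=True, torch_dtype="auto", device_map="auto"
    )

    inputs = processor.process(images=[Image.open(image_path)], text=prompt)
    # Add a batch dimension of 1 and move tensors to the model's device.
    inputs = {k: v.to(model.device).unsqueeze(0) for k, v in inputs.items()}

    output = model.generate_from_batch(
        inputs,
        GenerationConfig(max_new_tokens=200, stop_strings="<|endoftext|>"),
        tokenizer=processor.tokenizer,
    )
    # Decode only the newly generated tokens, skipping the prompt.
    generated = output[0, inputs["input_ids"].size(1):]
    return processor.tokenizer.decode(generated, skip_special_tokens=True)
```

The heavy work is kept inside the function so the module can be imported and inspected without triggering the download.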

Performance metrics and benchmarks against the base model will follow over the next week.

Model size: 4.67B params (Safetensors; tensor types F32 · U8)
Note: the serverless Inference API does not yet support model repos that contain custom code.

Base model: Qwen/Qwen2-7B (this repo is a quantization of the Molmo-7B-D model built on it)