metadata
language:
  - en
library_name: CountGD
license: mit
tags:
  - computer-vision
  - counting
  - grounding-dino
  - model_hub_mixin
  - multi-modal
  - open-vocabulary
  - pytorch_model_hub_mixin
  - transformers

CountGD

CountGD is a multi-modal open-world counting model: it counts the objects in an image that match a text prompt, image prompts, or both. For more details, see the paper listed under Citation.

Sample prediction

Architecture

CountGD Architecture
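
As a rough illustration of how a Grounding DINO-style detector can be used as a counter, the sketch below counts the candidate detections whose confidence clears a threshold. It is a toy under stated assumptions, not CountGD's actual code: the function name `count_from_scores`, the threshold of 0.5, and the example scores are all hypothetical.

```python
from typing import Sequence


def count_from_scores(scores: Sequence[float], threshold: float = 0.5) -> int:
    """Count candidate detections whose confidence is strictly above `threshold`.

    `scores` stands in for per-candidate confidence values in [0, 1].
    Both the name and the default threshold are illustrative assumptions.
    """
    if not 0.0 <= threshold <= 1.0:
        raise ValueError("threshold must lie in [0, 1]")
    return sum(1 for s in scores if s > threshold)


# Hypothetical confidence scores for five candidate detections.
example_scores = [0.91, 0.08, 0.67, 0.49, 0.55]
predicted_count = count_from_scores(example_scores, threshold=0.5)
print(predicted_count)
```

Raising the threshold trades missed objects for fewer false positives, so in practice such a value would be tuned on held-out data rather than fixed by hand.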

Citation

@article{AminiNaieni24,
    author       = "Amini-Naieni, N. and Han, T. and Zisserman, A.",
    title        = "CountGD: Multi-Modal Open-World Counting",
    journal      = "arXiv",
    year         = "2024",
}