Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
AnyModal
/
Image-Captioning-Llama-3.2-1B
like
1
Follow
AnyModal
5
Image-to-Text
Safetensors
AnyModal/flickr30k
English
AnyModal
vlm
vision
multimodal
License:
mit
Model card
Files
Files and versions
Community
1
fbffa94
Image-Captioning-Llama-3.2-1B
1 contributor
History:
2 commits
ritabratamaiti
Upload 4 files
fbffa94
verified
2 months ago
language_model
Upload 4 files
2 months ago
.gitattributes
1.52 kB
initial commit
2 months ago
README.md
24 Bytes
initial commit
2 months ago
input_tokenizer.pt
39.9 MB
LFS
Upload 4 files
2 months ago