Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Reverb
/
Idefics2-8b-docVQA-finetuned
like
3
Image-Text-to-Text
Transformers
Safetensors
12 datasets
English
idefics2
multimodal
vision
text-generation-inference
Inference Endpoints
arxiv:
2007.00398
arxiv:
2306.16527
arxiv:
2405.02246
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Idefics2-8b-docVQA-finetuned
/
added_tokens.json
Reverb
Upload 8 files
4444407
verified
8 months ago
raw
Copy download link
history
blame
contribute
delete
Safe
92 Bytes
{
"<end_of_utterance>"
:
32002
,
"<fake_token_around_image>"
:
32000
,
"<image>"
:
32001
}