tattrongvu committed
Commit a6cbaba · verified · 1 Parent(s): 1756456

Update README.md

Files changed (1)
  1. README.md +19 -2
README.md CHANGED
@@ -1,5 +1,5 @@
 ---
-license: mit
+license: cc
 datasets:
 - tattrongvu/vqa_de_en_batch1
 - vidore/colpali_train_set
@@ -99,4 +99,21 @@ scores = processor.score_multi_vector(query_embeddings, image_embeddings)
 
 ## License
 
-ColQwen2's vision language backbone model (Qwen2-VL) is under `apache2.0` license. The adapters attached to the model are under MIT license.
+ColQwen2's vision language backbone model (Qwen2-VL) is under `apache2.0` license.
+This fine-tuned adapter is under **CC BY NC 4.0 license**. Therefore, the use of the model is **research only** at the moment.
+
+## Citation
+
+If you use this models from this organization in your research, please cite the original paper as follows:
+
+```bibtex
+@misc{faysse2024colpaliefficientdocumentretrieval,
+title={ColPali: Efficient Document Retrieval with Vision Language Models},
+author={Manuel Faysse and Hugues Sibille and Tony Wu and Bilel Omrani and Gautier Viaud and Céline Hudelot and Pierre Colombo},
+year={2024},
+eprint={2407.01449},
+archivePrefix={arXiv},
+primaryClass={cs.IR},
+url={https://arxiv.org/abs/2407.01449},
+}
+```