tsystems
/

colqwen2-2b-v1.0-merged

multimodal_embedding

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

tattrongvu commited on 6 days ago

Commit

a6cbaba

·

verified ·

1 Parent(s): 1756456

Update README.md

Files changed (1) hide show

README.md +19 -2

README.md CHANGED Viewed

@@ -1,5 +1,5 @@
 ---
-license: mit
 datasets:
 - tattrongvu/vqa_de_en_batch1
 - vidore/colpali_train_set
@@ -99,4 +99,21 @@ scores = processor.score_multi_vector(query_embeddings, image_embeddings)
 ## License
-ColQwen2's vision language backbone model (Qwen2-VL) is under `apache2.0` license. The adapters attached to the model are under MIT license.

 ---
+license: cc
 datasets:
 - tattrongvu/vqa_de_en_batch1
 - vidore/colpali_train_set
 ## License
+ColQwen2's vision language backbone model (Qwen2-VL) is under `apache2.0` license.
+This fine-tuned adapter is under **CC BY NC 4.0 license**. Therefore, the use of the model is **research only** at the moment.
+## Citation
+If you use this models from this organization in your research, please cite the original paper as follows:
+```bibtex
+@misc{faysse2024colpaliefficientdocumentretrieval,
+  title={ColPali: Efficient Document Retrieval with Vision Language Models},
+  author={Manuel Faysse and Hugues Sibille and Tony Wu and Bilel Omrani and Gautier Viaud and Céline Hudelot and Pierre Colombo},
+  year={2024},
+  eprint={2407.01449},
+  archivePrefix={arXiv},
+  primaryClass={cs.IR},
+  url={https://arxiv.org/abs/2407.01449},
+}
+```