OpenFace-CQUPT
/

Human_LLaVA

Visual Question Answering

image-text-to-text

Inference Endpoints

Model card Files Files and versions Community

ponytail commited on Nov 6, 2024

Commit

a70b13c

·

verified ·

1 Parent(s): f5426be

Update README.md

Files changed (1) hide show

README.md +9 -1

README.md CHANGED Viewed

@@ -103,7 +103,15 @@ verified_shikra: https://github.com/shikras/shikra
 ## Citation
 ```
-Coming soon!!!
 ```
 ## contact

 ## Citation
 ```
+@misc{dai2024humanvlmfoundationhumanscenevisionlanguage,
+      title={HumanVLM: Foundation for Human-Scene Vision-Language Model},
+      author={Dawei Dai and Xu Long and Li Yutang and Zhang Yuanhui and Shuyin Xia},
+      year={2024},
+      eprint={2411.03034},
+      archivePrefix={arXiv},
+      primaryClass={cs.AI},
+      url={https://arxiv.org/abs/2411.03034},
+}
 ```
 ## contact