GUIClip
GUIClip is a vision-language model for the GUI domain.
The code and dataset are available at https://github.com/Jl-wei/guing
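As a rough illustration of how a vision-language model of this kind might be used for text-to-screenshot retrieval, here is a minimal sketch. It assumes the checkpoint `Jl-wei/guiclip-vit-base-patch32` loads with the standard `CLIPModel` and `CLIPProcessor` classes from `transformers`, which seems plausible given that the base model is `openai/clip-vit-base-patch32` but is not confirmed by this card. The helper names `embed_query_and_screens` and `rank_by_cosine` and the screenshot paths are hypothetical; refer to the repository above for the authors' actual usage.

```python
import math

# Model id taken from this model card; loading it through the CLIP classes is an assumption.
MODEL_ID = "Jl-wei/guiclip-vit-base-patch32"


def rank_by_cosine(query_vec, candidate_vecs):
    """Return candidate indices ordered from most to least similar to the query."""
    def cosine(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
        return dot / norm if norm else 0.0

    scores = [cosine(query_vec, c) for c in candidate_vecs]
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)


def embed_query_and_screens(query, image_paths):
    """Embed one text query and several GUI screenshots (hypothetical helper)."""
    # Imported lazily so the ranking helper above works without these packages.
    import torch
    from PIL import Image
    from transformers import CLIPModel, CLIPProcessor

    model = CLIPModel.from_pretrained(MODEL_ID)
    processor = CLIPProcessor.from_pretrained(MODEL_ID)

    images = [Image.open(p).convert("RGB") for p in image_paths]
    inputs = processor(text=[query], images=images, return_tensors="pt", padding=True)
    with torch.no_grad():
        outputs = model(**inputs)

    # text_embeds has one row per query, image_embeds one row per screenshot.
    return outputs.text_embeds[0].tolist(), outputs.image_embeds.tolist()


if __name__ == "__main__":
    # Placeholder file names; replace with real screenshot paths.
    screens = ["login.png", "settings.png"]
    text_vec, image_vecs = embed_query_and_screens("a login screen", screens)
    for idx in rank_by_cosine(text_vec, image_vecs):
        print(screens[idx])
```

The ranking step is kept separate from the model call so that embeddings can be computed once and reused across queries.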
If you find our work useful, please cite our paper:
@misc{wei2024guing,
title={GUing: A Mobile GUI Search Engine using a Vision-Language Model},
author={Jialiang Wei and Anne-Lise Courbis and Thomas Lambolais and Binbin Xu and Pierre Louis Bernard and Gérard Dray and Walid Maalej},
year={2024},
eprint={2405.00145},
archivePrefix={arXiv},
primaryClass={cs.SE}
}
Please note that the model may only be used for academic purposes.
Base model: openai/clip-vit-base-patch32