Jl-wei/guiclip-vit-base-patch32

GUIClip is a vision-language model in GUI domain.

Code and dataset can be found at https://github.com/Jl-wei/guing

If you find our work useful, please cite our paper:

@misc{wei2024guing,
      title={GUing: A Mobile GUI Search Engine using a Vision-Language Model}, 
      author={Jialiang Wei and Anne-Lise Courbis and Thomas Lambolais and Binbin Xu and Pierre Louis Bernard and Gérard Dray and Walid Maalej},
      year={2024},
      eprint={2405.00145},
      archivePrefix={arXiv},
      primaryClass={cs.SE}
}

Please note that the model can only be used for academic purpose.

Jl-wei
/

guiclip-vit-base-patch32

Model tree for Jl-wei/guiclip-vit-base-patch32