yunyangx
/

efficient-track-anything

Model card Files Files and versions Community

efficient-track-anything / README.md

yunyangx's picture

Update README.md

9bdd8ab verified 17 days ago

|

history blame contribute delete

1.03 kB

	---
	license: apache-2.0
	---

	# Efficient Track Anything
	[[`🤗Checkpoints`]](https://huggingface.co/yunyangx/efficient-track-anything/tree/main)[[`📕Project`](https://yformer.github.io/efficient-track-anything/)][[`🤗Gradio Demo`](https://bea2c478296e25b3ce.gradio.live)][[`📕Paper`](https://arxiv.org/pdf/2411.18933)]

	The Efficient Track Anything Model(EfficientTAM) takes a vanilla lightweight ViT image encoder. An efficient memory cross-attention is proposed to further improve the efficiency. Our EfficientTAMs are trained on SA-1B (image) and SA-V (video) datasets. EfficientTAM achieves comparable performance with SAM 2 with improved efficiency. Our EfficientTAM can run >10 frames per second with reasonable video segmentation performance on iPhone 15. Try our demo with a family of EfficientTAMs at [[`🤗Gradio Demo`](https://bea2c478296e25b3ce.gradio.live)].

	This repository contains a family of EfficientTAMs with checkpoints for practical deployments with different latency and quality needs.