MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation Paper • 2411.19067 • Published Nov 28, 2024 • 7
Cosmos Tokenizer Collection A suite of image and video tokenizers • 13 items • Updated 6 days ago • 37
Unified Speech-Text Pretraining for Spoken Dialog Modeling Paper • 2402.05706 • Published Feb 8, 2024 • 6
RDNet Collection DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs [ECCV 2024] • 9 items • Updated Oct 16, 2024 • 3
rope-vit Collection Rotary Position Embedding for Vision Transformer [ECCV 2024] • 22 items • Updated Oct 16, 2024 • 3
DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs Paper • 2403.19588 • Published Mar 28, 2024 • 2