ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness Paper • 1811.12231 • Published Nov 29, 2018 • 1
Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution Paper • 2307.06304 • Published Jul 12, 2023 • 29
Are Vision Language Models Texture or Shape Biased and Can We Steer Them? Paper • 2403.09193 • Published Mar 14, 2024 • 1
Do generative video models learn physical principles from watching videos? Paper • 2501.09038 • Published 3 days ago • 8