LeMaterial: an open source initiative to accelerate materials discovery and research 22 days ago • 31
Scaling Diffusion Language Models via Adaptation from Autoregressive Models Paper • 2410.17891 • Published Oct 23, 2024 • 15
WorldSimBench: Towards Video Generation Models as World Simulators Paper • 2410.18072 • Published Oct 23, 2024 • 18
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models Paper • 2410.17637 • Published Oct 23, 2024 • 34
Scalable Ranked Preference Optimization for Text-to-Image Generation Paper • 2410.18013 • Published Oct 23, 2024 • 14
DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes Paper • 2410.18084 • Published Oct 23, 2024 • 13
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding Paper • 2410.17434 • Published Oct 22, 2024 • 25
ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding Paper • 2410.13924 • Published Oct 17, 2024 • 6
TP-Eval: Tap Multimodal LLMs' Potential in Evaluation by Customizing Prompts Paper • 2410.18071 • Published Oct 23, 2024 • 6
M-RewardBench: Evaluating Reward Models in Multilingual Settings Paper • 2410.15522 • Published Oct 20, 2024 • 11
ZePo: Zero-Shot Portrait Stylization with Faster Sampling Paper • 2408.05492 • Published Aug 10, 2024 • 7
DinoBloom: A Foundation Model for Generalizable Cell Embeddings in Hematology Paper • 2404.05022 • Published Apr 7, 2024 • 2
BIMCV-R: A Landmark Dataset for 3D CT Text-Image Retrieval Paper • 2403.15992 • Published Mar 24, 2024 • 1
TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles Paper • 2410.05262 • Published Oct 7, 2024 • 9
TLDR: Token-Level Detective Reward Model for Large Vision Language Models Paper • 2410.04734 • Published Oct 7, 2024 • 16