Efficient-VQGAN: Towards High-Resolution Image Generation with Efficient Vision Transformers Paper • 2310.05400 • Published Oct 9, 2023 • 1
TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing Paper • 2203.17266 • Published Mar 31, 2022
Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts Paper • 2402.10958 • Published Feb 12, 2024
Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model Paper • 2501.02790 • Published Jan 2025 • 7
Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis Paper • 2312.03491 • Published Dec 6, 2023 • 33
Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling Paper • 2408.03695 • Published Aug 7, 2024 • 13
What If We Recaption Billions of Web Images with LLaMA-3? Paper • 2406.08478 • Published Jun 12, 2024 • 39
UltrAvatar: A Realistic Animatable 3D Avatar Diffusion Model with Authenticity Guided Textures Paper • 2401.11078 • Published Jan 20, 2024 • 7
CORE-MM: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models Paper • 2311.11567 • Published Nov 20, 2023 • 8
Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond Paper • 2304.04968 • Published Apr 11, 2023
Exploiting Chain Rule and Bayes' Theorem to Compare Probability Distributions Paper • 2012.14100 • Published Dec 28, 2020
Mixing and Shifting: Exploiting Global and Local Dependencies in Vision MLPs Paper • 2202.06510 • Published Feb 14, 2022
Contrastive Attraction and Contrastive Repulsion for Representation Learning Paper • 2105.03746 • Published May 8, 2021 • 1
DR2: Diffusion-based Robust Degradation Remover for Blind Face Restoration Paper • 2303.06885 • Published Mar 13, 2023