-
PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models
Paper • 2402.08714 • Published • 12 -
Data Engineering for Scaling Language Models to 128K Context
Paper • 2402.10171 • Published • 24 -
RLVF: Learning from Verbal Feedback without Overgeneralization
Paper • 2402.10893 • Published • 11 -
Coercing LLMs to do and reveal (almost) anything
Paper • 2402.14020 • Published • 13
Collections
Collections including paper arxiv:2403.06738
-
PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models
Paper • 2309.05793 • Published • 50 -
3D Gaussian Splatting for Real-Time Radiance Field Rendering
Paper • 2308.04079 • Published • 172 -
stabilityai/stable-diffusion-xl-base-1.0
Text-to-Image • Updated • 2.24M • 6.21k -
Ryukijano/lora-trained-xl-kaggle-p100
Text-to-Image • Updated • 13 • 1
-
CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model
Paper • 2403.05034 • Published • 21 -
V3D: Video Diffusion Models are Effective 3D Generators
Paper • 2403.06738 • Published • 28 -
FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model
Paper • 2403.10242 • Published • 11
-
Pix2Gif: Motion-Guided Diffusion for GIF Generation
Paper • 2403.04634 • Published • 15 -
StableDrag: Stable Dragging for Point-based Image Editing
Paper • 2403.04437 • Published • 26 -
Teaching Large Language Models to Reason with Reinforcement Learning
Paper • 2403.04642 • Published • 46 -
Yi: Open Foundation Models by 01.AI
Paper • 2403.04652 • Published • 62
-
AtP*: An efficient and scalable method for localizing LLM behaviour to components
Paper • 2403.00745 • Published • 13 -
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 607 -
MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT
Paper • 2402.16840 • Published • 24 -
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
Paper • 2402.13753 • Published • 115
-
Video as the New Language for Real-World Decision Making
Paper • 2402.17139 • Published • 19 -
Learning and Leveraging World Models in Visual Representation Learning
Paper • 2403.00504 • Published • 32 -
MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies
Paper • 2403.01422 • Published • 27 -
VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models
Paper • 2403.05438 • Published • 19
-
Video as the New Language for Real-World Decision Making
Paper • 2402.17139 • Published • 19 -
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
Paper • 2310.19512 • Published • 15 -
VideoMamba: State Space Model for Efficient Video Understanding
Paper • 2403.06977 • Published • 27 -
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Paper • 2401.09047 • Published • 14