Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model • Paper • 2501.02790 • Published 13 days ago • 9
Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers • Paper • 2501.02393 • Published 15 days ago • 8
Generalizable Origin Identification for Text-Guided Image-to-Image Diffusion Models • Paper • 2501.02376 • Published 15 days ago • 3
MagicFace: High-Fidelity Facial Expression Editing with Action-Unit Control • Paper • 2501.02260 • Published 15 days ago • 5
Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control • Paper • 2501.03847 • Published 12 days ago • 22
Demystifying Domain-adaptive Post-training for Financial LLMs • Paper • 2501.04961 • Published 11 days ago • 10
Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization • Paper • 2412.18525 • Published 26 days ago • 70
MMFactory: A Universal Solution Search Engine for Vision-Language Tasks • Paper • 2412.18072 • Published 27 days ago • 17
Molar: Multimodal LLMs with Collaborative Filtering Alignment for Enhanced Sequential Recommendation • Paper • 2412.18176 • Published 27 days ago • 15