Astute RAG: Overcoming Imperfect Retrieval Augmentation and Knowledge Conflicts for Large Language Models Paper • 2410.07176 • Published Oct 9, 2024 • 1
Data Advisor Collection [EMNLP 2024] Dynamic and Constitutional Data Curation for LLMs • 3 items • Updated Oct 13, 2024 • 1
Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models Paper • 2410.05269 • Published Oct 7, 2024 • 3
SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe Paper • 2410.05248 • Published Oct 7, 2024 • 8
AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs Paper • 2410.05295 • Published Oct 3, 2024 • 12
Unraveling Cross-Modality Knowledge Conflict in Large Vision-Language Models Paper • 2410.03659 • Published Oct 4, 2024 • 6
WPO Collection Models and datasets in paper "WPO: Enhancing RLHF with Weighted Preference Optimization". • 11 items • Updated Aug 22, 2024 • 5
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding Paper • 2406.09411 • Published Jun 13, 2024 • 18
Rethinking Tabular Data Understanding with Large Language Models Paper • 2312.16702 • Published Dec 27, 2023 • 4
WPO: Enhancing RLHF with Weighted Preference Optimization Paper • 2406.11827 • Published Jun 17, 2024 • 14
mDPO: Conditional Preference Optimization for Multimodal Large Language Models Paper • 2406.11839 • Published Jun 17, 2024 • 37