Direct Language Model Alignment from Online AI Feedback • Paper • 2402.04792 • Published Feb 7, 2024
Suppressing Pink Elephants with Direct Principle Feedback • Paper • 2402.07896 • Published Feb 12, 2024
Self-Play Preference Optimization for Language Model Alignment • Paper • 2405.00675 • Published May 1, 2024