MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions Paper • 2410.02743 • Published Oct 3, 2024 • 7
Tokenization Falling Short: The Curse of Tokenization Paper • 2406.11687 • Published Jun 17, 2024 • 15
HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization Paper • 2402.16694 • Published Feb 26, 2024 • 2
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages Paper • 2212.06742 • Published Dec 13, 2022 • 2