K's picture

1 6 9

K

cyk1337

·

AI & ML interests

Large language models.

Organizations

cyk1337's activity

commented a paper 3 months ago

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Paper • 2410.02743 • Published Oct 3, 2024 • 7 •

commented a paper 7 months ago

Tokenization Falling Short: The Curse of Tokenization

Paper • 2406.11687 • Published Jun 17, 2024 • 15 •