Ber666

SDSB

ber66666

AI & ML interests

None yet

Organizations

None yet

SDSB's activity

upvoted a paper 14 days ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published 17 days ago • 36

upvoted a paper 27 days ago

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published 28 days ago • 66

upvoted 2 papers 6 months ago

Flow of Reasoning: Efficient Training of LLM Policy with Divergent Thinking

Paper • 2406.05673 • Published Jun 9, 2024 • 3

Pandora: Towards General World Model with Natural Language Actions and Video States

Paper • 2406.09455 • Published Jun 12, 2024 • 15

upvoted a paper over 1 year ago

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes

Paper • 2306.13649 • Published Jun 23, 2023 • 17