Cly

Akikaaa

AI & ML interests

None yet

Organizations

None yet

Akikaaa's activity

updated a model 4 days ago

Akikaaa/ppo-LunarLander-v2

Reinforcement Learning • Updated 4 days ago • 2

New activity in Qwen/Qwen2.5-0.5B-Instruct 9 days ago

Does this model apply SFT or SFT+RL during post-training?

#8 opened 9 days ago by

liked a model 2 months ago

mlabonne/Meta-Llama-3-8B

Text Generation • Updated May 2, 2024 • 134 • 1