Cly
Akikaaa
AI & ML interests
None yet
Recent Activity
updated
a model
4 days ago
Akikaaa/ppo-LunarLander-v2
new activity
9 days ago
Qwen/Qwen2.5-0.5B-Instruct:Does this model apply SFT or SFT+RL during post-training?
Organizations
None yet
Akikaaa's activity
Does this model apply SFT or SFT+RL during post-training?
#8 opened 9 days ago
by
Akikaaa