Does this model apply SFT or SFT+RL during post-training?

#8
by Akikaaa - opened

Was this model post-trained with SFT only, or with SFT followed by RL?
