Peng Shangpin
psp-dada
AI & ML interests
MultiModal
Recent Activity
upvoted a paper
about 2 months ago
Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs
new activity
about 2 months ago
openbmb/RLHF-V: Error when loading this model
Organizations
None yet
models
None public yet
datasets
None public yet