Peng Shangpin's picture

1 2 1

Peng Shangpin

psp-dada

·

https://github.com/pspdada

pspdada

AI & ML interests

MultiModal

Recent Activity

upvoted a paper about 2 months ago

Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs

new activity about 2 months ago

openbmb/RLHF-V:Error when loading this model

View all activity

Organizations

None yet

psp-dada's activity

upvoted a paper about 2 months ago

Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs

Paper • 2406.18629 • Published Jun 26, 2024 • 41

upvoted an article 5 months ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23, 2024

• 226