Amartya77
/

RLHF_PPOppo_model

Reinforcement Learning

text2text-generation

Inference Endpoints

Model card Files Files and versions Community

README.md exists but content is empty.

Downloads last month: 4

Safetensors

Model size

582M params

Tensor type

F32

·

Video Preview

Reinforcement Learning

loading