Edit Models filters

Inference status

Misc

arxiv: 2406.18629

Inference Endpoints

AutoTrain Compatible

text-generation-inference

Misc with no match

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

8

Full-text search

Active filters: 2406.18629

xinlai/DeepSeekMath-RL-Step-DPO

Text Generation • Updated Jun 28, 2024 • 31 • 2

xinlai/Llama-3-70B-SFT-Step-DPO

Text Generation • Updated Jun 28, 2024 • 27

xinlai/Qwen2-72B-Instruct-Step-DPO

Text Generation • Updated Jun 28, 2024 • 27

xinlai/Qwen2-7B-SFT-Step-DPO

Text Generation • Updated Jun 28, 2024 • 25

xinlai/DeepSeekMath-Base-SFT-Step-DPO

Text Generation • Updated Jun 28, 2024 • 28

xinlai/Qwen2-57B-A14B-SFT-Step-DPO

Text Generation • Updated Jun 28, 2024 • 24 • 1

xinlai/Qwen1.5-32B-SFT-Step-DPO

Text Generation • Updated Jun 28, 2024 • 22 • 1

RichardErkhov/xinlai_-_Qwen2-7B-SFT-Step-DPO-gguf

Updated Oct 9, 2024 • 19