CharlesLi/llama_3_sky_safe_o1_llama_3_70B_reflect_1000_100_full Text Generation • Updated 17 days ago • 7
CharlesLi/llama_3_sky_safe_o1_llama_3_70B_reflect_1000_500_full Text Generation • Updated 17 days ago • 7
CharlesLi/llama_3_sky_safe_o1_llama_3_70B_reflect_1000_1000_full Text Generation • Updated 17 days ago • 7
CharlesLi/llama_3_sky_safe_o1_llama_3_70B_reflect_4000_100_full Text Generation • Updated 17 days ago • 7
CharlesLi/llama_3_sky_safe_o1_llama_3_70B_reflect_4000_500_full Text Generation • Updated 17 days ago • 7
CharlesLi/llama_3_sky_safe_o1_llama_3_70B_reflect_4000_1000_full Text Generation • Updated 17 days ago • 7
RLHF-And-Friends/RM-UltrafeedbackBinarized-Llama-3.1-8B-Instruct-Q4-LoRA8-Batch-16-Tok-1024 Updated 14 days ago
tttx/l3.1-8b-inst-fft-induction-barc-heavy-200k-old-200k-lr1e-5-ep3 Text Generation • Updated 8 days ago • 2