Article: Illustrating Reinforcement Learning from Human Feedback (RLHF) • Dec 9, 2022 • 135
AlicanKiraz0/Seneca-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Safe-Q2_K-GGUF Text Generation • Updated 5 days ago • 68 • 1
AlicanKiraz0/Seneca-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Safe-Q8_0-GGUF Text Generation • Updated 5 days ago • 103
AlicanKiraz0/SenecaLLM-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Q4_K_M-GGUF Text Generation • Updated 5 days ago • 43