-
RLHF-And-Friends/RM-UltrafeedbackBinarized-Llama-3.2-3B-Instruct-Q4-LoRA8-Batch-16-Tok-1024
Updated -
RLHF-And-Friends/RM-UltrafeedbackBinarized-Llama-3.1-8B-Instruct-Q4-LoRA8-Batch-16-Tok-1024
Updated -
RLHF-And-Friends/RM-UltrafeedbackBinarized-Llama-3.2-1B-Instruct-Q4-LoRA8-Batch-16-Tok-1024
Updated -
RLHF-And-Friends/RM-UltrafeedbackBinarized-Llama-3.1-8B-Instruct-Q4-LoRA8-Batch-16-Tok-1024-Normalized
Updated
RLHF-And-Friends
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
Collections
5
models
49
RLHF-And-Friends/Llama-3.2-3B-Instruct-Q4-LoRA8-Batch-1x16-TokIO-960-512-LR-3e-6-NoSysPrompt
Updated
RLHF-And-Friends/Llama-3.2-3B-Instruct-Q4-LoRA8-Batch-2x16-TokIO-960-960-LR-3e-6-NoSysPrompt
Updated
RLHF-And-Friends/Llama-3.2-1B-Instruct-Q4-LoRA8-Batch-1x16-TokIO-960-512-LR-3e-6-NoSysPrompt-RM-3B
Updated
RLHF-And-Friends/RM-UltrafeedbackBinarized-Llama-3.2-1B-Instruct-Q4-LoRA8-Batch-16-Tok-1024-Normalized
Updated
RLHF-And-Friends/RM-UltrafeedbackBinarized-Llama-3.1-8B-Instruct-Q4-LoRA8-Batch-16-Tok-1024-Normalized
Updated
RLHF-And-Friends/RM-UltrafeedbackBinarized-Llama-3.2-3B-Instruct-Q4-LoRA8-Batch-16-Tok-1024-Normalized
Updated
RLHF-And-Friends/RM-UltrafeedbackBinarized-Llama-3.1-8B-Instruct-Q4-LoRA8-Batch-16-Tok-1024
Updated
RLHF-And-Friends/RM-UltrafeedbackBinarized-Llama-3.2-1B-Instruct-Q4-LoRA8-Batch-16-Tok-1024
Updated
RLHF-And-Friends/Llama-3.2-1B-Instruct-Q4-LoRA8-Batch-3x16-TokIO-960-512-LR-3e-6-NoSysPrompt-RM-3B
Updated
RLHF-And-Friends/Llama-3.2-1B-Instruct-Q4-LoRA8-Batch-3x16-TokIO-960-512-LR-3e-6-NoSysPrompt
Updated