ControlLLM/Control-LLM-Llama3.1-8B-OpenCoder8-Instruct

nielsr (HF staff) committed 6 days ago

Commit 0be5dd6 (verified) · 1 parent: f6e4188

Add missing metadata (#1)

Files changed (1)

README.md CHANGED

@@ -74,12 +74,14 @@ model-index:
       value: 0.4029255319148936
       stderr: 0.004471732136513382
       verified: false
+pipeline_tag: text-generation
+library_name: transformers
 ---
 # Control-LLM-Llama3.1-8B-OpenCoder8
-This is a fine-tuned model of Llama-3.1-8B-Instruct for coding tasks on OpenCoder SFT dataset.
-## Linked Paper
-This model is associated with the paper: [Control-LLM](https://arxiv.org/abs/2501.10979).
+This is a fine-tuned model of Llama-3.1-8B-Instruct for coding tasks on OpenCoder SFT dataset described in the paper: [Control LLM: Controlled Evolution for Intelligence Retention in LLM](https://huggingface.co/papers/2501.10979).
+Code: https://github.com/linkedin/ControlLLM.
 ## Linked Open Source code - training, eval and benchmark
 This model is associated with the github: [Control-LLM](https://github.com/linkedin/ControlLLM).
@@ -117,4 +119,4 @@ The table below summarizes evaluation results across coding tasks and original c
 - **MLU**: MMLU (Massive Multitask Language Understanding)
 - **MLUP**: MMLU Pro
 - **O-Avg**: Original Capability - Size Weighted Average across ARC, GPQA, MMLU, and MMLU Pro
-- **Overall**: Combined average across all tasks
+- **Overall**: Combined average across all tasks
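
Note on the **O-Avg** legend entry: the README describes it as a size-weighted average across ARC, GPQA, MMLU, and MMLU Pro but does not spell out the weights. The sketch below shows one plausible reading, assuming each benchmark's score is weighted by its number of evaluation examples. The function name `size_weighted_average` and every score and size in the example are hypothetical placeholders, not values from the model card or the paper.

```python
def size_weighted_average(scores, sizes):
    """Average benchmark scores, weighting each by its example count.

    Assumption: "size" means the number of evaluation examples per benchmark.
    The README does not confirm this definition.
    """
    if scores.keys() != sizes.keys():
        raise ValueError("scores and sizes must cover the same benchmarks")
    total = sum(sizes.values())
    if total <= 0:
        raise ValueError("total size must be positive")
    return sum(scores[name] * sizes[name] for name in scores) / total


# Hypothetical placeholder numbers, for illustration only.
example_scores = {"ARC": 0.50, "GPQA": 0.30, "MMLU": 0.70, "MMLU Pro": 0.40}
example_sizes = {"ARC": 100, "GPQA": 100, "MMLU": 600, "MMLU Pro": 200}

o_avg = size_weighted_average(example_scores, example_sizes)
print(f"O-Avg (illustrative): {o_avg:.4f}")
```

Under this reading, larger benchmarks dominate the average, so a change on MMLU moves O-Avg more than an equal change on GPQA.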