Nanobit committed on Dec 29, 2023

Commit f6ecf14 (unverified) · 1 parent: dec66d7

feat: remove need to add load_in* during merge (#1017)

Files changed (2)

README.md CHANGED

@@ -996,7 +996,7 @@ When you include these tokens in your axolotl config, axolotl adds these tokens
 ### Inference Playground
 Axolotl allows you to load your model in an interactive terminal playground for quick experimentation.
-The config file is the same config file used for training.
+The config file is the same config file used for training.
 Pass the appropriate flag to the inference command, depending upon what kind of model was trained:
@@ -1027,7 +1027,7 @@ Please use `--sample_packing False` if you have it on and receive the error simi
 Add below flag to train command above
 ```bash
-python3 -m axolotl.cli.merge_lora examples/your_config.yml --lora_model_dir="./completed-model" --load_in_8bit=False --load_in_4bit=False
+python3 -m axolotl.cli.merge_lora examples/your_config.yml --lora_model_dir="./completed-model"
 ```
 If you run out of CUDA memory, you can try to merge in system RAM with

src/axolotl/cli/merge_lora.py CHANGED

@@ -18,7 +18,15 @@ def do_cli(config: Path = Path("examples/"), **kwargs):
         return_remaining_strings=True
     )
     parsed_cli_args.merge_lora = True
-    parsed_cfg = load_cfg(config, merge_lora=True, **kwargs)
+    parsed_cfg = load_cfg(
+        config,
+        merge_lora=True,
+        load_in_8bit=False,
+        load_in_4bit=False,
+        flash_attention=False,
+        **kwargs
+    )
     do_merge_lora(cfg=parsed_cfg, cli_args=parsed_cli_args)
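
Note on the call shape in this hunk: explicit keywords followed by `**kwargs` pin the three values for every merge, but in Python a key that appears both explicitly and inside `**kwargs` raises a `TypeError` instead of being overridden. The sketch below illustrates this with a stand-in `load_cfg`. The stand-in, `MERGE_OVERRIDES`, and both wrapper functions are hypothetical names for illustration and are not part of axolotl.

```python
from typing import Any, Dict


def load_cfg(config: str, **overrides: Any) -> Dict[str, Any]:
    """Hypothetical stand-in for axolotl's load_cfg: it only records what it receives."""
    return {"config": config, **overrides}


# Values pinned by the merge entry point, copied from the diff.
MERGE_OVERRIDES: Dict[str, Any] = {
    "merge_lora": True,
    "load_in_8bit": False,
    "load_in_4bit": False,
    "flash_attention": False,
}


def load_cfg_for_merge_strict(config: str, **kwargs: Any) -> Dict[str, Any]:
    """Same call shape as the diff: explicit keywords, then **kwargs."""
    return load_cfg(
        config,
        merge_lora=True,
        load_in_8bit=False,
        load_in_4bit=False,
        flash_attention=False,
        **kwargs,
    )


def load_cfg_for_merge_lenient(config: str, **kwargs: Any) -> Dict[str, Any]:
    """Alternative sketch (not what the commit does): merge the dicts first,
    so the pinned values silently win over anything the caller passed."""
    return load_cfg(config, **{**kwargs, **MERGE_OVERRIDES})


if __name__ == "__main__":
    # The new README command: no load_in_* flags needed.
    print(load_cfg_for_merge_strict(
        "examples/your_config.yml", lora_model_dir="./completed-model"
    ))

    # A duplicated key collides with the explicit keyword.
    try:
        load_cfg_for_merge_strict("examples/your_config.yml", load_in_8bit=False)
    except TypeError as err:
        print("strict:", err)

    # The lenient variant accepts the duplicate and keeps the pinned value.
    print(load_cfg_for_merge_lenient("examples/your_config.yml", load_in_8bit=True))
```

Whether the old `--load_in_8bit=False --load_in_4bit=False` flags actually reach `**kwargs` depends on how the CLI layer forwards them, which this diff does not show. If they do, the strict shape would reject the old command line rather than ignore the now-redundant flags.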