drop length column for issues with eval without packing (#1711) 3f1f5e3 unverified winglian committed on Jun 19, 2024
download model weights on preprocess step (#1693) 5783839 unverified winglian committed on Jun 10, 2024
bump deepspeed for fix for grad norm compute putting tensors on different devices (#1699) 851ccb1 unverified winglian committed on Jun 9, 2024
fix for when sample_packing and eval_sample_packing are different (#1695) 18cabc0 unverified winglian committed on Jun 8, 2024
add back packing efficiency estimate so epochs and multi-gpu works properly (#1697) ed8ef65 unverified winglian committed on Jun 8, 2024
ensure explicit eval_sample_packing to avoid mismatch issues (#1692) 9c1af1a unverified winglian committed on Jun 7, 2024
Phi-3 conversation format, example training script and perplexity metric (#1582) cf64284 unverified roborovski winglian committed on Jun 4, 2024
Fix the broken link in README (#1678) [skip ci] 5cde065 unverified saeedesmaili committed on Jun 3, 2024
cleanup the deepspeed proxy model at the end of training (#1675) d4f6c65 unverified winglian committed on May 30, 2024
set chat_template in datasets config automatically (#1664) 9d4225a unverified winglian committed on May 30, 2024
use mixins for orpo and kto configs so they work with axolotl customizations (#1674) f7332ac unverified winglian committed on May 30, 2024
handle the system role too for chat templates (#1671) b752080 unverified winglian committed on May 29, 2024
make sure the CI fails when pytest script fails (#1669) fe650dd unverified winglian committed on May 29, 2024
Fix README quick start example usage model dirs (#1668) 49b967b unverified Abe Voelker committed on May 28, 2024
Correct name of MixtralBlockSparseTop2MLP (L -> l) (#1667) 65db903 unverified seungduk committed on May 28, 2024
Fix: ensure correct handling of `val_set_size` as `float` or `int` (#1655) 6a5a725 unverified Davide Caroselli winglian committed on May 28, 2024
Generalizing the chat_template prompt strategy (#1660) [skip ci] cc11c6b unverified fozziethebeat committed on May 28, 2024
Fix Google Colab notebook 2024-05 (#1662) [skip ci] 5f91064 unverified Maciek committed on May 28, 2024
document how to use `share_strategy="no"` (#1653) [skip ci] 8a20a7b unverified charlesfrye committed on May 24, 2024
Switch to parallel FFD bin packing algorithm. (#1619) 367b2e8 unverified winglian daaave committed on May 23, 2024
support for custom messages field in sharegpt (#1651) bbfed31 unverified winglian committed on May 23, 2024
Update tiny-llama qlora.yml addressing eval packing error (#1638) 84bb806 unverified Jaydeep Thik committed on May 22, 2024
enable loraplus setting for dpo trainer (#1646) a27d5e1 unverified thepowerfuldeez committed on May 22, 2024
Fix llama3 chat_template (extra <|eot_id|> on last turn) (#1635) 7c2bf30 unverified leonardlin winglian committed on May 21, 2024
more fixes to work with runpod + skypilot (#1629) 0c49ecc unverified winglian committed on May 16, 2024