Commit History
add support for rpo_alpha (#1681)
c996881
winglian
re-enable DPO for tests in modal ci (#1374)
1f151c0
winglian
need to add back drop_last for sampler (#1676)
05b0bd0
winglian
cleanup the deepspeed proxy model at the end of training (#1675)
d4f6c65
winglian
load explicit splits on datasets (#1652)
a944f7b
winglian
set chat_template in datasets config automatically (#1664)
9d4225a
winglian
use mixins for orpo and kto configs so they work with axolotl customizations (#1674)
f7332ac
winglian
revert multipack batch sampler changes (#1672)
a6b37bd
winglian
handle the system role too for chat templates (#1671)
b752080
winglian
make sure the CI fails when pytest script fails (#1669)
fe650dd
winglian
Correct name of MixtralBlockSparseTop2MLP (L -> l) (#1667)
65db903
seungduk
Fix: ensure correct handling of `val_set_size` as `float` or `int` (#1655)
6a5a725
Generalizing the chat_template prompt strategy (#1660) [skip ci]
cc11c6b
fozziethebeat
support for custom messages field in sharegpt (#1651)
bbfed31
winglian
enable loraplus setting for dpo trainer (#1646)
a27d5e1
thepowerfuldeez
allow report_to for multiple providers (#1647)
6299eb5
winglian
Fix llama3 chat_template (extra <|eot_id|> on last turn) (#1635)
7c2bf30
Add KTO support (#1640)
22ae21a
fixes to save on fractional save_steps (#1643)
ba45531
winglian
Unsloth optims for Llama (#1609)
8a1572a
winglian
add save_only_model option (#1634)
702a669
emozilla
Fix `total_num_steps` (#1566)
81da7d2
bofenghuang
FIX: max_length and max_prompt_length was not being sent to ORPOTrainer (#1584)
1e1921b
make sure to save on the last step (#1615)
1634ac8
winglian
fix attention mask collation (#1603)
0298273
winglian
feat: Add LLaMA-3 instruct prompt strategies for fine-tuning (#1553)
50421c8
adding llama3 fastchat conversation monkeypatch (#1539)
b32c08f
ignore the fsdp_config section too (#1606) [skip ci]
fff06af
winglian
make sure to save the lora adapter at the end of RL/dpo training (#1573)
796a085
winglian
improve tool handling roles (#1587)
cb78a36
winglian
feat: exclude mamba blocks for jamba (#1578)
8b9c15b
Nanobit
Pass deepspeed and fsdp as None explicitly when merging adapters to allow custom device_map (#1575)
9e1480e
chiragjn
improve save callbacks (#1592)
29cf15a
winglian
FIX: TRL trainer preprocessing step was running in one process (#1583)
b9bb169
Ali Mosavian
ADD: warning hub model (#1301)
601c08b
PoSE context length ext (#1567)
5294653
winglian
make sure everything stays in the same dtype when using dpo + FSDP (#1559)
68601ec
winglian
Add support for Gemma chat template (#1530)
60f5ce0
wrap prepared_ds_path in str() to avoid TypeError in fsspec package (#1548)
7477a53
ORPO Trainer replacement (#1551)
7d1d22f
winglian
Unsloth gradient checkpointing offload (#1528)
6319da1
winglian
DBRX Model Support (#1462)
132eb74
winglian
use locale agnostic separator to make large nums easier to read (#1503)
da9b1a3
winglian