Phi-3 conversation format, example training script and perplexity metric (#1582) cf64284 unverified roborovski winglian committed on Jun 4, 2024
Generalizing the chat_template prompt strategy (#1660) [skip ci] cc11c6b unverified fozziethebeat committed on May 28, 2024
Fix Google Colab notebook 2024-05 (#1662) [skip ci] 5f91064 unverified Maciek committed on May 28, 2024
Update tiny-llama qlora.yml addressing eval packing error (#1638) 84bb806 unverified Jaydeep Thik committed on May 22, 2024
update outputs path so that we can mount workspace to /workspace/data (#1623) 4fde300 unverified winglian committed on May 15, 2024
fix(yml): update llama-3 config (#1543) [skip ci] 0e8f340 unverified Nanobit committed on Apr 19, 2024
Fix the wrong adapter in qwen2-moe-qlora example (#1501) [skip ci] 7f17eff unverified MaziyarPanahi committed on Apr 9, 2024
turn sample_packing on for training (#1438) [skip ci] c19d060 unverified satpalsr committed on Mar 26, 2024
chore(config): refactor old mistral config (#1435) f1ebaa0 unverified Nanobit committed on Mar 25, 2024
strip out hacky qlora-fsdp workarounds now that qlora-fsdp fixes are upstreamed (#1428) 2a1589f unverified winglian committed on Mar 21, 2024
Train parameters exclusively in specific ranges (#1390) 05bcc9e unverified seungduk committed on Mar 14, 2024
Update tinyllama lora.yml to fix eval packing issue (#1362) 8984bf1 unverified rasbt committed on Mar 5, 2024
Mps mistral lora (#1292) [skip ci] 0f6af36 unverified Maxime Nanobit winglian committed on Feb 27, 2024
Add instructions for playing with qlora model to colab example (#1290) 6ab69ec unverified Jared Palmer Nanobit JohanWork committed on Feb 21, 2024
fix(examples): remove is_*_derived as it's parsed automatically (#1297) a7a9a14 unverified Nanobit committed on Feb 21, 2024
Add seq2seq eval benchmark callback (#1274) 5a5d474 unverified LeonardoEmili committed on Feb 13, 2024
Update qlora.yml - remove `max_packed_sequence_len` (#1210) [skip ci] 5407ddd unverified 7flash committed on Jan 26, 2024
Fine-Tuning Mistral-7b for Real-World Chatbot Applications Using Axolotl (Lora used) (#1155) cc25039 unverified Tilemachos Chatzipapas twenty8th winglian committed on Jan 23, 2024
set fp16 to false if bf16, update bf16: auto in example YAMLs (#1122) [skip ci] 782b6a4 unverified winglian Nanobit committed on Jan 22, 2024
Add shifted sparse attention (#973) [skip-ci] 1d70f24 unverified jrc joecummings winglian committed on Jan 18, 2024