Add training callback to send predictions to WandB table (#521) 5b67ea9 unverified Glavin001 commited on Sep 13, 2023
Add support for GPTQ using native transformers/peft (#468) 3355706 unverified winglian commited on Sep 5, 2023
Added advanced DDP args (#515) 396a7a7 unverified Jan Philipp Harries Jan Philipp Harries commited on Aug 31, 2023
pad_to_worst_case_seq_len boolean, for testing memory limits (#498) 8e197f6 unverified Birch-san tmm1 commited on Aug 28, 2023
ReLoRA implementation (with quantization) (#322) bde3c5a unverified chargoddard winglian commited on Aug 24, 2023
use save_strategy from config if available (#434) b3f5e00 unverified winglian commited on Aug 19, 2023
Fix(config): Update handling of deepspeed config (#404) c01015f unverified Nanobit commited on Aug 15, 2023
Attention mask and position id fixes for packing (#285) 2bb0b78 unverified winglian commited on Aug 12, 2023
Merge branch 'OpenAccess-AI-Collective:main' into logging_enhancement 83237b8 unverified The Objective Dad commited on Jul 15, 2023
Merge pull request #274 from OpenAccess-AI-Collective/NanoCode012-patch-2 168a7a0 unverified Nanobit commited on Jul 14, 2023
fix bettertransformers save, force it to skip after saving correctly in callback 1a82082 winglian commited on Jun 1, 2023