Standardize system prompt format for AlpacaPrompter (#1190) [skip ci] af02430 unverified Oleh Kuznetsov commited on Jan 24, 2024
upgrade deepspeed to 0.13.1 for mixtral fixes (#1189) [skip ci] 8a49309 unverified winglian commited on Jan 24, 2024
more dpo fixes for dataset loading and docs (#1185) [skip ci] 5bce45f unverified winglian commited on Jan 24, 2024
report min lenght of tokenized data (#1186) [skip ci] d85d494 unverified winglian commited on Jan 24, 2024
Fix generation_config validation raises Exception for do_merge_lora (#1184) 02f2c72 unverified tisorlawan commited on Jan 24, 2024
Add support for offline mode with HF_HUB_OFFLINE envvar (#1182) 71141de unverified James Wade winglian commited on Jan 24, 2024
don't fail if can't cast weights due to offload when merging (#1172) [skip ci] fb7f9b9 unverified winglian commited on Jan 23, 2024
Fine-Tuning Mistral-7b for Real-World Chatbot Applications Using Axolotl (Lora used) (#1155) cc25039 unverified Tilemachos Chatzipapas twenty8th winglian commited on Jan 23, 2024
Feat(test): Add tests for alpaca chatml prompt tokenizer (#1088) 5439707 unverified JohanWork Nanobit commited on Jan 23, 2024
support for explicit test_dataset definition for evals (#786) cda52dc unverified winglian commited on Jan 23, 2024
add commit message option to skip docker image builds in ci (#1168) [skip ci] 0f77b8d unverified winglian commited on Jan 23, 2024
improve vram use w gradient checkpointing (#1167) [skip ci] 802f966 unverified winglian commited on Jan 23, 2024
Add mlflow callback for pushing config to mlflow artifacts (#1125) b8e5603 unverified JohanWork commited on Jan 22, 2024
set fp16 to false if bf16, update bf16: auto in example YAMLs (#1122) [skip ci] 782b6a4 unverified winglian Nanobit commited on Jan 22, 2024
make sure the model config loader respects the model_revision too (#1160) [skip-ci] fccb542 unverified winglian commited on Jan 22, 2024
feat(dataset): add config to keep processed dataset in memory (#1152) 3db5f2f unverified Nanobit commited on Jan 20, 2024
Add shifted sparse attention (#973) [skip-ci] 1d70f24 unverified jrc joecummings winglian commited on Jan 18, 2024
fix(preprocess): Make sure dataset not loaded from cache when using preprocess cli (#1136) 1e56b88 unverified Nanobit commited on Jan 17, 2024
Agnostic cloud gpu docker image and Jupyter lab (#1097) ece0211 unverified winglian commited on Jan 16, 2024
Add `layers_to_transform` for `lora_config` (#1118) 8487b97 unverified xzuyn commited on Jan 16, 2024
fix(readme): clarify custom user prompt [no-ci] (#1124) 9cd27b2 unverified Nanobit commited on Jan 16, 2024
update PR template so we can capture twitter or discord handles (#1121) [skip ci] 0abf4d6 unverified winglian commited on Jan 14, 2024
Enable or disable bf16 support based on availability (#1116) 0865613 unverified Simon Hällqvist commited on Jan 14, 2024
Disable caching on `--disable_caching` in CLI (#1110) d66b101 unverified casperhansen winglian commited on Jan 13, 2024
Add link on README to Docker Debugging (#1107) 2dc4310 unverified hamel winglian commited on Jan 12, 2024
Add section for debugging with Docker (#1104) 6d342b5 unverified hamel winglian commited on Jan 12, 2024