update outputs path so that we can mount workspace to /workspace/data (#1623) 4fde300 unverified winglian commited on May 15, 2024
Add shifted sparse attention (#973) [skip-ci] 1d70f24 unverified jrc joecummings winglian commited on Jan 18, 2024
new evals_per_epoch and saves_per_epoch to make things cleaner (#944) 5f79b82 unverified winglian commited on Dec 12, 2023
don't compile deepspeed or bitsandbytes from source (#837) f544ab2 unverified winglian commited on Nov 9, 2023
simplify by removing duplicate base_model_config (#772) 2d8def6 unverified winglian commited on Oct 23, 2023
prepared dataset caching, other misc fixes (#665) e50a64e unverified winglian commited on Oct 3, 2023
more sane defaults for openllama 3b used for quickstarts (#602) 674c576 unverified winglian commited on Sep 19, 2023
Add wandb_entity to wandb options, update example configs, update README (#361) 7019509 unverified Morgan McGuire Morgan McGuire winglian commited on Aug 12, 2023
Merge pull request #193 from OpenAccess-AI-Collective/config-fixes-20230612 94f310c unverified winglian commited on Jun 12, 2023