swap batch size for gradient accumulation steps to decouple from num gpu c2a0792 winglian committed on May 31, 2023
Update wandb_log_model on llama_7B_jeopardy.yml 15aabd2 unverified Viktorius Suwandi committed on May 29, 2023