streaming multipack for pretraining dataset (#959) 553c80f jinwonkim93 winglian committed on Jan 6, 2024
Attention mask and position id fixes for packing (#285) 2bb0b78 winglian committed on Aug 12, 2023