Model parameters: d_model 2048 ffw_size 8192 kv_size 128 n_heads 16 n_layers 28 Megatron-DeepSpeed/pretrain_gpt.py --tensor-model-parallel-size 1 --pipeline-model-parallel-size 1 --num-layers 28 --hidden-size 2048 --num-attention-heads 16 --kv-channels 128 --ffn-hidden-size 8192 --seq-length 2048 --max-position-embeddings 2048 --micro-batch-size 2 --global-batch-size 256 --train-samples 32_109_839 --vocab-file gpt2/vocab.json --merge-file gpt2/merges.txt --clip-grad 1.0 --kill-switch-path kill-switch-1b566b66b --bf16 --optimizer adam --adam-beta1 0.9 --adam-beta2 0.999 --adam-eps 1e-8 --lr 2e-4 --min-lr 2e-5 --lr-decay-style cosine --lr-decay-samples 32_109_839 --lr-warmup-samples 321_098 --clip-grad 1.0 --weight-decay 1e-1 --log-interval 10 --save-interval 1000 --eval-interval 1000 --eval-iters 100 --eval-only true --tensorboard-dir tensorboard_1b566b66b --tensorboard-queue-size 5 --log-timers-to-tensorboard --log-batch-size-to-tensorboard --log-validation-ppl-to-tensorboard --save lm1-1b5-66b --load lm1-1b5-66b --train-weighted-split-paths-path train12b.txt --valid-weighted-split-paths-path val.txt --data-impl mmap --deepspeed --deepspeed_config ds_configs/2806053.json --zero-stage 0 START 2806053: Fri Feb 3 16:30:36 EET 2023 0: 0: 0: ======================= ROCm System Management Interface ======================= 0: ================================= Concise Info ================================= 0: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 0: 0 45.0c 87.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 0: 1 51.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 0: 2 40.0c 90.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 0: 3 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 0: 4 40.0c 97.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 0: 5 49.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 0: 6 42.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 0: 7 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 0: ================================================================================ 0: ============================= End of ROCm SMI Log ============================== 15: 15: 15: ======================= ROCm System Management Interface ======================= 15: ================================= Concise Info ================================= 15: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 15: 0 46.0c 93.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 15: 1 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 15: 2 43.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 15: 3 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 15: 4 45.0c 84.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 15: 5 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 15: 6 36.0c 90.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 15: 7 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 15: ================================================================================ 15: ============================= End of ROCm SMI Log ============================== 5: 5: 5: ======================= ROCm System Management Interface ======================= 5: ================================= Concise Info ================================= 5: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 5: 0 42.0c 90.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 5: 1 49.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 5: 2 44.0c 91.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 5: 3 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 5: 4 43.0c 95.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 5: 5 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 5: 6 37.0c 92.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 5: 7 47.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 5: ================================================================================ 5: ============================= End of ROCm SMI Log ============================== 2: 2: 2: ======================= ROCm System Management Interface ======================= 2: ================================= Concise Info ================================= 2: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 2: 0 45.0c 98.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 2: 1 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 2: 2 39.0c 92.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 2: 3 47.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 2: 4 36.0c 87.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 2: 5 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 2: 6 42.0c 94.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 2: 7 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 2: ================================================================================ 2: ============================= End of ROCm SMI Log ============================== 7: 7: 7: ======================= ROCm System Management Interface ======================= 7: ================================= Concise Info ================================= 7: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 7: 0 40.0c 89.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 7: 1 49.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 7: 2 39.0c 87.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 7: 3 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 7: 4 43.0c 84.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 7: 5 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 7: 6 39.0c 90.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 7: 7 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 7: ================================================================================ 7: ============================= End of ROCm SMI Log ============================== 14: 14: 14: ======================= ROCm System Management Interface ======================= 14: ================================= Concise Info ================================= 14: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 14: 0 46.0c 89.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 14: 1 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 14: 2 39.0c 84.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 14: 3 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 14: 4 42.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 14: 5 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 14: 6 44.0c 82.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 14: 7 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 14: ================================================================================ 14: ============================= End of ROCm SMI Log ============================== 10: 10: 10: ======================= ROCm System Management Interface ======================= 10: ================================= Concise Info ================================= 10: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 10: 0 42.0c 86.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 10: 1 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 10: 2 41.0c 86.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 10: 3 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 10: 4 41.0c 92.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 10: 5 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 10: 6 41.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 10: 7 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 10: ================================================================================ 10: ============================= End of ROCm SMI Log ============================== 13: 13: 13: ======================= ROCm System Management Interface ======================= 13: ================================= Concise Info ================================= 13: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 13: 0 46.0c 91.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 13: 1 47.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 13: 2 43.0c 90.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 13: 3 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 13: 4 40.0c 91.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 13: 5 49.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 13: 6 41.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 13: 7 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 13: ================================================================================ 13: ============================= End of ROCm SMI Log ============================== 8: 8: 8: ======================= ROCm System Management Interface ======================= 8: ================================= Concise Info ================================= 8: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 8: 0 47.0c 96.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 8: 1 50.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 8: 2 45.0c 83.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 8: 3 47.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 8: 4 41.0c 87.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 8: 5 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 8: 6 39.0c 95.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 8: 7 38.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 8: ================================================================================ 8: ============================= End of ROCm SMI Log ============================== 12: 12: 12: ======================= ROCm System Management Interface ======================= 12: ================================= Concise Info ================================= 12: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 12: 0 43.0c 87.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 12: 1 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 12: 2 40.0c 93.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 12: 3 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 12: 4 45.0c 86.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 12: 5 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 12: 6 47.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 12: 7 47.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 12: ================================================================================ 12: ============================= End of ROCm SMI Log ============================== 4: 4: 4: ======================= ROCm System Management Interface ======================= 4: ================================= Concise Info ================================= 4: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 4: 0 41.0c 94.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 4: 1 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 4: 2 44.0c 90.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 4: 3 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 4: 4 40.0c 92.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 4: 5 48.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 4: 6 41.0c 89.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 4: 7 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 4: ================================================================================ 4: ============================= End of ROCm SMI Log ============================== 3: 3: 3: ======================= ROCm System Management Interface ======================= 3: ================================= Concise Info ================================= 3: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 3: 0 44.0c 91.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 3: 1 49.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 3: 2 40.0c 92.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 3: 3 40.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 3: 4 42.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 3: 5 41.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 3: 6 43.0c 86.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 3: 7 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 3: ================================================================================ 3: ============================= End of ROCm SMI Log ============================== 9: 9: 9: ======================= ROCm System Management Interface ======================= 9: ================================= Concise Info ================================= 9: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 9: 0 45.0c 95.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 9: 1 50.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 9: 2 38.0c 94.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 9: 3 41.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 9: 4 47.0c 90.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 9: 5 47.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 9: 6 33.0c 91.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 9: 7 41.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 9: ================================================================================ 9: ============================= End of ROCm SMI Log ============================== 6: 6: 6: ======================= ROCm System Management Interface ======================= 6: ================================= Concise Info ================================= 6: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 6: 0 45.0c 86.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 6: 1 48.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 6: 2 37.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 6: 3 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 6: 4 44.0c 82.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 6: 5 48.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 6: 6 39.0c 91.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 6: 7 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 6: ================================================================================ 6: ============================= End of ROCm SMI Log ============================== 11: 11: 11: ======================= ROCm System Management Interface ======================= 11: ================================= Concise Info ================================= 11: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 11: 0 48.0c 95.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 11: 1 48.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 11: 2 37.0c 89.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 11: 3 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 11: 4 43.0c 93.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 11: 5 48.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 11: 6 42.0c 85.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 11: 7 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 11: ================================================================================ 11: ============================= End of ROCm SMI Log ============================== 1: 1: 1: ======================= ROCm System Management Interface ======================= 1: ================================= Concise Info ================================= 1: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 1: 0 48.0c 93.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 1: 1 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 1: 2 42.0c 94.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 1: 3 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 1: 4 43.0c 81.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 1: 5 47.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 1: 6 41.0c 86.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 1: 7 42.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 1: ================================================================================ 1: ============================= End of ROCm SMI Log ============================== 1: Launching on nid006471 (1/16), master nid006470 port 9999, GPUs 8, CUDA: True 12: Launching on nid006482 (12/16), master nid006470 port 9999, GPUs 8, CUDA: True 6: Launching on nid006476 (6/16), master nid006470 port 9999, GPUs 8, CUDA: True 13: Launching on nid006483 (13/16), master nid006470 port 9999, GPUs 8, CUDA: True 5: Launching on nid006475 (5/16), master nid006470 port 9999, GPUs 8, CUDA: True 3: Launching on nid006473 (3/16), master nid006470 port 9999, GPUs 8, CUDA: True 10: Launching on nid006480 (10/16), master nid006470 port 9999, GPUs 8, CUDA: True 11: Launching on nid006481 (11/16), master nid006470 port 9999, GPUs 8, CUDA: True 2: Launching on nid006472 (2/16), master nid006470 port 9999, GPUs 8, CUDA: True 14: Launching on nid006484 (14/16), master nid006470 port 9999, GPUs 8, CUDA: True 4: Launching on nid006474 (4/16), master nid006470 port 9999, GPUs 8, CUDA: True 9: Launching on nid006479 (9/16), master nid006470 port 9999, GPUs 8, CUDA: True 8: Launching on nid006478 (8/16), master nid006470 port 9999, GPUs 8, CUDA: True 0: Launching on nid006470 (0/16), master nid006470 port 9999, GPUs 8, CUDA: True 7: Launching on nid006477 (7/16), master nid006470 port 9999, GPUs 8, CUDA: True 15: Launching on nid006485 (15/16), master nid006470 port 9999, GPUs 8, CUDA: True 0: using world size: 128, data-parallel-size: 128, tensor-model-parallel size: 1, pipeline-model-parallel size: 1 0: accumulate and all-reduce gradients in fp32 for bfloat16 data type. 0: using torch.bfloat16 for parameters ... 0: ------------------------ arguments ------------------------ 0: abort_on_unmet_fused_kernel_constraints ......... False 0: accumulate_allreduce_grads_in_fp32 .............. True 0: adam_beta1 ...................................... 0.9 0: adam_beta2 ...................................... 0.999 0: adam_eps ........................................ 1e-08 0: adlr_autoresume ................................. False 0: adlr_autoresume_interval ........................ 1000 0: apply_query_key_layer_scaling ................... True 0: apply_residual_connection_post_layernorm ........ False 0: attention_dropout ............................... 0.1 0: attention_softmax_in_fp32 ....................... False 0: bert_binary_head ................................ True 0: bert_load ....................................... None 0: bf16 ............................................ True 0: bias_dropout_fusion ............................. True 0: bias_gelu_fusion ................................ True 0: biencoder_projection_dim ........................ 0 0: biencoder_shared_query_context_model ............ False 0: block_data_path ................................. None 0: checkpoint_activations .......................... False 0: checkpoint_in_cpu ............................... False 0: checkpoint_num_layers ........................... 1 0: clip_grad ....................................... 1.0 0: codecarbon_dir .................................. None 0: consumed_train_samples .......................... 0 0: consumed_train_tokens ........................... 0 0: consumed_valid_samples .......................... 0 0: contigious_checkpointing ........................ False 0: cpu_optimizer ................................... False 0: cpu_torch_adam .................................. False 0: curriculum_learning ............................. False 0: data_impl ....................................... mmap 0: data_parallel_size .............................. 128 0: data_path ....................................... None 0: dataloader_type ................................. single 0: DDP_impl ........................................ local 0: decoder_seq_length .............................. None 0: deepscale ....................................... False 0: deepscale_config ................................ None 0: deepspeed ....................................... True 0: deepspeed_activation_checkpointing .............. False 0: deepspeed_config ................................ ds_configs/2806053.json 0: deepspeed_mpi ................................... False 0: distribute_checkpointed_activations ............. False 0: distributed_backend ............................. nccl 0: embed_layernorm ................................. False 0: embedding_path .................................. None 0: encoder_seq_length .............................. 2048 0: eod_mask_loss ................................... False 0: eval_interval ................................... 1000 0: eval_iters ...................................... 100 0: eval_only ....................................... True 0: evidence_data_path .............................. None 0: exit_duration_in_mins ........................... None 0: exit_interval ................................... None 0: ffn_hidden_size ................................. 8192 0: finetune ........................................ False 0: fp16 ............................................ False 0: fp16_lm_cross_entropy ........................... False 0: fp32_residual_connection ........................ False 0: gigaflos_no_embeds .............................. 0 0: global_batch_size ............................... 256 0: glu_activation .................................. None 0: hidden_dropout .................................. 0.1 0: hidden_size ..................................... 2048 0: hysteresis ...................................... 2 0: ict_head_size ................................... None 0: ict_load ........................................ None 0: img_dim ......................................... 224 0: indexer_batch_size .............................. 128 0: indexer_log_interval ............................ 1000 0: inference ....................................... False 0: init_method_std ................................. 0.02 0: init_method_xavier_uniform ...................... False 0: initial_loss_scale .............................. 4294967296 0: kill_switch_path ................................ kill-switch-1b566b66b 0: kv_channels ..................................... 128 0: layer_norm_fusion ............................... True 0: layernorm_epsilon ............................... 1e-05 0: lazy_mpu_init ................................... None 0: load ............................................ lm1-1b5-66b 0: local_rank ...................................... None 0: log_batch_size_to_tensorboard ................... True 0: log_interval .................................... 10 0: log_learning_rate_to_tensorboard ................ True 0: log_level ....................................... None 0: log_level_replica ............................... None 0: log_loss_scale_to_tensorboard ................... True 0: log_num_zeros_in_grad ........................... False 0: log_params_norm ................................. False 0: log_path ........................................ None 0: log_timers_to_tensorboard ....................... True 0: log_validation_ppl_to_tensorboard ............... True 0: loss_on_targets_only ............................ False 0: loss_scale ...................................... None 0: loss_scale_window ............................... 1000 0: lr .............................................. 0.0002 0: lr_decay_iters .................................. None 0: lr_decay_samples ................................ 32109839 0: lr_decay_style .................................. cosine 0: lr_decay_tokens ................................. None 0: lr_warmup_fraction .............................. None 0: lr_warmup_iters ................................. 0 0: lr_warmup_samples ............................... 321098 0: make_vocab_size_divisible_by .................... 128 0: mask_prob ....................................... 0.15 0: masked_softmax_fusion ........................... True 0: max_position_embeddings ......................... 2048 0: mean_noise_span_length .......................... None 0: memory_centric_tiled_linear ..................... False 0: merge_file ...................................... gpt2/merges.txt 0: micro_batch_size ................................ 2 0: min_loss_scale .................................. 1.0 0: min_lr .......................................... 2e-05 0: mmap_warmup ..................................... False 0: no_load_optim ................................... None 0: no_load_rng ..................................... None 0: no_save_optim ................................... None 0: no_save_rng ..................................... None 0: noise_density ................................... None 0: num_attention_heads ............................. 16 0: num_channels .................................... 3 0: num_classes ..................................... 1000 0: num_layers ...................................... 28 0: num_layers_per_virtual_pipeline_stage ........... None 0: num_workers ..................................... 2 0: onnx_safe ....................................... None 0: openai_gelu ..................................... False 0: optimizer ....................................... adam 0: optimizer_fusion ................................ True 0: override_lr_scheduler ........................... False 0: pad_vocab_size_to ............................... None 0: params_dtype .................................... torch.bfloat16 0: partition_activations ........................... False 0: patch_dim ....................................... 16 0: pipeline_model_parallel_size .................... 1 0: position_embedding_type ......................... PositionEmbeddingType.absolute 0: pp_partition_method ............................. None 0: profile_backward ................................ False 0: query_in_block_prob ............................. 0.1 0: rampup_batch_size ............................... None 0: rank ............................................ 0 0: remote_device ................................... none 0: reset_attention_mask ............................ False 0: reset_position_ids .............................. False 0: retriever_report_topk_accuracies ................ [] 0: retriever_score_scaling ......................... False 0: retriever_seq_length ............................ 256 0: reweight_loss_based_on_position_frequency ....... False 0: sample_rate ..................................... 1.0 0: save ............................................ lm1-1b5-66b 0: save_interval ................................... 1000 0: scatter_gather_tensors_in_pipeline .............. True 0: scattered_embeddings ............................ False 0: seed ............................................ 1234 0: seq_length ...................................... 2048 0: sgd_momentum .................................... 0.9 0: short_seq_prob .................................. 0.1 0: skip_train_iteration_range ...................... None 0: split ........................................... None 0: split_transformers .............................. False 0: sync_tp_duplicated_parameters ................... False 0: synchronize_each_layer .......................... False 0: tensor_model_parallel_size ...................... 1 0: tensorboard_dir ................................. tensorboard_1b566b66b 0: tensorboard_log_interval ........................ 1 0: tensorboard_queue_size .......................... 5 0: test_weighted_split_paths ....................... None 0: test_weighted_split_paths_path .................. None 0: tile_factor ..................................... 1 0: titles_data_path ................................ None 0: tokenizer_name_or_path .......................... None 0: tokenizer_type .................................. GPT2BPETokenizer 0: train_iters ..................................... None 0: train_samples ................................... 32109839 0: train_tokens .................................... None 0: train_weighted_split_names ...................... ['train'] 0: train_weighted_split_paths ...................... [['/scratch/project_462000119/data/c4_subsampled/gpt2tok_c4_en_12B_text_document']] 0: train_weighted_split_paths_path ................. None 0: train_weighted_split_splits ..................... [['0:1']] 0: train_weighted_split_weights .................... [['1.0']] 0: universal_checkpoint ............................ False 0: use_bnb_optimizer ............................... False 0: use_checkpoint_lr_scheduler ..................... False 0: use_contiguous_buffers_in_ddp ................... True 0: use_cpu_initialization .......................... None 0: use_one_sent_docs ............................... False 0: use_pin_memory .................................. False 0: valid_num_workers ............................... 2 0: valid_weighted_split_names ...................... ['validation'] 0: valid_weighted_split_paths ...................... [['/scratch/project_462000119/data/c4_validation/gpt2tok_c4validation_rerun_text_document']] 0: valid_weighted_split_paths_path ................. None 0: valid_weighted_split_splits ..................... [['0:1']] 0: valid_weighted_split_weights .................... [['1.0']] 0: virtual_pipeline_model_parallel_size ............ None 0: vocab_extra_ids ................................. 0 0: vocab_file ...................................... gpt2/vocab.json 0: weight_decay .................................... 0.1 0: world_size ...................................... 128 0: zero_allgather_bucket_size ...................... 0.0 0: zero_contigious_gradients ....................... False 0: zero_reduce_bucket_size ......................... 0.0 0: zero_reduce_scatter ............................. False 0: zero_stage ...................................... 0 0: -------------------- end of arguments --------------------- 0: setting number of micro-batches to constant 1 0: > building GPT2BPETokenizer tokenizer ... 0: > padded vocab (size: 50257) with 47 dummy tokens (new size: 50304) 0: DeepSpeed general environment info: 0: torch install path ............... ['/pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/venv/lib/python3.9/site-packages/torch'] 0: torch version .................... 1.13.0+rocm5.2 0: torch cuda version ............... None 0: torch hip version ................ 5.2.21151-afdc89f8 0: nvcc version ..................... None 0: deepspeed install path ........... ['/pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/venv/lib/python3.9/site-packages/deepspeed'] 0: deepspeed info ................... 0.7.5, unknown, unknown 0: deepspeed wheel compiled w. ...... torch 1.13, hip 5.1 0: **** Git info for Megatron: git_hash=unknown git_branch=unknown **** 0: > initializing torch distributed ... 0: [2023-02-03 16:34:09,339] [INFO] [comm.py:633:init_distributed] Initializing TorchBackend in DeepSpeed with backend nccl 15: > setting tensorboard ... 0: > initializing tensor model parallel with size 1 0: > initializing pipeline model parallel with size 1 0: > setting random seeds to 1234 ... 0: > initializing model parallel cuda seeds on global rank 0, model parallel rank 0, and data parallel rank 0 with model parallel seed: 3952 and data parallel seed: 1234 0: > compiling dataset index builder ... 0: make: Entering directory '/pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/data' 0: make: Nothing to be done for 'default'. 0: make: Leaving directory '/pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/data' 0: >>> done with dataset index builder. Compilation time: 0.134 seconds 0: > compiling and loading fused kernels ... 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax.cpp -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax_hip.cpp [skipped, already hipified] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax_hip.h [skipped, already hipified] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/compat.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/compat.h [skipped, no changes] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/type_shim.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/type_shim.h [skipped, no changes] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax_cuda.cu -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax_hip.hip [skipped, already hipified] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/type_shim.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/type_shim.h [skipped, no changes] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/compat.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/compat.h [skipped, no changes] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax_hip.h [skipped, already hipified] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_masked_softmax.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_masked_softmax_hip.h [skipped, already hipified] 0: Total number of unsupported CUDA function calls: 0 0: 0: 0: Total number of replaced kernel launches: 87 0: ninja: no work to do. 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_masked_softmax.cpp -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_masked_softmax_hip.cpp [skipped, already hipified] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_masked_softmax_cuda.cu -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_masked_softmax_hip.hip [skipped, already hipified] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/type_shim.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/type_shim.h [skipped, no changes] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/compat.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/compat.h [skipped, no changes] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax_hip.h [skipped, already hipified] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_masked_softmax.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_masked_softmax_hip.h [skipped, already hipified] 0: Total number of unsupported CUDA function calls: 0 0: 0: 0: Total number of replaced kernel launches: 63 0: ninja: no work to do. 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/layer_norm_cuda.cpp -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/layer_norm_cuda.cpp [skipped, no changes] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/layer_norm_cuda_kernel.cu -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/layer_norm_hip_kernel.hip [skipped, already hipified] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/type_shim.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/type_shim.h [skipped, no changes] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/compat.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/compat.h [skipped, no changes] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax_hip.h [skipped, already hipified] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_masked_softmax.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_masked_softmax_hip.h [skipped, already hipified] 0: Total number of unsupported CUDA function calls: 0 0: 0: 0: Total number of replaced kernel launches: 67 0: ninja: no work to do. 0: >>> done with compiling and loading fused kernels. Compilation time: 26.782 seconds 0: time to initialize megatron (seconds): 1.784 0: [after megatron is initialized] datetime: 2023-02-03 16:34:41 0: building GPT model ... 0: [2023-02-03 16:34:41,919] [INFO] [utils.py:827:see_memory_usage] Before Building Model 0: [2023-02-03 16:34:41,919] [INFO] [utils.py:828:see_memory_usage] MA 0.0 GB Max_MA 0.0 GB CA 0.0 GB Max_CA 0 GB 0: [2023-02-03 16:34:41,920] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 42.47 GB, percent = 8.4% 0: SEED_LAYERS=False BASE_SEED=1234 SEED_FN=None 0: Using topology: {ProcessCoord(pipe=0, data=0, model=0): 0, ProcessCoord(pipe=0, data=1, model=0): 1, ProcessCoord(pipe=0, data=2, model=0): 2, ProcessCoord(pipe=0, data=3, model=0): 3, ProcessCoord(pipe=0, data=4, model=0): 4, ProcessCoord(pipe=0, data=5, model=0): 5, ProcessCoord(pipe=0, data=6, model=0): 6, ProcessCoord(pipe=0, data=7, model=0): 7, ProcessCoord(pipe=0, data=8, model=0): 8, ProcessCoord(pipe=0, data=9, model=0): 9, ProcessCoord(pipe=0, data=10, model=0): 10, ProcessCoord(pipe=0, data=11, model=0): 11, ProcessCoord(pipe=0, data=12, model=0): 12, ProcessCoord(pipe=0, data=13, model=0): 13, ProcessCoord(pipe=0, data=14, model=0): 14, ProcessCoord(pipe=0, data=15, model=0): 15, ProcessCoord(pipe=0, data=16, model=0): 16, ProcessCoord(pipe=0, data=17, model=0): 17, ProcessCoord(pipe=0, data=18, model=0): 18, ProcessCoord(pipe=0, data=19, model=0): 19, ProcessCoord(pipe=0, data=20, model=0): 20, ProcessCoord(pipe=0, data=21, model=0): 21, ProcessCoord(pipe=0, data=22, model=0): 22, ProcessCoord(pi 0: pe=0, data=23, model=0): 23, ProcessCoord(pipe=0, data=24, model=0): 24, ProcessCoord(pipe=0, data=25, model=0): 25, ProcessCoord(pipe=0, data=26, model=0): 26, ProcessCoord(pipe=0, data=27, model=0): 27, ProcessCoord(pipe=0, data=28, model=0): 28, ProcessCoord(pipe=0, data=29, model=0): 29, ProcessCoord(pipe=0, data=30, model=0): 30, ProcessCoord(pipe=0, data=31, model=0): 31, ProcessCoord(pipe=0, data=32, model=0): 32, ProcessCoord(pipe=0, data=33, model=0): 33, ProcessCoord(pipe=0, data=34, model=0): 34, ProcessCoord(pipe=0, data=35, model=0): 35, ProcessCoord(pipe=0, data=36, model=0): 36, ProcessCoord(pipe=0, data=37, model=0): 37, ProcessCoord(pipe=0, data=38, model=0): 38, ProcessCoord(pipe=0, data=39, model=0): 39, ProcessCoord(pipe=0, data=40, model=0): 40, ProcessCoord(pipe=0, data=41, model=0): 41, ProcessCoord(pipe=0, data=42, model=0): 42, ProcessCoord(pipe=0, data=43, model=0): 43, ProcessCoord(pipe=0, data=44, model=0): 44, ProcessCoord(pipe=0, data=45, model=0): 45, ProcessCoord(pipe=0, data=4 0: 6, model=0): 46, ProcessCoord(pipe=0, data=47, model=0): 47, ProcessCoord(pipe=0, data=48, model=0): 48, ProcessCoord(pipe=0, data=49, model=0): 49, ProcessCoord(pipe=0, data=50, model=0): 50, ProcessCoord(pipe=0, data=51, model=0): 51, ProcessCoord(pipe=0, data=52, model=0): 52, ProcessCoord(pipe=0, data=53, model=0): 53, ProcessCoord(pipe=0, data=54, model=0): 54, ProcessCoord(pipe=0, data=55, model=0): 55, ProcessCoord(pipe=0, data=56, model=0): 56, ProcessCoord(pipe=0, data=57, model=0): 57, ProcessCoord(pipe=0, data=58, model=0): 58, ProcessCoord(pipe=0, data=59, model=0): 59, ProcessCoord(pipe=0, data=60, model=0): 60, ProcessCoord(pipe=0, data=61, model=0): 61, ProcessCoord(pipe=0, data=62, model=0): 62, ProcessCoord(pipe=0, data=63, model=0): 63, ProcessCoord(pipe=0, data=64, model=0): 64, ProcessCoord(pipe=0, data=65, model=0): 65, ProcessCoord(pipe=0, data=66, model=0): 66, ProcessCoord(pipe=0, data=67, model=0): 67, ProcessCoord(pipe=0, data=68, model=0): 68, ProcessCoord(pipe=0, data=69, model=0): 0: 69, ProcessCoord(pipe=0, data=70, model=0): 70, ProcessCoord(pipe=0, data=71, model=0): 71, ProcessCoord(pipe=0, data=72, model=0): 72, ProcessCoord(pipe=0, data=73, model=0): 73, ProcessCoord(pipe=0, data=74, model=0): 74, ProcessCoord(pipe=0, data=75, model=0): 75, ProcessCoord(pipe=0, data=76, model=0): 76, ProcessCoord(pipe=0, data=77, model=0): 77, ProcessCoord(pipe=0, data=78, model=0): 78, ProcessCoord(pipe=0, data=79, model=0): 79, ProcessCoord(pipe=0, data=80, model=0): 80, ProcessCoord(pipe=0, data=81, model=0): 81, ProcessCoord(pipe=0, data=82, model=0): 82, ProcessCoord(pipe=0, data=83, model=0): 83, ProcessCoord(pipe=0, data=84, model=0): 84, ProcessCoord(pipe=0, data=85, model=0): 85, ProcessCoord(pipe=0, data=86, model=0): 86, ProcessCoord(pipe=0, data=87, model=0): 87, ProcessCoord(pipe=0, data=88, model=0): 88, ProcessCoord(pipe=0, data=89, model=0): 89, ProcessCoord(pipe=0, data=90, model=0): 90, ProcessCoord(pipe=0, data=91, model=0): 91, ProcessCoord(pipe=0, data=92, model=0): 92, Process 0: Coord(pipe=0, data=93, model=0): 93, ProcessCoord(pipe=0, data=94, model=0): 94, ProcessCoord(pipe=0, data=95, model=0): 95, ProcessCoord(pipe=0, data=96, model=0): 96, ProcessCoord(pipe=0, data=97, model=0): 97, ProcessCoord(pipe=0, data=98, model=0): 98, ProcessCoord(pipe=0, data=99, model=0): 99, ProcessCoord(pipe=0, data=100, model=0): 100, ProcessCoord(pipe=0, data=101, model=0): 101, ProcessCoord(pipe=0, data=102, model=0): 102, ProcessCoord(pipe=0, data=103, model=0): 103, ProcessCoord(pipe=0, data=104, model=0): 104, ProcessCoord(pipe=0, data=105, model=0): 105, ProcessCoord(pipe=0, data=106, model=0): 106, ProcessCoord(pipe=0, data=107, model=0): 107, ProcessCoord(pipe=0, data=108, model=0): 108, ProcessCoord(pipe=0, data=109, model=0): 109, ProcessCoord(pipe=0, data=110, model=0): 110, ProcessCoord(pipe=0, data=111, model=0): 111, ProcessCoord(pipe=0, data=112, model=0): 112, ProcessCoord(pipe=0, data=113, model=0): 113, ProcessCoord(pipe=0, data=114, model=0): 114, ProcessCoord(pipe=0, data=115, mo 0: del=0): 115, ProcessCoord(pipe=0, data=116, model=0): 116, ProcessCoord(pipe=0, data=117, model=0): 117, ProcessCoord(pipe=0, data=118, model=0): 118, ProcessCoord(pipe=0, data=119, model=0): 119, ProcessCoord(pipe=0, data=120, model=0): 120, ProcessCoord(pipe=0, data=121, model=0): 121, ProcessCoord(pipe=0, data=122, model=0): 122, ProcessCoord(pipe=0, data=123, model=0): 123, ProcessCoord(pipe=0, data=124, model=0): 124, ProcessCoord(pipe=0, data=125, model=0): 125, ProcessCoord(pipe=0, data=126, model=0): 126, ProcessCoord(pipe=0, data=127, model=0): 127} 0: [2023-02-03 16:34:46,031] [INFO] [module.py:366:_partition_layers] Partitioning pipeline stages with method type:transformer 0: stage=0 layers=35 0: 0: _to_float16 0: 1: EmbeddingPipe 0: 2: 0: 3: ParallelTransformerLayerPipe 0: 4: ParallelTransformerLayerPipe 0: 5: ParallelTransformerLayerPipe 0: 6: ParallelTransformerLayerPipe 0: 7: ParallelTransformerLayerPipe 0: 8: ParallelTransformerLayerPipe 0: 9: ParallelTransformerLayerPipe 0: 10: ParallelTransformerLayerPipe 0: 11: ParallelTransformerLayerPipe 0: 12: ParallelTransformerLayerPipe 0: 13: ParallelTransformerLayerPipe 0: 14: ParallelTransformerLayerPipe 0: 15: ParallelTransformerLayerPipe 0: 16: ParallelTransformerLayerPipe 0: 17: ParallelTransformerLayerPipe 0: 18: ParallelTransformerLayerPipe 0: 19: ParallelTransformerLayerPipe 0: 20: ParallelTransformerLayerPipe 0: 21: ParallelTransformerLayerPipe 0: 22: ParallelTransformerLayerPipe 0: 23: ParallelTransformerLayerPipe 0: 24: ParallelTransformerLayerPipe 0: 25: ParallelTransformerLayerPipe 0: 26: ParallelTransformerLayerPipe 0: 27: ParallelTransformerLayerPipe 0: 28: ParallelTransformerLayerPipe 0: 29: ParallelTransformerLayerPipe 0: 30: ParallelTransformerLayerPipe 0: 31: undo 0: 32: MixedFusedLayerNorm 0: 33: EmbeddingPipe 0: 34: float16_to_fp32 0: loss: CrossEntropy 0: [2023-02-03 16:34:46,556] [INFO] [utils.py:827:see_memory_usage] After Building Model 0: [2023-02-03 16:34:46,556] [INFO] [utils.py:828:see_memory_usage] MA 2.83 GB Max_MA 2.83 GB CA 2.89 GB Max_CA 3 GB 0: [2023-02-03 16:34:46,556] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 42.51 GB, percent = 8.4% 0: setting training iterations to 125429 0: > learning rate decay style: cosine 0: DeepSpeed is enabled. 0: [2023-02-03 16:34:46,559] [INFO] [logging.py:68:log_dist] [Rank 0] DeepSpeed info: version=0.7.5, git-hash=unknown, git-branch=unknown 0: [2023-02-03 16:35:02,167] [INFO] [logging.py:68:log_dist] [Rank 0] DeepSpeed Flops Profiler Enabled: False 0: [2023-02-03 16:35:02,167] [INFO] [logging.py:68:log_dist] [Rank 0] Removing param_group that has no 'params' in the client Optimizer 0: [2023-02-03 16:35:02,167] [INFO] [logging.py:68:log_dist] [Rank 0] Using client Optimizer as basic optimizer 0: [2023-02-03 16:35:02,181] [INFO] [logging.py:68:log_dist] [Rank 0] DeepSpeed Basic Optimizer = FusedAdam 0: [2023-02-03 16:35:02,181] [INFO] [logging.py:68:log_dist] [Rank 0] Creating BF16 optimizer 0: [2023-02-03 16:35:02,300] [INFO] [utils.py:827:see_memory_usage] begin bf16_optimizer 0: [2023-02-03 16:35:02,301] [INFO] [utils.py:828:see_memory_usage] MA 2.83 GB Max_MA 2.84 GB CA 2.91 GB Max_CA 3 GB 0: [2023-02-03 16:35:02,301] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 43.19 GB, percent = 8.6% 5: ninja: no work to do. 5: Time to load utils op: 0.3095254898071289 seconds 0: ninja: no work to do. 0: Time to load utils op: 0.15243744850158691 seconds 12: Time to load utils op: 0.412311315536499 seconds 10: Time to load utils op: 0.415149450302124 seconds 14: Time to load utils op: 0.4223041534423828 seconds 9: Time to load utils op: 0.4232616424560547 seconds 5: Time to load utils op: 0.0006165504455566406 seconds 0: Time to load utils op: 0.20262718200683594 secondsTime to load utils op: 0.20243000984191895 seconds 0: 0: Time to load utils op: 0.20253324508666992 seconds 0: Time to load utils op: 0.2027912139892578 seconds 0: Time to load utils op: 0.20321440696716309 seconds 0: Time to load utils op: 0.20301508903503418 seconds 5: Time to load utils op: 0.20300817489624023 seconds 5: Time to load utils op: 0.2021784782409668 seconds 5: Time to load utils op: 0.20232844352722168 secondsTime to load utils op: 0.20239472389221191 secondsTime to load utils op: 0.20238757133483887 seconds 5: 5: 5: Time to load utils op: 0.2015829086303711 seconds 5: Time to load utils op: 0.20223116874694824 seconds 9: Time to load utils op: 0.20267486572265625 seconds 9: Time to load utils op: 0.20230531692504883 seconds 10: Time to load utils op: 0.20404863357543945 seconds 10: Time to load utils op: 0.20380759239196777 seconds 10: Time to load utils op: 0.20426225662231445 seconds 10: Time to load utils op: 0.20433425903320312 seconds 10: Time to load utils op: 0.20470571517944336 seconds 10: Time to load utils op: 0.2047426700592041 seconds 10: Time to load utils op: 0.20454645156860352 seconds 12: Time to load utils op: 0.20375823974609375 seconds 12: Time to load utils op: 0.20379996299743652 seconds 12: Time to load utils op: 0.20310425758361816 seconds 12: Time to load utils op: 0.20345592498779297 seconds 12: Time to load utils op: 0.2035808563232422 seconds 12: Time to load utils op: 0.20393037796020508 seconds 12: Time to load utils op: 0.2042527198791504 seconds 2: Time to load utils op: 0.2161879539489746 seconds 2: Time to load utils op: 0.21582937240600586 seconds 2: Time to load utils op: 0.21606040000915527 secondsTime to load utils op: 0.21561145782470703 seconds 2: 2: Time to load utils op: 0.2155742645263672 secondsTime to load utils op: 0.21556663513183594 secondsTime to load utils op: 0.21559429168701172 seconds 2: 2: 2: Time to load utils op: 0.21558809280395508 seconds 4: Time to load utils op: 0.2143557071685791 seconds 4: Time to load utils op: 0.21413207054138184 secondsTime to load utils op: 0.2144615650177002 seconds 4: 4: Time to load utils op: 0.21442103385925293 seconds 4: Time to load utils op: 0.21386289596557617 secondsTime to load utils op: 0.2144005298614502 seconds 4: 4: Time to load utils op: 0.21422219276428223 seconds 9: Time to load utils op: 0.202376127243042 seconds 1: Time to load utils op: 0.2147819995880127 seconds 1: Time to load utils op: 0.21485328674316406 seconds 1: Time to load utils op: 0.2148265838623047 seconds 1: Time to load utils op: 0.2148733139038086 secondsTime to load utils op: 0.21485567092895508 seconds 1: 1: Time to load utils op: 0.214890718460083 seconds 1: Time to load utils op: 0.21486115455627441 seconds 1: Time to load utils op: 0.21489977836608887 seconds 8: Time to load utils op: 0.21212983131408691 seconds 9: Time to load utils op: 0.20235753059387207 seconds 8: Time to load utils op: 0.21243023872375488 seconds 8: Time to load utils op: 0.2126293182373047 seconds 8: Time to load utils op: 0.21163105964660645 seconds 8: Time to load utils op: 0.21200013160705566 secondsTime to load utils op: 0.21156024932861328 seconds 8: 8: Time to load utils op: 0.2121260166168213 seconds 9: Time to load utils op: 0.20264554023742676 seconds 14: Time to load utils op: 0.20456361770629883 seconds 14: Time to load utils op: 0.20467495918273926 seconds 14: Time to load utils op: 0.20409941673278809 seconds 14: Time to load utils op: 0.20431280136108398 seconds 14: Time to load utils op: 0.20499491691589355 seconds 7: Time to load utils op: 0.21380400657653809 secondsTime to load utils op: 0.21379661560058594 seconds 7: 7: Time to load utils op: 0.21355056762695312 seconds 14: Time to load utils op: 0.2051856517791748 seconds 7: Time to load utils op: 0.21383905410766602 secondsTime to load utils op: 0.2137308120727539 seconds 7: 7: Time to load utils op: 0.2134559154510498 seconds 7: Time to load utils op: 0.21369194984436035 seconds 14: Time to load utils op: 0.20474553108215332 seconds 6: Time to load utils op: 0.21074247360229492 seconds 6: Time to load utils op: 0.21074652671813965 seconds 6: Time to load utils op: 0.21073198318481445 secondsTime to load utils op: 0.21074461936950684 seconds 6: 6: Time to load utils op: 0.21076297760009766 secondsTime to load utils op: 0.21081209182739258 seconds 6: 6: Time to load utils op: 0.2108016014099121 seconds 6: Time to load utils op: 0.21077823638916016 seconds 9: Time to load utils op: 0.20248198509216309 seconds 3: Time to load utils op: 0.21550703048706055 seconds 3: Time to load utils op: 0.21553659439086914 seconds 3: Time to load utils op: 0.2155771255493164 secondsTime to load utils op: 0.21555876731872559 seconds 3: 3: Time to load utils op: 0.21558380126953125 seconds 3: Time to load utils op: 0.21558642387390137 seconds 3: Time to load utils op: 0.21559691429138184 seconds 3: Time to load utils op: 0.2156071662902832 seconds 0: Time to load utils op: 0.404590368270874 seconds 5: Time to load utils op: 0.0004279613494873047 seconds 5: Time to load utils op: 0.0003864765167236328 seconds 11: Time to load utils op: 0.21105718612670898 seconds 11: Time to load utils op: 0.2110743522644043 seconds 5: Time to load utils op: 0.0003714561462402344 seconds 11: Time to load utils op: 0.21107697486877441 seconds 11: Time to load utils op: 0.21110081672668457 secondsTime to load utils op: 0.2111048698425293 seconds 11: 11: Time to load utils op: 0.21111512184143066 secondsTime to load utils op: 0.2111072540283203 seconds 11: 11: Time to load utils op: 0.2111222743988037 seconds 5: Time to load utils op: 0.00037980079650878906 seconds 5: Time to load utils op: 0.0004086494445800781 seconds 5: Time to load utils op: 0.0003833770751953125 seconds 5: Time to load utils op: 0.0003228187561035156 seconds 13: Time to load utils op: 0.2108001708984375 seconds 13: Time to load utils op: 0.2108325958251953 seconds 13: Time to load utils op: 0.21080946922302246 seconds 13: Time to load utils op: 0.21081185340881348 seconds 13: Time to load utils op: 0.21081137657165527 seconds 13: Time to load utils op: 0.21086549758911133 seconds 13: Time to load utils op: 0.21082615852355957 seconds 13: Time to load utils op: 0.20865511894226074 seconds 9: Time to load utils op: 0.2023763656616211 seconds 15: Time to load utils op: 0.2113208770751953 secondsTime to load utils op: 0.2111356258392334 seconds 15: Time to load utils op: 0.21113324165344238 secondsTime to load utils op: 0.21114063262939453 secondsTime to load utils op: 0.21114063262939453 seconds 15: 15: 15: 15: Time to load utils op: 0.2112293243408203 seconds 15: Time to load utils op: 0.21112942695617676 seconds 15: Time to load utils op: 0.21123266220092773 seconds 4: Time to load utils op: 0.5043551921844482 seconds 8: Time to load utils op: 0.5047247409820557 seconds 7: Time to load utils op: 0.504511833190918 seconds 0: Time to load utils op: 0.0011315345764160156 seconds 0: Time to load utils op: 0.0011548995971679688 seconds 0: Time to load utils op: 0.0009396076202392578 seconds 0: Time to load utils op: 0.0012633800506591797 seconds 0: Time to load utils op: 0.0013108253479003906 seconds 0: Time to load utils op: 0.0013012886047363281 seconds 0: Time to load utils op: 0.0013573169708251953 seconds 12: Time to load utils op: 0.0005104541778564453 seconds 14: Time to load utils op: 0.0005271434783935547 seconds 12: Time to load utils op: 0.00039649009704589844 seconds 12: Time to load utils op: 0.0004107952117919922 seconds 14: Time to load utils op: 0.0005006790161132812 seconds 12: Time to load utils op: 0.0005171298980712891 seconds 14: Time to load utils op: 0.0006289482116699219 secondsTime to load utils op: 0.0006003379821777344 seconds 14: 12: Time to load utils op: 0.0005183219909667969 seconds 14: Time to load utils op: 0.0005450248718261719 seconds 14: Time to load utils op: 0.0005736351013183594 secondsTime to load utils op: 0.000545501708984375 seconds 14: 12: Time to load utils op: 0.0005409717559814453 seconds 14: Time to load utils op: 0.0005960464477539062 seconds 12: Time to load utils op: 0.0006630420684814453 seconds 12: Time to load utils op: 0.0006220340728759766 seconds 10: Time to load utils op: 0.0004673004150390625 seconds 10: Time to load utils op: 0.0005469322204589844 seconds 10: Time to load utils op: 0.0004248619079589844 seconds 10: Time to load utils op: 0.00044345855712890625 secondsTime to load utils op: 0.0005159378051757812 secondsTime to load utils op: 0.0004901885986328125 seconds 10: Time to load utils op: 0.00045180320739746094 seconds 10: 10: 10: Time to load utils op: 0.0004286766052246094 seconds 8: Time to load utils op: 0.0005450248718261719 seconds 8: Time to load utils op: 0.00055694580078125 seconds 8: Time to load utils op: 0.0005807876586914062 seconds 8: Time to load utils op: 0.0005671977996826172 seconds 9: Time to load utils op: 0.0005738735198974609 seconds 8: Time to load utils op: 0.0005643367767333984 secondsTime to load utils op: 0.00054931640625 seconds 8: 8: Time to load utils op: 0.0006070137023925781 seconds 8: Time to load utils op: 0.0005552768707275391 seconds 9: Time to load utils op: 0.00037097930908203125 seconds 9: Time to load utils op: 0.00041961669921875 seconds 9: Time to load utils op: 0.00047135353088378906 seconds 9: Time to load utils op: 0.0004298686981201172 seconds 9: Time to load utils op: 0.00044226646423339844 seconds 9: Time to load utils op: 0.00047135353088378906 seconds 9: Time to load utils op: 0.0004382133483886719 seconds 2: Time to load utils op: 0.0010559558868408203 seconds 2: Time to load utils op: 0.0009367465972900391 seconds 2: Time to load utils op: 0.0011789798736572266 seconds 2: Time to load utils op: 0.001384735107421875 seconds 2: Time to load utils op: 0.001348257064819336 seconds 2: Time to load utils op: 0.001285552978515625 secondsTime to load utils op: 0.0013680458068847656 seconds 2: 2: Time to load utils op: 0.0013551712036132812 seconds 1: Time to load utils op: 0.0007004737854003906 seconds 0: [2023-02-03 16:35:02,830] [INFO] [utils.py:827:see_memory_usage] before initializing group 0 15: Time to load utils op: 0.0011200904846191406 seconds 15: Time to load utils op: 0.000982522964477539 seconds 1: Time to load utils op: 0.0010173320770263672 seconds 1: Time to load utils op: 0.0009872913360595703 seconds 1: Time to load utils op: 0.0009927749633789062 seconds 11: Time to load utils op: 0.0007414817810058594 seconds 0: [2023-02-03 16:35:02,831] [INFO] [utils.py:828:see_memory_usage] MA 2.83 GB Max_MA 2.83 GB CA 2.91 GB Max_CA 3 GB 15: Time to load utils op: 0.0012362003326416016 seconds 1: Time to load utils op: 0.0012154579162597656 secondsTime to load utils op: 0.0011868476867675781 seconds 1: 1: Time to load utils op: 0.0012159347534179688 seconds 0: [2023-02-03 16:35:02,831] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 43.35 GB, percent = 8.6% 1: Time to load utils op: 0.0012619495391845703 seconds 13: Time to load utils op: 0.00047135353088378906 seconds 15: Time to load utils op: 0.001402139663696289 secondsTime to load utils op: 0.001318216323852539 seconds 15: Time to load utils op: 0.001344919204711914 seconds 15: 11: Time to load utils op: 0.0010838508605957031 seconds 11: Time to load utils op: 0.0011184215545654297 seconds 15: Time to load utils op: 0.001331329345703125 seconds 11: Time to load utils op: 0.0011165142059326172 seconds 15: Time to load utils op: 0.0014553070068359375 seconds 13: Time to load utils op: 0.00042939186096191406 secondsTime to load utils op: 0.0004055500030517578 seconds 13: 13: Time to load utils op: 0.0004937648773193359 seconds 13: Time to load utils op: 0.0003943443298339844 seconds 13: Time to load utils op: 0.0003936290740966797 seconds 11: Time to load utils op: 0.0012593269348144531 seconds 13: Time to load utils op: 0.00049591064453125 seconds 11: Time to load utils op: 0.0013015270233154297 seconds 11: Time to load utils op: 0.0012106895446777344 seconds 6: Time to load utils op: 0.0007855892181396484 seconds 13: Time to load utils op: 0.00041985511779785156 seconds 11: Time to load utils op: 0.0013027191162109375 seconds 7: Time to load utils op: 0.0004980564117431641 seconds 4: Time to load utils op: 0.00047397613525390625 seconds 7: Time to load utils op: 0.0006029605865478516 seconds 7: Time to load utils op: 0.0006351470947265625 seconds 7: Time to load utils op: 0.0006284713745117188 secondsTime to load utils op: 0.0005948543548583984 secondsTime to load utils op: 0.0005803108215332031 seconds 7: 7: 4: Time to load utils op: 0.00045299530029296875 secondsTime to load utils op: 0.00044655799865722656 secondsTime to load utils op: 0.0004260540008544922 seconds 4: 4: Time to load utils op: 0.0004436969757080078 seconds 4: 4: Time to load utils op: 0.0004360675811767578 seconds 4: Time to load utils op: 0.0004069805145263672 seconds 7: Time to load utils op: 0.0007097721099853516 seconds 7: Time to load utils op: 0.0007207393646240234 seconds 4: Time to load utils op: 0.0005426406860351562 seconds 6: Time to load utils op: 0.0013687610626220703 secondsTime to load utils op: 0.0012254714965820312 seconds 6: 6: Time to load utils op: 0.0012161731719970703 secondsTime to load utils op: 0.0013206005096435547 seconds 6: 6: Time to load utils op: 0.0013246536254882812 secondsTime to load utils op: 0.0012869834899902344 seconds 6: 6: Time to load utils op: 0.001399993896484375 seconds 3: Time to load utils op: 0.0008695125579833984 seconds 3: Time to load utils op: 0.0008845329284667969 seconds 3: Time to load utils op: 0.0010228157043457031 secondsTime to load utils op: 0.0010218620300292969 seconds 3: 3: Time to load utils op: 0.0011744499206542969 seconds 3: Time to load utils op: 0.0011477470397949219 seconds 3: Time to load utils op: 0.0011475086212158203 seconds 3: Time to load utils op: 0.0012176036834716797 seconds 0: [2023-02-03 16:35:02,945] [INFO] [utils.py:827:see_memory_usage] after initializing group 0 0: [2023-02-03 16:35:02,945] [INFO] [utils.py:828:see_memory_usage] MA 5.81 GB Max_MA 5.81 GB CA 7.36 GB Max_CA 7 GB 0: [2023-02-03 16:35:02,946] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 43.35 GB, percent = 8.6% 0: [2023-02-03 16:35:03,047] [INFO] [utils.py:827:see_memory_usage] before initializing group 1 0: [2023-02-03 16:35:03,047] [INFO] [utils.py:828:see_memory_usage] MA 5.81 GB Max_MA 5.81 GB CA 7.36 GB Max_CA 7 GB 0: [2023-02-03 16:35:03,047] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 43.35 GB, percent = 8.6% 0: [2023-02-03 16:35:03,150] [INFO] [utils.py:827:see_memory_usage] after initializing group 1 0: [2023-02-03 16:35:03,150] [INFO] [utils.py:828:see_memory_usage] MA 8.52 GB Max_MA 8.52 GB CA 11.39 GB Max_CA 11 GB 0: [2023-02-03 16:35:03,150] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 43.35 GB, percent = 8.6% 0: [2023-02-03 16:35:03,250] [INFO] [utils.py:827:see_memory_usage] before initializing group 2 0: [2023-02-03 16:35:03,251] [INFO] [utils.py:828:see_memory_usage] MA 8.52 GB Max_MA 8.52 GB CA 11.39 GB Max_CA 11 GB 0: [2023-02-03 16:35:03,251] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 43.35 GB, percent = 8.6% 0: [2023-02-03 16:35:03,355] [INFO] [utils.py:827:see_memory_usage] after initializing group 2 0: [2023-02-03 16:35:03,356] [INFO] [utils.py:828:see_memory_usage] MA 8.52 GB Max_MA 8.52 GB CA 11.39 GB Max_CA 11 GB 0: [2023-02-03 16:35:03,356] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 43.35 GB, percent = 8.6% 0: [2023-02-03 16:35:03,455] [INFO] [utils.py:827:see_memory_usage] before initialize_optimizer 0: [2023-02-03 16:35:03,456] [INFO] [utils.py:828:see_memory_usage] MA 8.52 GB Max_MA 8.52 GB CA 11.39 GB Max_CA 11 GB 0: [2023-02-03 16:35:03,456] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 43.35 GB, percent = 8.6% 0: [2023-02-03 16:35:03,561] [INFO] [utils.py:827:see_memory_usage] end initialize_optimizer 0: [2023-02-03 16:35:03,562] [INFO] [utils.py:828:see_memory_usage] MA 8.61 GB Max_MA 8.61 GB CA 11.39 GB Max_CA 11 GB 0: [2023-02-03 16:35:03,562] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 43.35 GB, percent = 8.6% 0: [2023-02-03 16:35:03,661] [INFO] [utils.py:827:see_memory_usage] end bf16_optimizer 0: [2023-02-03 16:35:03,662] [INFO] [utils.py:828:see_memory_usage] MA 8.61 GB Max_MA 8.61 GB CA 11.39 GB Max_CA 11 GB 0: [2023-02-03 16:35:03,662] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 43.35 GB, percent = 8.6% 0: [2023-02-03 16:35:03,662] [INFO] [logging.py:68:log_dist] [Rank 0] DeepSpeed Final Optimizer = FusedAdam 0: [2023-02-03 16:35:03,662] [INFO] [logging.py:68:log_dist] [Rank 0] DeepSpeed using client LR scheduler 0: [2023-02-03 16:35:03,663] [INFO] [logging.py:68:log_dist] [Rank 0] DeepSpeed LR Scheduler = 0: [2023-02-03 16:35:03,663] [INFO] [logging.py:68:log_dist] [Rank 0] step=0, skipped=0, lr=[0.0, 0.0, 0.0], mom=[(0.9, 0.999), (0.9, 0.999), (0.9, 0.999)] 0: [2023-02-03 16:35:03,663] [INFO] [config.py:1007:print] DeepSpeedEngine configuration: 0: [2023-02-03 16:35:03,663] [INFO] [config.py:1011:print] activation_checkpointing_config { 0: "partition_activations": false, 0: "contiguous_memory_optimization": false, 0: "cpu_checkpointing": false, 0: "number_checkpoints": null, 0: "synchronize_checkpoint_boundary": false, 0: "profile": false 0: } 0: [2023-02-03 16:35:03,663] [INFO] [config.py:1011:print] aio_config ................... {'block_size': 1048576, 'queue_depth': 8, 'thread_count': 1, 'single_submit': False, 'overlap_events': True} 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] amp_enabled .................. False 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] amp_params ................... False 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] autotuning_config ............ { 0: "enabled": false, 0: "start_step": null, 0: "end_step": null, 0: "metric_path": null, 0: "arg_mappings": null, 0: "metric": "throughput", 0: "model_info": null, 0: "results_dir": "/pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/autotuning_results", 0: "exps_dir": "/pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/autotuning_exps", 0: "overwrite": true, 0: "fast": true, 0: "start_profile_step": 3, 0: "end_profile_step": 5, 0: "tuner_type": "gridsearch", 0: "tuner_early_stopping": 5, 0: "tuner_num_trials": 50, 0: "model_info_path": null, 0: "mp_size": 1, 0: "max_train_batch_size": null, 0: "min_train_batch_size": 1, 0: "max_train_micro_batch_size_per_gpu": 1.024000e+03, 0: "min_train_micro_batch_size_per_gpu": 1, 0: "num_tuning_micro_batch_sizes": 3 0: } 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] bfloat16_enabled ............. True 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] checkpoint_parallel_write_pipeline False 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] checkpoint_tag_validation_enabled True 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] checkpoint_tag_validation_fail False 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] comms_config ................. 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] communication_data_type ...... None 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] compression_config ........... {'weight_quantization': {'shared_parameters': {'enabled': False, 'quantizer_kernel': False, 'schedule_offset': 0, 'quantize_groups': 1, 'quantize_verbose': False, 'quantization_type': 'symmetric', 'quantize_weight_in_forward': False, 'rounding': 'nearest', 'fp16_mixed_quantize': False, 'quantize_change_ratio': 0.001}, 'different_groups': {}}, 'activation_quantization': {'shared_parameters': {'enabled': False, 'quantization_type': 'symmetric', 'range_calibration': 'dynamic', 'schedule_offset': 1000}, 'different_groups': {}}, 'sparse_pruning': {'shared_parameters': {'enabled': False, 'method': 'l1', 'schedule_offset': 1000}, 'different_groups': {}}, 'row_pruning': {'shared_parameters': {'enabled': False, 'method': 'l1', 'schedule_offset': 1000}, 'different_groups': {}}, 'head_pruning': {'shared_parameters': {'enabled': False, 'method': 'topk', 'schedule_offset': 1000}, 'different_groups': {}}, 'channel_pruning': {'shared_pa 0: rameters': {'enabled': False, 'method': 'l1', 'schedule_offset': 1000}, 'different_groups': {}}, 'layer_reduction': {'enabled': False}} 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] curriculum_enabled ........... False 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] curriculum_params ............ False 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] dataloader_drop_last ......... False 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] disable_allgather ............ False 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] dump_state ................... False 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] dynamic_loss_scale_args ...... None 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] eigenvalue_enabled ........... False 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] eigenvalue_gas_boundary_resolution 1 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] eigenvalue_layer_name ........ bert.encoder.layer 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] eigenvalue_layer_num ......... 0 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] eigenvalue_max_iter .......... 100 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] eigenvalue_stability ......... 1e-06 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] eigenvalue_tol ............... 0.01 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] eigenvalue_verbose ........... False 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] elasticity_enabled ........... False 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] flops_profiler_config ........ { 0: "enabled": false, 0: "profile_step": 1, 0: "module_depth": -1, 0: "top_modules": 1, 0: "detailed": true, 0: "output_file": null 0: } 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] fp16_auto_cast ............... None 0: [2023-02-03 16:35:03,664] [INFO] [config.py:1011:print] fp16_enabled ................. False 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] fp16_master_weights_and_gradients False 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] global_rank .................. 0 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] gradient_accumulation_steps .. 1 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] gradient_clipping ............ 1.0 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] gradient_predivide_factor .... 1.0 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] initial_dynamic_scale ........ 1 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] load_universal_checkpoint .... False 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] loss_scale ................... 1.0 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] memory_breakdown ............. False 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] monitor_config ............... 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] nebula_config ................ { 0: "enabled": false, 0: "persistent_storage_path": null, 0: "persistent_time_interval": 100, 0: "num_of_version_in_retention": 2, 0: "enable_nebula_load": true, 0: "load_path": null 0: } 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] optimizer_legacy_fusion ...... False 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] optimizer_name ............... None 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] optimizer_params ............. None 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] pipeline ..................... {'stages': 'auto', 'partition': 'best', 'seed_layers': False, 'activation_checkpoint_interval': 0} 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] pld_enabled .................. False 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] pld_params ................... False 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] prescale_gradients ........... False 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] scheduler_name ............... None 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] scheduler_params ............. None 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] sparse_attention ............. None 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] sparse_gradients_enabled ..... False 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] steps_per_print .............. 2000 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] train_batch_size ............. 256 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] train_micro_batch_size_per_gpu 2 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] use_node_local_storage ....... False 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] wall_clock_breakdown ......... False 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] world_size ................... 128 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] zero_allow_untested_optimizer False 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] zero_config .................. stage=0 contiguous_gradients=True reduce_scatter=True reduce_bucket_size=500000000 allgather_partitions=True allgather_bucket_size=500000000 overlap_comm=False load_from_fp32_weights=True elastic_checkpoint=False offload_param=None offload_optimizer=None sub_group_size=1000000000 cpu_offload_param=None cpu_offload_use_pin_memory=None cpu_offload=None prefetch_bucket_size=50000000 param_persistence_threshold=100000 model_persistence_threshold=9223372036854775807 max_live_parameters=1000000000 max_reuse_distance=1000000000 gather_16bit_weights_on_model_save=False stage3_gather_fp16_weights_on_model_save=False ignore_unused_parameters=True legacy_stage1=False round_robin_gradients=False 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] zero_enabled ................. False 0: [2023-02-03 16:35:03,665] [INFO] [config.py:1011:print] zero_optimization_stage ...... 0 0: [2023-02-03 16:35:03,665] [INFO] [config.py:996:print_user_config] json = { 0: "train_micro_batch_size_per_gpu": 2, 0: "train_batch_size": 256, 0: "gradient_clipping": 1.0, 0: "zero_optimization": { 0: "stage": 0 0: }, 0: "bf16": { 0: "enabled": true 0: }, 0: "steps_per_print": 2.000000e+03, 0: "wall_clock_breakdown": false 0: } 0: Time to load utils op: 0.0004382133483886719 seconds 0: [2023-02-03 16:35:03,666] [INFO] [engine.py:87:__init__] CONFIG: micro_batches=1 micro_batch_size=2 0: [2023-02-03 16:35:03,748] [INFO] [engine.py:145:__init__] RANK=0 STAGE=0 LAYERS=35 [0, 35) STAGE_PARAMS=1517252608 (1517.253M) TOTAL_PARAMS=1517252608 (1517.253M) UNIQUE_PARAMS=1517252608 (1517.253M) 12: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 12: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 12: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 12: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 12: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 12: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 12: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 13: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 13: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 13: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 13: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 13: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 13: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 13: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 13: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 10: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 10: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 10: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 10: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 10: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 12: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 10: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 10: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 14: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 14: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 14: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 14: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 14: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 14: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 14: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 6: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 6: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 6: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 6: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 6: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 6: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 6: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 10: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 6: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 7: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 7: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 7: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 7: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 7: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 7: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 7: [2023-02-03 16:35:03,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 0: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 9: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 9: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 3: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 3: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 3: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 3: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 3: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 3: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 7: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 14: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 2: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 2: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 2: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 2: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 9: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 9: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 9: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 3: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 0: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 9: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 9: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 0: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 8: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 8: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 8: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 2: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 2: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 4: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 4: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 4: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 4: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 4: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 4: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 0: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 0: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 8: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 8: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 2: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 0: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 8: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 8: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 4: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 0: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 3: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 8: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 4: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 0: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 5: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 9: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 2: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 5: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 5: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 5: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 5: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 5: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 1: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 1: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 1: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 1: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 1: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 1: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 1: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 5: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 5: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 15: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 15: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 15: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 1: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 11: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 11: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 11: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 11: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 11: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 11: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 15: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 15: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 15: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 15: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 11: [2023-02-03 16:35:03,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 11: [2023-02-03 16:35:03,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 15: [2023-02-03 16:35:03,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 12: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 12: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 12: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 12: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 12: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 9: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 9: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 9: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 12: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 12: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 9: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 9: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 12: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 12: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 12: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 12: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 12: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 12: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 9: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 9: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 12: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 9: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 9: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 9: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 9: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 9: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 12: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 12: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 9: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 9: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 9: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 9: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 14: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 14: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 14: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 14: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 14: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 14: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 14: [2023-02-03 16:35:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 14: [2023-02-03 16:35:03,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 14: [2023-02-03 16:35:03,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 14: [2023-02-03 16:35:03,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 14: [2023-02-03 16:35:03,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 14: [2023-02-03 16:35:03,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 14: [2023-02-03 16:35:03,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 14: [2023-02-03 16:35:03,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 14: [2023-02-03 16:35:03,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 14: [2023-02-03 16:35:03,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 12: [2023-02-03 16:35:03,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 9: [2023-02-03 16:35:03,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 12: [2023-02-03 16:35:03,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 9: [2023-02-03 16:35:03,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 9: [2023-02-03 16:35:03,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 12: [2023-02-03 16:35:03,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 9: [2023-02-03 16:35:03,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 12: [2023-02-03 16:35:03,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 12: [2023-02-03 16:35:03,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 9: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 9: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 12: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 12: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 12: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 9: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 12: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 9: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 9: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 9: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 14: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 12: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 12: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 9: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 9: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 12: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 9: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 12: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 14: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 14: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 10: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 10: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 9: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 10: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 10: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 10: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 10: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 10: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 12: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 10: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 10: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 10: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 10: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 14: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 10: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 12: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 9: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 10: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 10: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 10: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 10: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 12: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 9: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 14: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 14: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 14: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 14: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 14: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 14: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 14: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 14: [2023-02-03 16:35:03,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 14: [2023-02-03 16:35:03,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 14: [2023-02-03 16:35:03,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 14: [2023-02-03 16:35:03,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 14: [2023-02-03 16:35:03,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 10: [2023-02-03 16:35:03,779] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 10: [2023-02-03 16:35:03,779] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 10: [2023-02-03 16:35:03,779] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 10: [2023-02-03 16:35:03,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 10: [2023-02-03 16:35:03,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 10: [2023-02-03 16:35:03,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 10: [2023-02-03 16:35:03,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 10: [2023-02-03 16:35:03,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 10: [2023-02-03 16:35:03,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 10: [2023-02-03 16:35:03,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 10: [2023-02-03 16:35:03,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 10: [2023-02-03 16:35:03,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 10: [2023-02-03 16:35:03,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 10: [2023-02-03 16:35:03,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 10: [2023-02-03 16:35:03,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 10: [2023-02-03 16:35:03,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 8: [2023-02-03 16:35:03,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 8: [2023-02-03 16:35:03,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 8: [2023-02-03 16:35:03,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 8: [2023-02-03 16:35:03,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 8: [2023-02-03 16:35:03,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 8: [2023-02-03 16:35:03,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 8: [2023-02-03 16:35:03,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 8: [2023-02-03 16:35:03,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 8: [2023-02-03 16:35:03,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 8: [2023-02-03 16:35:03,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 8: [2023-02-03 16:35:03,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 8: [2023-02-03 16:35:03,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 8: [2023-02-03 16:35:03,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 8: [2023-02-03 16:35:03,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 8: [2023-02-03 16:35:03,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 8: [2023-02-03 16:35:03,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 8: [2023-02-03 16:35:03,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 8: [2023-02-03 16:35:03,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 8: [2023-02-03 16:35:03,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 8: [2023-02-03 16:35:03,786] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 8: [2023-02-03 16:35:03,786] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 8: [2023-02-03 16:35:03,786] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 8: [2023-02-03 16:35:03,786] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 8: [2023-02-03 16:35:03,786] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 8: [2023-02-03 16:35:03,786] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 8: [2023-02-03 16:35:03,786] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 8: [2023-02-03 16:35:03,786] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 8: [2023-02-03 16:35:03,786] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 8: [2023-02-03 16:35:03,786] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 8: [2023-02-03 16:35:03,786] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 8: [2023-02-03 16:35:03,786] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 8: [2023-02-03 16:35:03,786] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 2: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 2: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 0: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 0: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 2: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 2: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 2: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 2: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 0: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 0: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 0: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 0: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 0: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 0: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 0: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 0: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 0: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 0: [2023-02-03 16:35:03,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 0: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 0: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 6: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 6: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 6: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 6: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 6: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 6: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 6: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 0: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 6: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 11: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 11: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 11: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 6: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 6: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 6: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 0: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 6: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 6: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 11: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 6: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 11: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 11: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 11: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 5: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 6: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 1: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 1: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 11: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 11: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 11: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 5: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 5: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 5: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 6: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 5: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 7: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 7: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 7: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 7: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 7: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 5: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 11: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 1: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 11: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 4: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 4: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 1: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 1: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 11: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 11: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 1: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 1: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 7: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 5: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 7: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 5: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 11: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 4: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 4: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 4: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 5: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 5: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 4: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 4: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 1: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 7: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 7: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 5: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 5: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 11: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 1: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 7: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 7: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 7: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 4: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 4: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 1: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 1: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 1: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 7: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 5: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 4: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 4: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 4: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 1: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 1: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 7: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 15: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 15: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 15: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 15: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 5: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 4: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 5: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 1: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 15: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 15: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 15: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 1: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 7: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 5: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 4: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 15: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 4: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 7: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 15: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 15: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 15: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 4: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 13: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 13: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 13: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 15: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 15: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 15: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 13: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 13: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 13: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 13: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 3: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 3: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 3: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 3: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 3: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 3: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 13: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 13: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 13: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 3: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 15: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 13: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 13: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 13: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 13: [2023-02-03 16:35:03,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 3: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 3: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 3: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 3: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 3: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 3: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 3: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 15: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 2: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 3: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 13: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 13: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 3: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt... 2: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 0: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 0: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 0: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 4: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 6: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 6: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 5: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 6: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 1: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 1: [2023-02-03 16:35:03,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 11: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 0: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 11: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 7: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 5: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 6: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 6: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 1: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 7: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 4: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 2: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 5: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 15: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 2: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 2: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 0: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 5: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 13: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 4: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 11: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 3: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 1: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 5: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 1: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 0: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 0: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 6: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 11: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 7: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 15: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 5: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 13: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 4: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 0: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 0: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 5: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 6: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 3: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 0: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 2: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 5: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 4: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 3: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 6: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 1: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 6: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 11: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 11: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 13: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 4: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 0: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 7: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 7: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 7: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 1: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 15: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 5: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 6: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 6: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 7: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 13: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 11: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 1: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 1: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 5: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 6: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 0: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 0: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 11: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 4: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 5: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 3: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 1: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 1: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 15: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 15: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 11: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 5: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 13: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 3: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 1: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 15: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 1: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 7: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 4: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 13: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 6: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 3: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 7: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 11: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 15: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 13: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 6: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 11: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 0: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 5: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 13: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 4: [2023-02-03 16:35:03,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 3: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 7: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 15: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 5: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 6: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 3: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 7: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 7: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 13: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 3: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 1: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 15: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 11: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 5: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 13: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 7: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 13: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 13: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 3: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 0: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 15: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 11: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 3: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 15: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/mp_rank_00_model_states.pt. 5: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 11: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 3: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 1: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 13: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 0: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 15: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 1: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 3: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 15: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 11: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 7: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 13: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 6: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 11: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 7: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 3: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 3: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 13: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 7: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 15: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 15: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 13: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 3: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 15: [2023-02-03 16:35:03,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 8: [2023-02-03 16:35:04,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 8: [2023-02-03 16:35:04,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 8: [2023-02-03 16:35:04,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 8: [2023-02-03 16:35:04,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 8: [2023-02-03 16:35:04,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 8: [2023-02-03 16:35:04,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 8: [2023-02-03 16:35:04,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 8: [2023-02-03 16:35:04,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 8: [2023-02-03 16:35:04,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 8: [2023-02-03 16:35:04,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:04,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 4: [2023-02-03 16:35:04,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 4: [2023-02-03 16:35:04,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 4: [2023-02-03 16:35:04,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 4: [2023-02-03 16:35:04,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 4: [2023-02-03 16:35:04,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 4: [2023-02-03 16:35:04,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 4: [2023-02-03 16:35:04,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 9: [2023-02-03 16:35:04,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:04,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:04,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:04,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:04,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 8: [2023-02-03 16:35:04,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 8: [2023-02-03 16:35:04,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:04,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 8: [2023-02-03 16:35:04,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 8: [2023-02-03 16:35:04,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 8: [2023-02-03 16:35:04,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 8: [2023-02-03 16:35:04,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:04,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:04,128] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:04,128] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 9: [2023-02-03 16:35:04,132] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 9: [2023-02-03 16:35:04,132] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 9: [2023-02-03 16:35:04,132] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 9: [2023-02-03 16:35:04,132] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 9: [2023-02-03 16:35:04,132] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 9: [2023-02-03 16:35:04,132] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 12: [2023-02-03 16:35:04,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 12: [2023-02-03 16:35:04,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 12: [2023-02-03 16:35:04,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 12: [2023-02-03 16:35:04,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 12: [2023-02-03 16:35:04,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 12: [2023-02-03 16:35:04,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 12: [2023-02-03 16:35:04,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 12: [2023-02-03 16:35:04,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 0: [2023-02-03 16:35:04,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 0: [2023-02-03 16:35:04,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 0: [2023-02-03 16:35:04,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 0: [2023-02-03 16:35:04,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 2: [2023-02-03 16:35:04,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 2: [2023-02-03 16:35:04,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 2: [2023-02-03 16:35:04,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 2: [2023-02-03 16:35:04,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 2: [2023-02-03 16:35:04,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 2: [2023-02-03 16:35:04,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 0: [2023-02-03 16:35:04,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 0: [2023-02-03 16:35:04,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 2: [2023-02-03 16:35:04,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 2: [2023-02-03 16:35:04,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 0: [2023-02-03 16:35:04,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 0: [2023-02-03 16:35:04,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 14: [2023-02-03 16:35:04,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 14: [2023-02-03 16:35:04,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 14: [2023-02-03 16:35:04,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 14: [2023-02-03 16:35:04,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 14: [2023-02-03 16:35:04,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 14: [2023-02-03 16:35:04,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 14: [2023-02-03 16:35:04,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 14: [2023-02-03 16:35:04,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 7: [2023-02-03 16:35:04,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 7: [2023-02-03 16:35:04,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 7: [2023-02-03 16:35:04,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 7: [2023-02-03 16:35:04,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 7: [2023-02-03 16:35:04,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 7: [2023-02-03 16:35:04,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 7: [2023-02-03 16:35:04,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 7: [2023-02-03 16:35:04,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 12: [2023-02-03 16:35:04,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 12: [2023-02-03 16:35:04,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 12: [2023-02-03 16:35:04,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 11: [2023-02-03 16:35:04,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 11: [2023-02-03 16:35:04,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 11: [2023-02-03 16:35:04,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 14: [2023-02-03 16:35:04,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 2: [2023-02-03 16:35:04,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 2: [2023-02-03 16:35:04,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 2: [2023-02-03 16:35:04,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 2: [2023-02-03 16:35:04,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 7: [2023-02-03 16:35:04,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 7: [2023-02-03 16:35:04,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 12: [2023-02-03 16:35:04,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 12: [2023-02-03 16:35:04,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 12: [2023-02-03 16:35:04,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 12: [2023-02-03 16:35:04,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 12: [2023-02-03 16:35:04,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 7: [2023-02-03 16:35:04,169] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 11: [2023-02-03 16:35:04,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 11: [2023-02-03 16:35:04,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 11: [2023-02-03 16:35:04,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 11: [2023-02-03 16:35:04,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 11: [2023-02-03 16:35:04,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 2: [2023-02-03 16:35:04,172] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 2: [2023-02-03 16:35:04,172] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 2: [2023-02-03 16:35:04,172] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 2: [2023-02-03 16:35:04,172] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 7: [2023-02-03 16:35:04,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 7: [2023-02-03 16:35:04,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 7: [2023-02-03 16:35:04,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 7: [2023-02-03 16:35:04,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 7: [2023-02-03 16:35:04,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 0: [2023-02-03 16:35:04,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 0: [2023-02-03 16:35:04,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 0: [2023-02-03 16:35:04,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 0: [2023-02-03 16:35:04,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 0: [2023-02-03 16:35:04,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 0: [2023-02-03 16:35:04,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 0: [2023-02-03 16:35:04,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 0: [2023-02-03 16:35:04,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 14: [2023-02-03 16:35:04,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 14: [2023-02-03 16:35:04,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 14: [2023-02-03 16:35:04,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 14: [2023-02-03 16:35:04,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 14: [2023-02-03 16:35:04,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 14: [2023-02-03 16:35:04,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 14: [2023-02-03 16:35:04,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 8: [2023-02-03 16:35:04,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 8: [2023-02-03 16:35:04,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 5: [2023-02-03 16:35:04,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 13: [2023-02-03 16:35:04,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 13: [2023-02-03 16:35:04,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 1: [2023-02-03 16:35:04,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 1: [2023-02-03 16:35:04,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 1: [2023-02-03 16:35:04,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 1: [2023-02-03 16:35:04,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 1: [2023-02-03 16:35:04,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 1: [2023-02-03 16:35:04,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 13: [2023-02-03 16:35:04,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 1: [2023-02-03 16:35:04,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 5: [2023-02-03 16:35:04,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 5: [2023-02-03 16:35:04,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 13: [2023-02-03 16:35:04,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 5: [2023-02-03 16:35:04,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 5: [2023-02-03 16:35:04,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 5: [2023-02-03 16:35:04,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 5: [2023-02-03 16:35:04,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 1: [2023-02-03 16:35:04,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 13: [2023-02-03 16:35:04,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 13: [2023-02-03 16:35:04,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 5: [2023-02-03 16:35:04,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 10: [2023-02-03 16:35:04,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 10: [2023-02-03 16:35:04,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 13: [2023-02-03 16:35:04,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 10: [2023-02-03 16:35:04,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 13: [2023-02-03 16:35:04,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 10: [2023-02-03 16:35:04,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 10: [2023-02-03 16:35:04,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 10: [2023-02-03 16:35:04,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 10: [2023-02-03 16:35:04,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 10: [2023-02-03 16:35:04,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 3: [2023-02-03 16:35:04,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 3: [2023-02-03 16:35:04,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 3: [2023-02-03 16:35:04,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 3: [2023-02-03 16:35:04,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 3: [2023-02-03 16:35:04,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 3: [2023-02-03 16:35:04,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 3: [2023-02-03 16:35:04,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 3: [2023-02-03 16:35:04,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 15: [2023-02-03 16:35:04,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 15: [2023-02-03 16:35:04,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 15: [2023-02-03 16:35:04,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 15: [2023-02-03 16:35:04,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 15: [2023-02-03 16:35:04,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 15: [2023-02-03 16:35:04,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 15: [2023-02-03 16:35:04,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 15: [2023-02-03 16:35:04,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 6: [2023-02-03 16:35:04,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 6: [2023-02-03 16:35:04,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 6: [2023-02-03 16:35:04,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 6: [2023-02-03 16:35:04,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 6: [2023-02-03 16:35:04,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 6: [2023-02-03 16:35:04,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 6: [2023-02-03 16:35:04,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 6: [2023-02-03 16:35:04,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 10: [2023-02-03 16:35:04,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 10: [2023-02-03 16:35:04,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 13: [2023-02-03 16:35:04,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 13: [2023-02-03 16:35:04,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 5: [2023-02-03 16:35:04,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 5: [2023-02-03 16:35:04,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 5: [2023-02-03 16:35:04,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 5: [2023-02-03 16:35:04,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 1: [2023-02-03 16:35:04,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 1: [2023-02-03 16:35:04,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 1: [2023-02-03 16:35:04,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 1: [2023-02-03 16:35:04,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 3: [2023-02-03 16:35:04,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 3: [2023-02-03 16:35:04,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 3: [2023-02-03 16:35:04,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 1: [2023-02-03 16:35:04,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 1: [2023-02-03 16:35:04,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 5: [2023-02-03 16:35:04,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 5: [2023-02-03 16:35:04,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 5: [2023-02-03 16:35:04,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 5: [2023-02-03 16:35:04,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:04,203] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 4: [2023-02-03 16:35:04,203] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 1: [2023-02-03 16:35:04,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 1: [2023-02-03 16:35:04,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:04,203] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 4: [2023-02-03 16:35:04,203] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 6: [2023-02-03 16:35:04,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 15: [2023-02-03 16:35:04,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 15: [2023-02-03 16:35:04,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 15: [2023-02-03 16:35:04,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 15: [2023-02-03 16:35:04,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 6: [2023-02-03 16:35:04,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 6: [2023-02-03 16:35:04,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 13: [2023-02-03 16:35:04,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 9: [2023-02-03 16:35:04,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 15: [2023-02-03 16:35:04,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 13: [2023-02-03 16:35:04,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 10: [2023-02-03 16:35:04,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 15: [2023-02-03 16:35:04,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 15: [2023-02-03 16:35:04,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 10: [2023-02-03 16:35:04,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 13: [2023-02-03 16:35:04,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 15: [2023-02-03 16:35:04,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 10: [2023-02-03 16:35:04,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 13: [2023-02-03 16:35:04,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 10: [2023-02-03 16:35:04,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:04,208] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 4: [2023-02-03 16:35:04,208] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 3: [2023-02-03 16:35:04,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 3: [2023-02-03 16:35:04,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 13: [2023-02-03 16:35:04,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 10: [2023-02-03 16:35:04,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 13: [2023-02-03 16:35:04,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 10: [2023-02-03 16:35:04,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 3: [2023-02-03 16:35:04,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 3: [2023-02-03 16:35:04,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 3: [2023-02-03 16:35:04,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:04,210] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 8: [2023-02-03 16:35:04,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 6: [2023-02-03 16:35:04,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 8: [2023-02-03 16:35:04,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 6: [2023-02-03 16:35:04,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 6: [2023-02-03 16:35:04,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 6: [2023-02-03 16:35:04,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 6: [2023-02-03 16:35:04,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt... 4: [2023-02-03 16:35:04,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 8: [2023-02-03 16:35:04,214] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 8: [2023-02-03 16:35:04,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 8: [2023-02-03 16:35:04,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 8: [2023-02-03 16:35:04,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 8: [2023-02-03 16:35:04,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,225] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,226] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 12: [2023-02-03 16:35:04,232] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 12: [2023-02-03 16:35:04,232] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 12: [2023-02-03 16:35:04,232] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 4: [2023-02-03 16:35:04,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 7: [2023-02-03 16:35:04,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 7: [2023-02-03 16:35:04,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 4: [2023-02-03 16:35:04,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 4: [2023-02-03 16:35:04,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 14: [2023-02-03 16:35:04,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 7: [2023-02-03 16:35:04,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 2: [2023-02-03 16:35:04,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 2: [2023-02-03 16:35:04,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 2: [2023-02-03 16:35:04,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 2: [2023-02-03 16:35:04,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 4: [2023-02-03 16:35:04,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 4: [2023-02-03 16:35:04,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 4: [2023-02-03 16:35:04,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 4: [2023-02-03 16:35:04,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 8: [2023-02-03 16:35:04,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 4: [2023-02-03 16:35:04,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 8: [2023-02-03 16:35:04,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 12: [2023-02-03 16:35:04,247] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 8: [2023-02-03 16:35:04,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 8: [2023-02-03 16:35:04,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 8: [2023-02-03 16:35:04,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 2: [2023-02-03 16:35:04,250] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 2: [2023-02-03 16:35:04,250] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 2: [2023-02-03 16:35:04,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 8: [2023-02-03 16:35:04,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 12: [2023-02-03 16:35:04,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 2: [2023-02-03 16:35:04,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 12: [2023-02-03 16:35:04,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,255] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 12: [2023-02-03 16:35:04,255] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 12: [2023-02-03 16:35:04,255] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 12: [2023-02-03 16:35:04,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 7: [2023-02-03 16:35:04,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 9: [2023-02-03 16:35:04,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 9: [2023-02-03 16:35:04,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 9: [2023-02-03 16:35:04,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 11: [2023-02-03 16:35:04,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 12: [2023-02-03 16:35:04,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 12: [2023-02-03 16:35:04,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 7: [2023-02-03 16:35:04,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 7: [2023-02-03 16:35:04,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 11: [2023-02-03 16:35:04,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 7: [2023-02-03 16:35:04,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 7: [2023-02-03 16:35:04,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 7: [2023-02-03 16:35:04,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 7: [2023-02-03 16:35:04,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 7: [2023-02-03 16:35:04,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 14: [2023-02-03 16:35:04,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 10: [2023-02-03 16:35:04,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 8: [2023-02-03 16:35:04,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 2: [2023-02-03 16:35:04,268] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 13: [2023-02-03 16:35:04,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 2: [2023-02-03 16:35:04,270] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 2: [2023-02-03 16:35:04,270] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 2: [2023-02-03 16:35:04,270] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 3: [2023-02-03 16:35:04,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 3: [2023-02-03 16:35:04,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 3: [2023-02-03 16:35:04,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 5: [2023-02-03 16:35:04,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 5: [2023-02-03 16:35:04,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 5: [2023-02-03 16:35:04,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 12: [2023-02-03 16:35:04,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 5: [2023-02-03 16:35:04,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 1: [2023-02-03 16:35:04,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 1: [2023-02-03 16:35:04,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 1: [2023-02-03 16:35:04,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 1: [2023-02-03 16:35:04,276] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 14: [2023-02-03 16:35:04,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 14: [2023-02-03 16:35:04,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 14: [2023-02-03 16:35:04,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 14: [2023-02-03 16:35:04,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 14: [2023-02-03 16:35:04,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 6: [2023-02-03 16:35:04,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 6: [2023-02-03 16:35:04,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 14: [2023-02-03 16:35:04,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 15: [2023-02-03 16:35:04,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 15: [2023-02-03 16:35:04,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 15: [2023-02-03 16:35:04,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 15: [2023-02-03 16:35:04,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 14: [2023-02-03 16:35:04,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 1: [2023-02-03 16:35:04,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 1: [2023-02-03 16:35:04,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 5: [2023-02-03 16:35:04,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 5: [2023-02-03 16:35:04,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 0: [2023-02-03 16:35:04,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 0: [2023-02-03 16:35:04,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 1: [2023-02-03 16:35:04,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 1: [2023-02-03 16:35:04,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 0: [2023-02-03 16:35:04,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 0: [2023-02-03 16:35:04,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 5: [2023-02-03 16:35:04,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 5: [2023-02-03 16:35:04,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 0: [2023-02-03 16:35:04,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 12: [2023-02-03 16:35:04,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 12: [2023-02-03 16:35:04,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 12: [2023-02-03 16:35:04,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 12: [2023-02-03 16:35:04,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 0: [2023-02-03 16:35:04,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 2: [2023-02-03 16:35:04,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 0: [2023-02-03 16:35:04,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 6: [2023-02-03 16:35:04,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 2: [2023-02-03 16:35:04,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 15: [2023-02-03 16:35:04,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 15: [2023-02-03 16:35:04,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 15: [2023-02-03 16:35:04,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 3: [2023-02-03 16:35:04,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 3: [2023-02-03 16:35:04,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 15: [2023-02-03 16:35:04,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 11: [2023-02-03 16:35:04,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 2: [2023-02-03 16:35:04,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 6: [2023-02-03 16:35:04,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 2: [2023-02-03 16:35:04,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 7: [2023-02-03 16:35:04,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 7: [2023-02-03 16:35:04,292] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 7: [2023-02-03 16:35:04,292] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 3: [2023-02-03 16:35:04,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 3: [2023-02-03 16:35:04,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 3: [2023-02-03 16:35:04,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 10: [2023-02-03 16:35:04,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 13: [2023-02-03 16:35:04,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 13: [2023-02-03 16:35:04,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 13: [2023-02-03 16:35:04,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 13: [2023-02-03 16:35:04,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 10: [2023-02-03 16:35:04,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 10: [2023-02-03 16:35:04,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 10: [2023-02-03 16:35:04,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 3: [2023-02-03 16:35:04,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 7: [2023-02-03 16:35:04,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 7: [2023-02-03 16:35:04,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 3: [2023-02-03 16:35:04,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 3: [2023-02-03 16:35:04,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 6: [2023-02-03 16:35:04,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 6: [2023-02-03 16:35:04,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 6: [2023-02-03 16:35:04,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 0: [2023-02-03 16:35:04,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 6: [2023-02-03 16:35:04,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 5: [2023-02-03 16:35:04,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 5: [2023-02-03 16:35:04,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 15: [2023-02-03 16:35:04,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 5: [2023-02-03 16:35:04,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 14: [2023-02-03 16:35:04,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 15: [2023-02-03 16:35:04,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 5: [2023-02-03 16:35:04,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 6: [2023-02-03 16:35:04,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 6: [2023-02-03 16:35:04,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 15: [2023-02-03 16:35:04,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 15: [2023-02-03 16:35:04,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 14: [2023-02-03 16:35:04,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 14: [2023-02-03 16:35:04,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 14: [2023-02-03 16:35:04,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 5: [2023-02-03 16:35:04,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 5: [2023-02-03 16:35:04,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 3: [2023-02-03 16:35:04,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 3: [2023-02-03 16:35:04,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 14: [2023-02-03 16:35:04,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 6: [2023-02-03 16:35:04,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 6: [2023-02-03 16:35:04,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 5: [2023-02-03 16:35:04,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 0: [2023-02-03 16:35:04,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 3: [2023-02-03 16:35:04,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 10: [2023-02-03 16:35:04,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 0: [2023-02-03 16:35:04,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 3: [2023-02-03 16:35:04,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 14: [2023-02-03 16:35:04,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 5: [2023-02-03 16:35:04,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 6: [2023-02-03 16:35:04,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 15: [2023-02-03 16:35:04,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 14: [2023-02-03 16:35:04,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 3: [2023-02-03 16:35:04,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 0: [2023-02-03 16:35:04,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 0: [2023-02-03 16:35:04,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 0: [2023-02-03 16:35:04,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 6: [2023-02-03 16:35:04,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_01-model_00-model_states.pt. 6: [2023-02-03 16:35:04,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 0: [2023-02-03 16:35:04,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 0: [2023-02-03 16:35:04,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 0: [2023-02-03 16:35:04,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 6: [2023-02-03 16:35:04,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 15: [2023-02-03 16:35:04,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 15: [2023-02-03 16:35:04,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 15: [2023-02-03 16:35:04,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 8: [2023-02-03 16:35:04,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 8: [2023-02-03 16:35:04,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 8: [2023-02-03 16:35:04,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 8: [2023-02-03 16:35:04,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 8: [2023-02-03 16:35:04,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 8: [2023-02-03 16:35:04,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 8: [2023-02-03 16:35:04,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 8: [2023-02-03 16:35:04,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 8: [2023-02-03 16:35:04,475] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 8: [2023-02-03 16:35:04,475] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 8: [2023-02-03 16:35:04,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 8: [2023-02-03 16:35:04,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 8: [2023-02-03 16:35:04,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 8: [2023-02-03 16:35:04,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 8: [2023-02-03 16:35:04,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 8: [2023-02-03 16:35:04,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 4: [2023-02-03 16:35:04,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 4: [2023-02-03 16:35:04,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 4: [2023-02-03 16:35:04,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 4: [2023-02-03 16:35:04,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 4: [2023-02-03 16:35:04,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 4: [2023-02-03 16:35:04,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 4: [2023-02-03 16:35:04,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 4: [2023-02-03 16:35:04,496] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 4: [2023-02-03 16:35:04,499] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 4: [2023-02-03 16:35:04,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 4: [2023-02-03 16:35:04,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 4: [2023-02-03 16:35:04,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 4: [2023-02-03 16:35:04,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 4: [2023-02-03 16:35:04,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 4: [2023-02-03 16:35:04,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 4: [2023-02-03 16:35:04,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 1: [2023-02-03 16:35:04,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 1: [2023-02-03 16:35:04,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 1: [2023-02-03 16:35:04,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 1: [2023-02-03 16:35:04,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 1: [2023-02-03 16:35:04,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 1: [2023-02-03 16:35:04,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 1: [2023-02-03 16:35:04,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 1: [2023-02-03 16:35:04,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 13: [2023-02-03 16:35:04,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 10: [2023-02-03 16:35:04,508] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 13: [2023-02-03 16:35:04,508] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 10: [2023-02-03 16:35:04,508] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 10: [2023-02-03 16:35:04,508] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 10: [2023-02-03 16:35:04,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 10: [2023-02-03 16:35:04,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 10: [2023-02-03 16:35:04,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 10: [2023-02-03 16:35:04,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 13: [2023-02-03 16:35:04,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 13: [2023-02-03 16:35:04,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 13: [2023-02-03 16:35:04,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 13: [2023-02-03 16:35:04,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 13: [2023-02-03 16:35:04,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 13: [2023-02-03 16:35:04,510] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 10: [2023-02-03 16:35:04,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 8: [2023-02-03 16:35:04,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 1: [2023-02-03 16:35:04,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 8: [2023-02-03 16:35:04,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 1: [2023-02-03 16:35:04,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 8: [2023-02-03 16:35:04,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 8: [2023-02-03 16:35:04,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 8: [2023-02-03 16:35:04,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 8: [2023-02-03 16:35:04,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 8: [2023-02-03 16:35:04,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 8: [2023-02-03 16:35:04,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 8: [2023-02-03 16:35:04,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 8: [2023-02-03 16:35:04,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 2: [2023-02-03 16:35:04,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 2: [2023-02-03 16:35:04,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 2: [2023-02-03 16:35:04,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 2: [2023-02-03 16:35:04,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 4: [2023-02-03 16:35:04,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 4: [2023-02-03 16:35:04,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 4: [2023-02-03 16:35:04,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 2: [2023-02-03 16:35:04,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 2: [2023-02-03 16:35:04,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 2: [2023-02-03 16:35:04,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 2: [2023-02-03 16:35:04,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 4: [2023-02-03 16:35:04,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 4: [2023-02-03 16:35:04,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 4: [2023-02-03 16:35:04,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 4: [2023-02-03 16:35:04,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 4: [2023-02-03 16:35:04,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 2: [2023-02-03 16:35:04,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 2: [2023-02-03 16:35:04,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 2: [2023-02-03 16:35:04,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 2: [2023-02-03 16:35:04,543] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 5: [2023-02-03 16:35:04,543] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 5: [2023-02-03 16:35:04,543] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 5: [2023-02-03 16:35:04,543] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 5: [2023-02-03 16:35:04,543] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 5: [2023-02-03 16:35:04,544] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 5: [2023-02-03 16:35:04,544] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 5: [2023-02-03 16:35:04,544] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 5: [2023-02-03 16:35:04,544] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 2: [2023-02-03 16:35:04,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 2: [2023-02-03 16:35:04,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 2: [2023-02-03 16:35:04,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 2: [2023-02-03 16:35:04,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,544] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 10: [2023-02-03 16:35:04,544] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 13: [2023-02-03 16:35:04,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 5: [2023-02-03 16:35:04,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,547] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 5: [2023-02-03 16:35:04,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 5: [2023-02-03 16:35:04,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 5: [2023-02-03 16:35:04,548] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,548] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 5: [2023-02-03 16:35:04,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 5: [2023-02-03 16:35:04,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 5: [2023-02-03 16:35:04,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 5: [2023-02-03 16:35:04,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,548] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 1: [2023-02-03 16:35:04,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 1: [2023-02-03 16:35:04,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 9: [2023-02-03 16:35:04,550] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 6: [2023-02-03 16:35:04,550] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 6: [2023-02-03 16:35:04,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 6: [2023-02-03 16:35:04,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 9: [2023-02-03 16:35:04,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 1: [2023-02-03 16:35:04,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 1: [2023-02-03 16:35:04,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 1: [2023-02-03 16:35:04,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 1: [2023-02-03 16:35:04,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 6: [2023-02-03 16:35:04,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 6: [2023-02-03 16:35:04,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 6: [2023-02-03 16:35:04,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 9: [2023-02-03 16:35:04,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 6: [2023-02-03 16:35:04,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 9: [2023-02-03 16:35:04,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 9: [2023-02-03 16:35:04,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 9: [2023-02-03 16:35:04,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 9: [2023-02-03 16:35:04,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 6: [2023-02-03 16:35:04,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 9: [2023-02-03 16:35:04,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 9: [2023-02-03 16:35:04,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 14: [2023-02-03 16:35:04,553] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 9: [2023-02-03 16:35:04,554] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 14: [2023-02-03 16:35:04,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 14: [2023-02-03 16:35:04,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 14: [2023-02-03 16:35:04,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 14: [2023-02-03 16:35:04,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 14: [2023-02-03 16:35:04,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 14: [2023-02-03 16:35:04,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 14: [2023-02-03 16:35:04,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 6: [2023-02-03 16:35:04,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 6: [2023-02-03 16:35:04,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 6: [2023-02-03 16:35:04,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 14: [2023-02-03 16:35:04,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 6: [2023-02-03 16:35:04,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 6: [2023-02-03 16:35:04,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 6: [2023-02-03 16:35:04,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 6: [2023-02-03 16:35:04,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 4: [2023-02-03 16:35:04,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 8: [2023-02-03 16:35:04,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 6: [2023-02-03 16:35:04,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 0: [2023-02-03 16:35:04,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 0: [2023-02-03 16:35:04,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 0: [2023-02-03 16:35:04,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 0: [2023-02-03 16:35:04,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 0: [2023-02-03 16:35:04,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 0: [2023-02-03 16:35:04,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 0: [2023-02-03 16:35:04,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 9: [2023-02-03 16:35:04,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 0: [2023-02-03 16:35:04,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 9: [2023-02-03 16:35:04,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 8: [2023-02-03 16:35:04,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 4: [2023-02-03 16:35:04,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 4: [2023-02-03 16:35:04,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 9: [2023-02-03 16:35:04,560] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 9: [2023-02-03 16:35:04,560] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 9: [2023-02-03 16:35:04,560] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 9: [2023-02-03 16:35:04,560] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 8: [2023-02-03 16:35:04,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 10: [2023-02-03 16:35:04,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 10: [2023-02-03 16:35:04,561] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 10: [2023-02-03 16:35:04,562] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 10: [2023-02-03 16:35:04,562] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 10: [2023-02-03 16:35:04,562] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 13: [2023-02-03 16:35:04,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 10: [2023-02-03 16:35:04,562] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 8: [2023-02-03 16:35:04,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 14: [2023-02-03 16:35:04,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 8: [2023-02-03 16:35:04,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 4: [2023-02-03 16:35:04,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 8: [2023-02-03 16:35:04,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 10: [2023-02-03 16:35:04,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 14: [2023-02-03 16:35:04,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 13: [2023-02-03 16:35:04,564] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 13: [2023-02-03 16:35:04,564] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 13: [2023-02-03 16:35:04,564] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 13: [2023-02-03 16:35:04,564] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 14: [2023-02-03 16:35:04,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 14: [2023-02-03 16:35:04,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 11: [2023-02-03 16:35:04,564] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 11: [2023-02-03 16:35:04,564] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 14: [2023-02-03 16:35:04,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 14: [2023-02-03 16:35:04,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 11: [2023-02-03 16:35:04,565] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 14: [2023-02-03 16:35:04,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 11: [2023-02-03 16:35:04,565] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 11: [2023-02-03 16:35:04,565] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 11: [2023-02-03 16:35:04,565] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 11: [2023-02-03 16:35:04,565] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 13: [2023-02-03 16:35:04,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 11: [2023-02-03 16:35:04,565] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 13: [2023-02-03 16:35:04,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 0: [2023-02-03 16:35:04,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 4: [2023-02-03 16:35:04,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 0: [2023-02-03 16:35:04,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 11: [2023-02-03 16:35:04,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 11: [2023-02-03 16:35:04,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 11: [2023-02-03 16:35:04,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 0: [2023-02-03 16:35:04,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 0: [2023-02-03 16:35:04,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 7: [2023-02-03 16:35:04,568] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 7: [2023-02-03 16:35:04,568] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 7: [2023-02-03 16:35:04,568] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 0: [2023-02-03 16:35:04,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 7: [2023-02-03 16:35:04,568] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 0: [2023-02-03 16:35:04,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 15: [2023-02-03 16:35:04,568] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 0: [2023-02-03 16:35:04,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 0: [2023-02-03 16:35:04,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 7: [2023-02-03 16:35:04,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 7: [2023-02-03 16:35:04,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 7: [2023-02-03 16:35:04,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 13: [2023-02-03 16:35:04,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 15: [2023-02-03 16:35:04,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 11: [2023-02-03 16:35:04,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 4: [2023-02-03 16:35:04,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 15: [2023-02-03 16:35:04,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 4: [2023-02-03 16:35:04,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 4: [2023-02-03 16:35:04,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 15: [2023-02-03 16:35:04,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 11: [2023-02-03 16:35:04,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 11: [2023-02-03 16:35:04,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 11: [2023-02-03 16:35:04,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 11: [2023-02-03 16:35:04,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 15: [2023-02-03 16:35:04,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 15: [2023-02-03 16:35:04,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 15: [2023-02-03 16:35:04,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 15: [2023-02-03 16:35:04,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 3: [2023-02-03 16:35:04,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 3: [2023-02-03 16:35:04,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 3: [2023-02-03 16:35:04,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 3: [2023-02-03 16:35:04,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 3: [2023-02-03 16:35:04,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 3: [2023-02-03 16:35:04,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 3: [2023-02-03 16:35:04,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 3: [2023-02-03 16:35:04,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 7: [2023-02-03 16:35:04,572] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,572] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,572] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 12: [2023-02-03 16:35:04,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 7: [2023-02-03 16:35:04,572] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 12: [2023-02-03 16:35:04,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 12: [2023-02-03 16:35:04,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 12: [2023-02-03 16:35:04,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 12: [2023-02-03 16:35:04,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 12: [2023-02-03 16:35:04,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 12: [2023-02-03 16:35:04,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 12: [2023-02-03 16:35:04,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 15: [2023-02-03 16:35:04,573] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 7: [2023-02-03 16:35:04,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 3: [2023-02-03 16:35:04,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 15: [2023-02-03 16:35:04,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 15: [2023-02-03 16:35:04,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 7: [2023-02-03 16:35:04,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 15: [2023-02-03 16:35:04,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 15: [2023-02-03 16:35:04,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 7: [2023-02-03 16:35:04,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 3: [2023-02-03 16:35:04,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 3: [2023-02-03 16:35:04,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 7: [2023-02-03 16:35:04,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 7: [2023-02-03 16:35:04,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 15: [2023-02-03 16:35:04,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 15: [2023-02-03 16:35:04,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 15: [2023-02-03 16:35:04,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 12: [2023-02-03 16:35:04,576] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 12: [2023-02-03 16:35:04,576] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 12: [2023-02-03 16:35:04,576] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,576] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 3: [2023-02-03 16:35:04,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 1: [2023-02-03 16:35:04,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 1: [2023-02-03 16:35:04,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 1: [2023-02-03 16:35:04,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 3: [2023-02-03 16:35:04,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 3: [2023-02-03 16:35:04,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 3: [2023-02-03 16:35:04,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 12: [2023-02-03 16:35:04,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 3: [2023-02-03 16:35:04,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 12: [2023-02-03 16:35:04,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 12: [2023-02-03 16:35:04,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 12: [2023-02-03 16:35:04,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt... 10: [2023-02-03 16:35:04,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 2: [2023-02-03 16:35:04,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 2: [2023-02-03 16:35:04,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 2: [2023-02-03 16:35:04,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 2: [2023-02-03 16:35:04,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 9: [2023-02-03 16:35:04,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 5: [2023-02-03 16:35:04,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 5: [2023-02-03 16:35:04,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 10: [2023-02-03 16:35:04,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 5: [2023-02-03 16:35:04,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 5: [2023-02-03 16:35:04,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 2: [2023-02-03 16:35:04,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 2: [2023-02-03 16:35:04,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 2: [2023-02-03 16:35:04,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 2: [2023-02-03 16:35:04,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 10: [2023-02-03 16:35:04,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 5: [2023-02-03 16:35:04,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 10: [2023-02-03 16:35:04,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 13: [2023-02-03 16:35:04,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 5: [2023-02-03 16:35:04,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 5: [2023-02-03 16:35:04,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 5: [2023-02-03 16:35:04,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 13: [2023-02-03 16:35:04,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 10: [2023-02-03 16:35:04,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 10: [2023-02-03 16:35:04,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 9: [2023-02-03 16:35:04,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 13: [2023-02-03 16:35:04,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 14: [2023-02-03 16:35:04,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 13: [2023-02-03 16:35:04,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 6: [2023-02-03 16:35:04,593] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 6: [2023-02-03 16:35:04,593] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 6: [2023-02-03 16:35:04,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 13: [2023-02-03 16:35:04,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 13: [2023-02-03 16:35:04,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 6: [2023-02-03 16:35:04,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 6: [2023-02-03 16:35:04,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 6: [2023-02-03 16:35:04,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 6: [2023-02-03 16:35:04,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 6: [2023-02-03 16:35:04,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 2: [2023-02-03 16:35:04,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 9: [2023-02-03 16:35:04,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 11: [2023-02-03 16:35:04,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 11: [2023-02-03 16:35:04,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 2: [2023-02-03 16:35:04,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 2: [2023-02-03 16:35:04,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 2: [2023-02-03 16:35:04,605] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 7: [2023-02-03 16:35:04,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 7: [2023-02-03 16:35:04,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 9: [2023-02-03 16:35:04,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 9: [2023-02-03 16:35:04,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 5: [2023-02-03 16:35:04,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 9: [2023-02-03 16:35:04,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 9: [2023-02-03 16:35:04,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 9: [2023-02-03 16:35:04,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 9: [2023-02-03 16:35:04,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 11: [2023-02-03 16:35:04,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 5: [2023-02-03 16:35:04,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 5: [2023-02-03 16:35:04,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 5: [2023-02-03 16:35:04,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 3: [2023-02-03 16:35:04,610] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 3: [2023-02-03 16:35:04,610] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 3: [2023-02-03 16:35:04,610] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 9: [2023-02-03 16:35:04,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 15: [2023-02-03 16:35:04,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 15: [2023-02-03 16:35:04,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 15: [2023-02-03 16:35:04,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 12: [2023-02-03 16:35:04,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 12: [2023-02-03 16:35:04,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 12: [2023-02-03 16:35:04,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 11: [2023-02-03 16:35:04,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 11: [2023-02-03 16:35:04,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 11: [2023-02-03 16:35:04,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 11: [2023-02-03 16:35:04,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 14: [2023-02-03 16:35:04,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 11: [2023-02-03 16:35:04,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 15: [2023-02-03 16:35:04,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 7: [2023-02-03 16:35:04,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 15: [2023-02-03 16:35:04,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 15: [2023-02-03 16:35:04,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 15: [2023-02-03 16:35:04,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 15: [2023-02-03 16:35:04,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 14: [2023-02-03 16:35:04,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 14: [2023-02-03 16:35:04,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 14: [2023-02-03 16:35:04,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 2: [2023-02-03 16:35:04,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 14: [2023-02-03 16:35:04,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 14: [2023-02-03 16:35:04,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 14: [2023-02-03 16:35:04,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 14: [2023-02-03 16:35:04,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 5: [2023-02-03 16:35:04,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 2: [2023-02-03 16:35:04,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 5: [2023-02-03 16:35:04,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 5: [2023-02-03 16:35:04,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 2: [2023-02-03 16:35:04,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 5: [2023-02-03 16:35:04,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 2: [2023-02-03 16:35:04,618] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,618] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 7: [2023-02-03 16:35:04,618] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 7: [2023-02-03 16:35:04,618] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 7: [2023-02-03 16:35:04,618] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 3: [2023-02-03 16:35:04,618] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 3: [2023-02-03 16:35:04,618] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 3: [2023-02-03 16:35:04,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 6: [2023-02-03 16:35:04,618] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 0: [2023-02-03 16:35:04,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 12: [2023-02-03 16:35:04,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 6: [2023-02-03 16:35:04,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 6: [2023-02-03 16:35:04,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 3: [2023-02-03 16:35:04,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 3: [2023-02-03 16:35:04,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 12: [2023-02-03 16:35:04,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 12: [2023-02-03 16:35:04,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 12: [2023-02-03 16:35:04,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 11: [2023-02-03 16:35:04,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 11: [2023-02-03 16:35:04,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 11: [2023-02-03 16:35:04,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 0: [2023-02-03 16:35:04,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 6: [2023-02-03 16:35:04,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 0: [2023-02-03 16:35:04,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 0: [2023-02-03 16:35:04,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 0: [2023-02-03 16:35:04,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 6: [2023-02-03 16:35:04,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 6: [2023-02-03 16:35:04,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 6: [2023-02-03 16:35:04,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 0: [2023-02-03 16:35:04,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 0: [2023-02-03 16:35:04,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 12: [2023-02-03 16:35:04,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 0: [2023-02-03 16:35:04,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_03-model_00-model_states.pt. 3: [2023-02-03 16:35:04,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 6: [2023-02-03 16:35:04,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 3: [2023-02-03 16:35:04,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 3: [2023-02-03 16:35:04,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 15: [2023-02-03 16:35:04,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 11: [2023-02-03 16:35:04,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 11: [2023-02-03 16:35:04,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 15: [2023-02-03 16:35:04,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 15: [2023-02-03 16:35:04,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 11: [2023-02-03 16:35:04,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 11: [2023-02-03 16:35:04,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 15: [2023-02-03 16:35:04,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 15: [2023-02-03 16:35:04,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 15: [2023-02-03 16:35:04,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 15: [2023-02-03 16:35:04,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 15: [2023-02-03 16:35:04,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 11: [2023-02-03 16:35:04,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 0: [2023-02-03 16:35:04,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 9: [2023-02-03 16:35:04,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 9: [2023-02-03 16:35:04,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 3: [2023-02-03 16:35:04,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 3: [2023-02-03 16:35:04,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 3: [2023-02-03 16:35:04,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 9: [2023-02-03 16:35:04,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 9: [2023-02-03 16:35:04,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 9: [2023-02-03 16:35:04,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 3: [2023-02-03 16:35:04,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 3: [2023-02-03 16:35:04,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 14: [2023-02-03 16:35:04,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 9: [2023-02-03 16:35:04,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 14: [2023-02-03 16:35:04,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 14: [2023-02-03 16:35:04,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 14: [2023-02-03 16:35:04,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 14: [2023-02-03 16:35:04,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 14: [2023-02-03 16:35:04,653] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 14: [2023-02-03 16:35:04,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 0: [2023-02-03 16:35:04,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 0: [2023-02-03 16:35:04,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 0: [2023-02-03 16:35:04,659] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 0: [2023-02-03 16:35:04,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 0: [2023-02-03 16:35:04,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 0: [2023-02-03 16:35:04,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 0: [2023-02-03 16:35:04,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 8: [2023-02-03 16:35:04,798] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 8: [2023-02-03 16:35:04,800] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 8: [2023-02-03 16:35:04,800] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 8: [2023-02-03 16:35:04,800] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 8: [2023-02-03 16:35:04,801] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 8: [2023-02-03 16:35:04,801] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 8: [2023-02-03 16:35:04,801] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 8: [2023-02-03 16:35:04,801] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 8: [2023-02-03 16:35:04,801] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 8: [2023-02-03 16:35:04,802] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 8: [2023-02-03 16:35:04,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 8: [2023-02-03 16:35:04,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 8: [2023-02-03 16:35:04,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 8: [2023-02-03 16:35:04,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 8: [2023-02-03 16:35:04,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 8: [2023-02-03 16:35:04,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 1: [2023-02-03 16:35:04,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 1: [2023-02-03 16:35:04,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 1: [2023-02-03 16:35:04,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 1: [2023-02-03 16:35:04,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 1: [2023-02-03 16:35:04,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 1: [2023-02-03 16:35:04,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 1: [2023-02-03 16:35:04,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 1: [2023-02-03 16:35:04,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 1: [2023-02-03 16:35:04,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 1: [2023-02-03 16:35:04,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 1: [2023-02-03 16:35:04,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 1: [2023-02-03 16:35:04,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 1: [2023-02-03 16:35:04,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 1: [2023-02-03 16:35:04,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 1: [2023-02-03 16:35:04,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 1: [2023-02-03 16:35:04,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 8: [2023-02-03 16:35:04,833] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 8: [2023-02-03 16:35:04,833] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 9: [2023-02-03 16:35:04,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 9: [2023-02-03 16:35:04,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 9: [2023-02-03 16:35:04,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 9: [2023-02-03 16:35:04,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 9: [2023-02-03 16:35:04,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 9: [2023-02-03 16:35:04,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 9: [2023-02-03 16:35:04,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 9: [2023-02-03 16:35:04,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 9: [2023-02-03 16:35:04,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 9: [2023-02-03 16:35:04,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 9: [2023-02-03 16:35:04,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 9: [2023-02-03 16:35:04,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 9: [2023-02-03 16:35:04,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 9: [2023-02-03 16:35:04,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 9: [2023-02-03 16:35:04,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 9: [2023-02-03 16:35:04,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 8: [2023-02-03 16:35:04,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 8: [2023-02-03 16:35:04,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 8: [2023-02-03 16:35:04,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 4: [2023-02-03 16:35:04,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 8: [2023-02-03 16:35:04,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 4: [2023-02-03 16:35:04,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 4: [2023-02-03 16:35:04,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 4: [2023-02-03 16:35:04,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 4: [2023-02-03 16:35:04,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 4: [2023-02-03 16:35:04,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 4: [2023-02-03 16:35:04,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 4: [2023-02-03 16:35:04,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 4: [2023-02-03 16:35:04,851] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 4: [2023-02-03 16:35:04,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 4: [2023-02-03 16:35:04,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 4: [2023-02-03 16:35:04,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 4: [2023-02-03 16:35:04,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 4: [2023-02-03 16:35:04,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 4: [2023-02-03 16:35:04,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 4: [2023-02-03 16:35:04,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 1: [2023-02-03 16:35:04,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 1: [2023-02-03 16:35:04,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 1: [2023-02-03 16:35:04,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 1: [2023-02-03 16:35:04,855] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 8: [2023-02-03 16:35:04,855] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 8: [2023-02-03 16:35:04,855] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 8: [2023-02-03 16:35:04,855] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 8: [2023-02-03 16:35:04,855] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 1: [2023-02-03 16:35:04,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 1: [2023-02-03 16:35:04,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 1: [2023-02-03 16:35:04,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 1: [2023-02-03 16:35:04,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 10: [2023-02-03 16:35:04,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 10: [2023-02-03 16:35:04,862] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 10: [2023-02-03 16:35:04,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 10: [2023-02-03 16:35:04,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 10: [2023-02-03 16:35:04,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 10: [2023-02-03 16:35:04,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 10: [2023-02-03 16:35:04,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 10: [2023-02-03 16:35:04,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 10: [2023-02-03 16:35:04,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 8: [2023-02-03 16:35:04,864] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 8: [2023-02-03 16:35:04,865] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 10: [2023-02-03 16:35:04,865] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 10: [2023-02-03 16:35:04,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 10: [2023-02-03 16:35:04,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 10: [2023-02-03 16:35:04,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 10: [2023-02-03 16:35:04,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 10: [2023-02-03 16:35:04,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 10: [2023-02-03 16:35:04,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 1: [2023-02-03 16:35:04,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 8: [2023-02-03 16:35:04,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:04,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 9: [2023-02-03 16:35:04,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 1: [2023-02-03 16:35:04,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 1: [2023-02-03 16:35:04,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 1: [2023-02-03 16:35:04,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 8: [2023-02-03 16:35:04,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 8: [2023-02-03 16:35:04,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 8: [2023-02-03 16:35:04,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 1: [2023-02-03 16:35:04,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 1: [2023-02-03 16:35:04,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 1: [2023-02-03 16:35:04,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 1: [2023-02-03 16:35:04,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:04,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 4: [2023-02-03 16:35:04,886] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 2: [2023-02-03 16:35:04,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 2: [2023-02-03 16:35:04,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 2: [2023-02-03 16:35:04,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 2: [2023-02-03 16:35:04,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 2: [2023-02-03 16:35:04,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 2: [2023-02-03 16:35:04,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 2: [2023-02-03 16:35:04,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 2: [2023-02-03 16:35:04,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 9: [2023-02-03 16:35:04,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 12: [2023-02-03 16:35:04,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 12: [2023-02-03 16:35:04,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 12: [2023-02-03 16:35:04,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 12: [2023-02-03 16:35:04,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 12: [2023-02-03 16:35:04,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 12: [2023-02-03 16:35:04,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 12: [2023-02-03 16:35:04,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 12: [2023-02-03 16:35:04,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 2: [2023-02-03 16:35:04,890] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 4: [2023-02-03 16:35:04,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 4: [2023-02-03 16:35:04,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 4: [2023-02-03 16:35:04,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 2: [2023-02-03 16:35:04,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 2: [2023-02-03 16:35:04,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 2: [2023-02-03 16:35:04,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 4: [2023-02-03 16:35:04,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 4: [2023-02-03 16:35:04,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 4: [2023-02-03 16:35:04,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 4: [2023-02-03 16:35:04,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 12: [2023-02-03 16:35:04,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 2: [2023-02-03 16:35:04,892] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 9: [2023-02-03 16:35:04,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 9: [2023-02-03 16:35:04,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 2: [2023-02-03 16:35:04,892] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 2: [2023-02-03 16:35:04,892] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 2: [2023-02-03 16:35:04,892] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 9: [2023-02-03 16:35:04,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 10: [2023-02-03 16:35:04,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 9: [2023-02-03 16:35:04,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 9: [2023-02-03 16:35:04,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 9: [2023-02-03 16:35:04,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 10: [2023-02-03 16:35:04,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 4: [2023-02-03 16:35:04,901] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 4: [2023-02-03 16:35:04,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 4: [2023-02-03 16:35:04,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 4: [2023-02-03 16:35:04,906] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 10: [2023-02-03 16:35:04,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 10: [2023-02-03 16:35:04,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 10: [2023-02-03 16:35:04,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:04,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:04,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:04,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 4: [2023-02-03 16:35:04,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 4: [2023-02-03 16:35:04,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 4: [2023-02-03 16:35:04,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 4: [2023-02-03 16:35:04,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:04,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:04,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:04,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 14: [2023-02-03 16:35:04,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 14: [2023-02-03 16:35:04,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 14: [2023-02-03 16:35:04,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 14: [2023-02-03 16:35:04,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 14: [2023-02-03 16:35:04,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 14: [2023-02-03 16:35:04,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 14: [2023-02-03 16:35:04,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 14: [2023-02-03 16:35:04,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 10: [2023-02-03 16:35:04,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 10: [2023-02-03 16:35:04,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 10: [2023-02-03 16:35:04,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 10: [2023-02-03 16:35:04,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 10: [2023-02-03 16:35:04,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 14: [2023-02-03 16:35:04,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 10: [2023-02-03 16:35:04,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 14: [2023-02-03 16:35:04,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 14: [2023-02-03 16:35:04,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 14: [2023-02-03 16:35:04,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 14: [2023-02-03 16:35:04,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 14: [2023-02-03 16:35:04,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 14: [2023-02-03 16:35:04,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 14: [2023-02-03 16:35:04,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 12: [2023-02-03 16:35:04,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 12: [2023-02-03 16:35:04,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 2: [2023-02-03 16:35:04,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 2: [2023-02-03 16:35:04,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 2: [2023-02-03 16:35:04,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 2: [2023-02-03 16:35:04,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 13: [2023-02-03 16:35:04,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 13: [2023-02-03 16:35:04,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 13: [2023-02-03 16:35:04,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 13: [2023-02-03 16:35:04,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 13: [2023-02-03 16:35:04,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 13: [2023-02-03 16:35:04,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 13: [2023-02-03 16:35:04,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 13: [2023-02-03 16:35:04,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 13: [2023-02-03 16:35:04,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 7: [2023-02-03 16:35:04,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 13: [2023-02-03 16:35:04,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 7: [2023-02-03 16:35:04,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 7: [2023-02-03 16:35:04,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 7: [2023-02-03 16:35:04,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 7: [2023-02-03 16:35:04,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 7: [2023-02-03 16:35:04,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 7: [2023-02-03 16:35:04,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 2: [2023-02-03 16:35:04,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 11: [2023-02-03 16:35:04,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 11: [2023-02-03 16:35:04,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 3: [2023-02-03 16:35:04,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 2: [2023-02-03 16:35:04,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 2: [2023-02-03 16:35:04,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 2: [2023-02-03 16:35:04,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 11: [2023-02-03 16:35:04,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 11: [2023-02-03 16:35:04,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 11: [2023-02-03 16:35:04,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 11: [2023-02-03 16:35:04,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 3: [2023-02-03 16:35:04,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 11: [2023-02-03 16:35:04,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 3: [2023-02-03 16:35:04,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 3: [2023-02-03 16:35:04,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 3: [2023-02-03 16:35:04,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 3: [2023-02-03 16:35:04,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 3: [2023-02-03 16:35:04,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 11: [2023-02-03 16:35:04,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 3: [2023-02-03 16:35:04,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 10: [2023-02-03 16:35:04,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 10: [2023-02-03 16:35:04,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 7: [2023-02-03 16:35:04,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 12: [2023-02-03 16:35:04,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 7: [2023-02-03 16:35:04,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 12: [2023-02-03 16:35:04,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 11: [2023-02-03 16:35:04,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 11: [2023-02-03 16:35:04,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 3: [2023-02-03 16:35:04,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 3: [2023-02-03 16:35:04,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 11: [2023-02-03 16:35:04,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 3: [2023-02-03 16:35:04,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 11: [2023-02-03 16:35:04,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 11: [2023-02-03 16:35:04,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 11: [2023-02-03 16:35:04,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 11: [2023-02-03 16:35:04,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 11: [2023-02-03 16:35:04,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 13: [2023-02-03 16:35:04,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 10: [2023-02-03 16:35:04,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 13: [2023-02-03 16:35:04,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 13: [2023-02-03 16:35:04,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 13: [2023-02-03 16:35:04,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 13: [2023-02-03 16:35:04,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 13: [2023-02-03 16:35:04,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 10: [2023-02-03 16:35:04,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 10: [2023-02-03 16:35:04,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 3: [2023-02-03 16:35:04,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 3: [2023-02-03 16:35:04,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 3: [2023-02-03 16:35:04,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 3: [2023-02-03 16:35:04,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 3: [2023-02-03 16:35:04,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 12: [2023-02-03 16:35:04,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 2: [2023-02-03 16:35:04,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 2: [2023-02-03 16:35:04,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 2: [2023-02-03 16:35:04,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 2: [2023-02-03 16:35:04,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 5: [2023-02-03 16:35:04,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 5: [2023-02-03 16:35:04,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 5: [2023-02-03 16:35:04,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 5: [2023-02-03 16:35:04,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 5: [2023-02-03 16:35:04,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 5: [2023-02-03 16:35:04,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 5: [2023-02-03 16:35:04,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 5: [2023-02-03 16:35:04,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 12: [2023-02-03 16:35:04,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 5: [2023-02-03 16:35:04,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 14: [2023-02-03 16:35:04,948] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 5: [2023-02-03 16:35:04,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 5: [2023-02-03 16:35:04,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 5: [2023-02-03 16:35:04,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 5: [2023-02-03 16:35:04,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 5: [2023-02-03 16:35:04,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 5: [2023-02-03 16:35:04,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 5: [2023-02-03 16:35:04,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 12: [2023-02-03 16:35:04,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 12: [2023-02-03 16:35:04,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 12: [2023-02-03 16:35:04,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 12: [2023-02-03 16:35:04,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 2: [2023-02-03 16:35:04,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 2: [2023-02-03 16:35:04,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 2: [2023-02-03 16:35:04,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 2: [2023-02-03 16:35:04,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:04,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 15: [2023-02-03 16:35:04,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 15: [2023-02-03 16:35:04,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 15: [2023-02-03 16:35:04,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 15: [2023-02-03 16:35:04,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 15: [2023-02-03 16:35:04,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 15: [2023-02-03 16:35:04,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 15: [2023-02-03 16:35:04,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 15: [2023-02-03 16:35:04,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 15: [2023-02-03 16:35:04,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 15: [2023-02-03 16:35:04,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 15: [2023-02-03 16:35:04,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 15: [2023-02-03 16:35:04,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 15: [2023-02-03 16:35:04,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 15: [2023-02-03 16:35:04,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 15: [2023-02-03 16:35:04,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 13: [2023-02-03 16:35:04,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 13: [2023-02-03 16:35:04,963] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 6: [2023-02-03 16:35:04,963] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 6: [2023-02-03 16:35:04,963] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 6: [2023-02-03 16:35:04,963] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 6: [2023-02-03 16:35:04,963] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 6: [2023-02-03 16:35:04,964] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 6: [2023-02-03 16:35:04,964] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 6: [2023-02-03 16:35:04,964] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 14: [2023-02-03 16:35:04,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 6: [2023-02-03 16:35:04,964] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 6: [2023-02-03 16:35:04,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 6: [2023-02-03 16:35:04,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 6: [2023-02-03 16:35:04,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 7: [2023-02-03 16:35:04,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 7: [2023-02-03 16:35:04,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 11: [2023-02-03 16:35:04,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 11: [2023-02-03 16:35:04,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 6: [2023-02-03 16:35:04,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 6: [2023-02-03 16:35:04,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 6: [2023-02-03 16:35:04,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 6: [2023-02-03 16:35:04,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 6: [2023-02-03 16:35:04,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 3: [2023-02-03 16:35:04,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 3: [2023-02-03 16:35:04,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 3: [2023-02-03 16:35:04,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 11: [2023-02-03 16:35:04,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 0: [2023-02-03 16:35:04,972] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 0: [2023-02-03 16:35:04,972] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 0: [2023-02-03 16:35:04,972] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 0: [2023-02-03 16:35:04,972] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 0: [2023-02-03 16:35:04,972] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 0: [2023-02-03 16:35:04,972] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 0: [2023-02-03 16:35:04,972] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 0: [2023-02-03 16:35:04,972] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 14: [2023-02-03 16:35:04,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 14: [2023-02-03 16:35:04,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 14: [2023-02-03 16:35:04,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 7: [2023-02-03 16:35:04,974] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 14: [2023-02-03 16:35:04,974] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 14: [2023-02-03 16:35:04,974] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 14: [2023-02-03 16:35:04,974] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 14: [2023-02-03 16:35:04,974] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 13: [2023-02-03 16:35:04,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 13: [2023-02-03 16:35:04,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 3: [2023-02-03 16:35:04,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 3: [2023-02-03 16:35:04,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 11: [2023-02-03 16:35:04,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 11: [2023-02-03 16:35:04,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 0: [2023-02-03 16:35:04,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 11: [2023-02-03 16:35:04,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 11: [2023-02-03 16:35:04,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 0: [2023-02-03 16:35:04,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 7: [2023-02-03 16:35:04,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 7: [2023-02-03 16:35:04,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 0: [2023-02-03 16:35:04,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 0: [2023-02-03 16:35:04,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 0: [2023-02-03 16:35:04,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 0: [2023-02-03 16:35:04,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 0: [2023-02-03 16:35:04,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 3: [2023-02-03 16:35:04,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 3: [2023-02-03 16:35:04,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 3: [2023-02-03 16:35:04,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 0: [2023-02-03 16:35:04,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt... 7: [2023-02-03 16:35:04,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 7: [2023-02-03 16:35:04,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 11: [2023-02-03 16:35:04,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 7: [2023-02-03 16:35:04,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 11: [2023-02-03 16:35:04,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 13: [2023-02-03 16:35:04,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 13: [2023-02-03 16:35:04,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 13: [2023-02-03 16:35:04,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 13: [2023-02-03 16:35:04,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 13: [2023-02-03 16:35:04,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 5: [2023-02-03 16:35:04,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 3: [2023-02-03 16:35:04,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 5: [2023-02-03 16:35:04,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 5: [2023-02-03 16:35:04,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 5: [2023-02-03 16:35:04,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 11: [2023-02-03 16:35:04,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 3: [2023-02-03 16:35:04,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 5: [2023-02-03 16:35:04,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 5: [2023-02-03 16:35:04,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 5: [2023-02-03 16:35:04,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 5: [2023-02-03 16:35:04,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 13: [2023-02-03 16:35:04,987] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 3: [2023-02-03 16:35:04,988] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 7: [2023-02-03 16:35:04,989] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 11: [2023-02-03 16:35:04,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 14: [2023-02-03 16:35:04,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 11: [2023-02-03 16:35:04,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 14: [2023-02-03 16:35:04,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 3: [2023-02-03 16:35:04,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 3: [2023-02-03 16:35:04,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:04,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 14: [2023-02-03 16:35:04,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 14: [2023-02-03 16:35:04,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 14: [2023-02-03 16:35:04,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 11: [2023-02-03 16:35:04,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 11: [2023-02-03 16:35:04,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:04,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 11: [2023-02-03 16:35:04,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:04,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 15: [2023-02-03 16:35:04,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 7: [2023-02-03 16:35:04,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 14: [2023-02-03 16:35:04,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 14: [2023-02-03 16:35:04,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 3: [2023-02-03 16:35:04,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:04,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 15: [2023-02-03 16:35:04,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 15: [2023-02-03 16:35:04,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 3: [2023-02-03 16:35:04,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:04,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 3: [2023-02-03 16:35:04,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 7: [2023-02-03 16:35:04,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 7: [2023-02-03 16:35:04,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 7: [2023-02-03 16:35:04,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 13: [2023-02-03 16:35:05,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 6: [2023-02-03 16:35:05,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 6: [2023-02-03 16:35:05,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 6: [2023-02-03 16:35:05,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 5: [2023-02-03 16:35:05,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 13: [2023-02-03 16:35:05,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 13: [2023-02-03 16:35:05,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 5: [2023-02-03 16:35:05,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 11: [2023-02-03 16:35:05,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 5: [2023-02-03 16:35:05,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 5: [2023-02-03 16:35:05,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 13: [2023-02-03 16:35:05,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 13: [2023-02-03 16:35:05,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 13: [2023-02-03 16:35:05,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 6: [2023-02-03 16:35:05,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 5: [2023-02-03 16:35:05,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 5: [2023-02-03 16:35:05,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 5: [2023-02-03 16:35:05,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 5: [2023-02-03 16:35:05,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:05,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 6: [2023-02-03 16:35:05,010] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 6: [2023-02-03 16:35:05,010] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 6: [2023-02-03 16:35:05,011] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 6: [2023-02-03 16:35:05,011] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 15: [2023-02-03 16:35:05,013] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:05,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:05,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:05,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:05,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:05,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:05,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 6: [2023-02-03 16:35:05,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 6: [2023-02-03 16:35:05,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 6: [2023-02-03 16:35:05,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 0: [2023-02-03 16:35:05,024] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 6: [2023-02-03 16:35:05,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 6: [2023-02-03 16:35:05,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 6: [2023-02-03 16:35:05,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 6: [2023-02-03 16:35:05,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 6: [2023-02-03 16:35:05,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 0: [2023-02-03 16:35:05,038] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 0: [2023-02-03 16:35:05,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 0: [2023-02-03 16:35:05,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 0: [2023-02-03 16:35:05,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 0: [2023-02-03 16:35:05,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 0: [2023-02-03 16:35:05,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 0: [2023-02-03 16:35:05,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 0: [2023-02-03 16:35:05,042] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_04-model_00-model_states.pt. 0: [2023-02-03 16:35:05,059] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 0: [2023-02-03 16:35:05,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 0: [2023-02-03 16:35:05,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 0: [2023-02-03 16:35:05,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 0: [2023-02-03 16:35:05,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 0: [2023-02-03 16:35:05,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 0: [2023-02-03 16:35:05,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 8: [2023-02-03 16:35:05,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 8: [2023-02-03 16:35:05,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 8: [2023-02-03 16:35:05,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 8: [2023-02-03 16:35:05,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 8: [2023-02-03 16:35:05,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 8: [2023-02-03 16:35:05,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 8: [2023-02-03 16:35:05,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 8: [2023-02-03 16:35:05,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 8: [2023-02-03 16:35:05,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 8: [2023-02-03 16:35:05,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 8: [2023-02-03 16:35:05,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 8: [2023-02-03 16:35:05,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 8: [2023-02-03 16:35:05,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 8: [2023-02-03 16:35:05,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 8: [2023-02-03 16:35:05,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 8: [2023-02-03 16:35:05,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 1: [2023-02-03 16:35:05,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 1: [2023-02-03 16:35:05,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 1: [2023-02-03 16:35:05,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 1: [2023-02-03 16:35:05,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 1: [2023-02-03 16:35:05,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 1: [2023-02-03 16:35:05,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 1: [2023-02-03 16:35:05,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 1: [2023-02-03 16:35:05,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 8: [2023-02-03 16:35:05,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 1: [2023-02-03 16:35:05,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 1: [2023-02-03 16:35:05,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 1: [2023-02-03 16:35:05,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 1: [2023-02-03 16:35:05,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 8: [2023-02-03 16:35:05,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 1: [2023-02-03 16:35:05,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 1: [2023-02-03 16:35:05,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 1: [2023-02-03 16:35:05,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 1: [2023-02-03 16:35:05,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 8: [2023-02-03 16:35:05,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 8: [2023-02-03 16:35:05,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 8: [2023-02-03 16:35:05,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 8: [2023-02-03 16:35:05,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 8: [2023-02-03 16:35:05,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 8: [2023-02-03 16:35:05,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 8: [2023-02-03 16:35:05,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 8: [2023-02-03 16:35:05,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 4: [2023-02-03 16:35:05,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 4: [2023-02-03 16:35:05,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 4: [2023-02-03 16:35:05,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 4: [2023-02-03 16:35:05,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 4: [2023-02-03 16:35:05,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 4: [2023-02-03 16:35:05,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 4: [2023-02-03 16:35:05,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 4: [2023-02-03 16:35:05,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 8: [2023-02-03 16:35:05,235] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 9: [2023-02-03 16:35:05,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 14: [2023-02-03 16:35:05,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 14: [2023-02-03 16:35:05,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 14: [2023-02-03 16:35:05,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 14: [2023-02-03 16:35:05,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 14: [2023-02-03 16:35:05,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 14: [2023-02-03 16:35:05,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 14: [2023-02-03 16:35:05,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 4: [2023-02-03 16:35:05,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 14: [2023-02-03 16:35:05,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 9: [2023-02-03 16:35:05,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 4: [2023-02-03 16:35:05,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 4: [2023-02-03 16:35:05,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 4: [2023-02-03 16:35:05,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 4: [2023-02-03 16:35:05,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:05,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 4: [2023-02-03 16:35:05,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 4: [2023-02-03 16:35:05,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 4: [2023-02-03 16:35:05,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:05,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 9: [2023-02-03 16:35:05,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 8: [2023-02-03 16:35:05,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 14: [2023-02-03 16:35:05,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:05,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 9: [2023-02-03 16:35:05,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 9: [2023-02-03 16:35:05,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 9: [2023-02-03 16:35:05,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 1: [2023-02-03 16:35:05,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 1: [2023-02-03 16:35:05,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 1: [2023-02-03 16:35:05,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 9: [2023-02-03 16:35:05,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 3: [2023-02-03 16:35:05,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 3: [2023-02-03 16:35:05,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 3: [2023-02-03 16:35:05,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 3: [2023-02-03 16:35:05,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 3: [2023-02-03 16:35:05,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 3: [2023-02-03 16:35:05,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 3: [2023-02-03 16:35:05,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 3: [2023-02-03 16:35:05,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 8: [2023-02-03 16:35:05,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 1: [2023-02-03 16:35:05,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 8: [2023-02-03 16:35:05,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 8: [2023-02-03 16:35:05,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 1: [2023-02-03 16:35:05,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 1: [2023-02-03 16:35:05,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 3: [2023-02-03 16:35:05,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 3: [2023-02-03 16:35:05,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 3: [2023-02-03 16:35:05,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 14: [2023-02-03 16:35:05,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 14: [2023-02-03 16:35:05,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:05,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:05,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 14: [2023-02-03 16:35:05,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 14: [2023-02-03 16:35:05,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:05,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 14: [2023-02-03 16:35:05,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 14: [2023-02-03 16:35:05,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:05,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:05,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 14: [2023-02-03 16:35:05,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:05,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 3: [2023-02-03 16:35:05,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 3: [2023-02-03 16:35:05,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 3: [2023-02-03 16:35:05,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 3: [2023-02-03 16:35:05,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 3: [2023-02-03 16:35:05,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 1: [2023-02-03 16:35:05,247] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 1: [2023-02-03 16:35:05,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 1: [2023-02-03 16:35:05,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 1: [2023-02-03 16:35:05,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 8: [2023-02-03 16:35:05,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 1: [2023-02-03 16:35:05,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 1: [2023-02-03 16:35:05,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 1: [2023-02-03 16:35:05,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 1: [2023-02-03 16:35:05,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 1: [2023-02-03 16:35:05,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 2: [2023-02-03 16:35:05,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 2: [2023-02-03 16:35:05,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 2: [2023-02-03 16:35:05,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 2: [2023-02-03 16:35:05,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 2: [2023-02-03 16:35:05,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 2: [2023-02-03 16:35:05,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 2: [2023-02-03 16:35:05,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 2: [2023-02-03 16:35:05,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 9: [2023-02-03 16:35:05,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 1: [2023-02-03 16:35:05,268] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 2: [2023-02-03 16:35:05,268] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 2: [2023-02-03 16:35:05,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 2: [2023-02-03 16:35:05,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 2: [2023-02-03 16:35:05,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 2: [2023-02-03 16:35:05,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 2: [2023-02-03 16:35:05,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 2: [2023-02-03 16:35:05,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 2: [2023-02-03 16:35:05,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:05,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 4: [2023-02-03 16:35:05,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 14: [2023-02-03 16:35:05,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 3: [2023-02-03 16:35:05,276] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 3: [2023-02-03 16:35:05,276] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 3: [2023-02-03 16:35:05,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 4: [2023-02-03 16:35:05,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 4: [2023-02-03 16:35:05,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 4: [2023-02-03 16:35:05,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 4: [2023-02-03 16:35:05,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 4: [2023-02-03 16:35:05,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 4: [2023-02-03 16:35:05,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 4: [2023-02-03 16:35:05,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 11: [2023-02-03 16:35:05,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 11: [2023-02-03 16:35:05,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 11: [2023-02-03 16:35:05,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 11: [2023-02-03 16:35:05,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 11: [2023-02-03 16:35:05,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 11: [2023-02-03 16:35:05,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 9: [2023-02-03 16:35:05,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 11: [2023-02-03 16:35:05,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 11: [2023-02-03 16:35:05,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 11: [2023-02-03 16:35:05,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 11: [2023-02-03 16:35:05,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 11: [2023-02-03 16:35:05,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:05,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 15: [2023-02-03 16:35:05,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 11: [2023-02-03 16:35:05,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:05,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 15: [2023-02-03 16:35:05,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 15: [2023-02-03 16:35:05,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 15: [2023-02-03 16:35:05,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 15: [2023-02-03 16:35:05,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 11: [2023-02-03 16:35:05,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 11: [2023-02-03 16:35:05,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 11: [2023-02-03 16:35:05,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 11: [2023-02-03 16:35:05,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:05,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 6: [2023-02-03 16:35:05,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 3: [2023-02-03 16:35:05,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 3: [2023-02-03 16:35:05,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 6: [2023-02-03 16:35:05,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 6: [2023-02-03 16:35:05,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 6: [2023-02-03 16:35:05,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 6: [2023-02-03 16:35:05,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 6: [2023-02-03 16:35:05,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 6: [2023-02-03 16:35:05,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 10: [2023-02-03 16:35:05,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 6: [2023-02-03 16:35:05,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 3: [2023-02-03 16:35:05,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 3: [2023-02-03 16:35:05,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 3: [2023-02-03 16:35:05,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 10: [2023-02-03 16:35:05,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 15: [2023-02-03 16:35:05,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 13: [2023-02-03 16:35:05,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 13: [2023-02-03 16:35:05,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 10: [2023-02-03 16:35:05,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 13: [2023-02-03 16:35:05,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 0: [2023-02-03 16:35:05,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 0: [2023-02-03 16:35:05,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:05,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:05,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:05,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 10: [2023-02-03 16:35:05,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 10: [2023-02-03 16:35:05,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 10: [2023-02-03 16:35:05,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 6: [2023-02-03 16:35:05,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:05,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 10: [2023-02-03 16:35:05,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 0: [2023-02-03 16:35:05,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 0: [2023-02-03 16:35:05,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 0: [2023-02-03 16:35:05,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 0: [2023-02-03 16:35:05,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 0: [2023-02-03 16:35:05,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 15: [2023-02-03 16:35:05,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 15: [2023-02-03 16:35:05,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 10: [2023-02-03 16:35:05,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 15: [2023-02-03 16:35:05,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 0: [2023-02-03 16:35:05,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 6: [2023-02-03 16:35:05,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 10: [2023-02-03 16:35:05,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 6: [2023-02-03 16:35:05,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 4: [2023-02-03 16:35:05,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 10: [2023-02-03 16:35:05,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:05,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 6: [2023-02-03 16:35:05,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 6: [2023-02-03 16:35:05,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 6: [2023-02-03 16:35:05,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 10: [2023-02-03 16:35:05,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 6: [2023-02-03 16:35:05,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 6: [2023-02-03 16:35:05,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:05,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 9: [2023-02-03 16:35:05,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 14: [2023-02-03 16:35:05,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 10: [2023-02-03 16:35:05,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 10: [2023-02-03 16:35:05,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 10: [2023-02-03 16:35:05,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 10: [2023-02-03 16:35:05,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 10: [2023-02-03 16:35:05,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 9: [2023-02-03 16:35:05,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 9: [2023-02-03 16:35:05,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 9: [2023-02-03 16:35:05,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 9: [2023-02-03 16:35:05,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 0: [2023-02-03 16:35:05,293] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 0: [2023-02-03 16:35:05,293] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 4: [2023-02-03 16:35:05,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 3: [2023-02-03 16:35:05,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 0: [2023-02-03 16:35:05,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 0: [2023-02-03 16:35:05,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 0: [2023-02-03 16:35:05,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 0: [2023-02-03 16:35:05,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 0: [2023-02-03 16:35:05,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 4: [2023-02-03 16:35:05,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 4: [2023-02-03 16:35:05,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 14: [2023-02-03 16:35:05,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 14: [2023-02-03 16:35:05,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 14: [2023-02-03 16:35:05,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 14: [2023-02-03 16:35:05,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 14: [2023-02-03 16:35:05,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 14: [2023-02-03 16:35:05,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 14: [2023-02-03 16:35:05,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 3: [2023-02-03 16:35:05,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 3: [2023-02-03 16:35:05,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 4: [2023-02-03 16:35:05,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 3: [2023-02-03 16:35:05,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 4: [2023-02-03 16:35:05,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 4: [2023-02-03 16:35:05,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 4: [2023-02-03 16:35:05,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 3: [2023-02-03 16:35:05,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 3: [2023-02-03 16:35:05,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 3: [2023-02-03 16:35:05,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 2: [2023-02-03 16:35:05,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 2: [2023-02-03 16:35:05,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 3: [2023-02-03 16:35:05,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 2: [2023-02-03 16:35:05,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 2: [2023-02-03 16:35:05,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 2: [2023-02-03 16:35:05,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 2: [2023-02-03 16:35:05,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 2: [2023-02-03 16:35:05,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 2: [2023-02-03 16:35:05,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 9: [2023-02-03 16:35:05,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 9: [2023-02-03 16:35:05,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 9: [2023-02-03 16:35:05,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 13: [2023-02-03 16:35:05,312] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,312] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 9: [2023-02-03 16:35:05,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 9: [2023-02-03 16:35:05,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 9: [2023-02-03 16:35:05,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 11: [2023-02-03 16:35:05,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 14: [2023-02-03 16:35:05,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 14: [2023-02-03 16:35:05,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 10: [2023-02-03 16:35:05,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 14: [2023-02-03 16:35:05,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 10: [2023-02-03 16:35:05,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 11: [2023-02-03 16:35:05,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 11: [2023-02-03 16:35:05,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 14: [2023-02-03 16:35:05,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 14: [2023-02-03 16:35:05,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 14: [2023-02-03 16:35:05,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 6: [2023-02-03 16:35:05,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 6: [2023-02-03 16:35:05,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 14: [2023-02-03 16:35:05,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 6: [2023-02-03 16:35:05,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 2: [2023-02-03 16:35:05,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 15: [2023-02-03 16:35:05,322] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 15: [2023-02-03 16:35:05,322] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 15: [2023-02-03 16:35:05,322] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 7: [2023-02-03 16:35:05,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 15: [2023-02-03 16:35:05,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 15: [2023-02-03 16:35:05,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 15: [2023-02-03 16:35:05,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 15: [2023-02-03 16:35:05,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 7: [2023-02-03 16:35:05,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 7: [2023-02-03 16:35:05,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 7: [2023-02-03 16:35:05,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 7: [2023-02-03 16:35:05,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 2: [2023-02-03 16:35:05,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 7: [2023-02-03 16:35:05,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 7: [2023-02-03 16:35:05,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 11: [2023-02-03 16:35:05,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 11: [2023-02-03 16:35:05,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 11: [2023-02-03 16:35:05,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 11: [2023-02-03 16:35:05,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 7: [2023-02-03 16:35:05,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 2: [2023-02-03 16:35:05,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 13: [2023-02-03 16:35:05,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 2: [2023-02-03 16:35:05,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 11: [2023-02-03 16:35:05,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 15: [2023-02-03 16:35:05,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 11: [2023-02-03 16:35:05,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 7: [2023-02-03 16:35:05,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 7: [2023-02-03 16:35:05,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 7: [2023-02-03 16:35:05,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 7: [2023-02-03 16:35:05,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 2: [2023-02-03 16:35:05,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 2: [2023-02-03 16:35:05,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 2: [2023-02-03 16:35:05,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 2: [2023-02-03 16:35:05,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 7: [2023-02-03 16:35:05,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 7: [2023-02-03 16:35:05,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 7: [2023-02-03 16:35:05,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 7: [2023-02-03 16:35:05,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 10: [2023-02-03 16:35:05,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 6: [2023-02-03 16:35:05,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 6: [2023-02-03 16:35:05,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 6: [2023-02-03 16:35:05,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 6: [2023-02-03 16:35:05,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 10: [2023-02-03 16:35:05,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 13: [2023-02-03 16:35:05,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 10: [2023-02-03 16:35:05,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 11: [2023-02-03 16:35:05,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 6: [2023-02-03 16:35:05,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 11: [2023-02-03 16:35:05,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 10: [2023-02-03 16:35:05,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 10: [2023-02-03 16:35:05,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 10: [2023-02-03 16:35:05,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 10: [2023-02-03 16:35:05,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 10: [2023-02-03 16:35:05,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 6: [2023-02-03 16:35:05,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 6: [2023-02-03 16:35:05,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 6: [2023-02-03 16:35:05,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 11: [2023-02-03 16:35:05,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 15: [2023-02-03 16:35:05,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 10: [2023-02-03 16:35:05,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 15: [2023-02-03 16:35:05,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 15: [2023-02-03 16:35:05,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 15: [2023-02-03 16:35:05,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 15: [2023-02-03 16:35:05,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 15: [2023-02-03 16:35:05,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 15: [2023-02-03 16:35:05,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 11: [2023-02-03 16:35:05,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 0: [2023-02-03 16:35:05,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 11: [2023-02-03 16:35:05,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 11: [2023-02-03 16:35:05,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 15: [2023-02-03 16:35:05,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 11: [2023-02-03 16:35:05,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 6: [2023-02-03 16:35:05,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 13: [2023-02-03 16:35:05,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 6: [2023-02-03 16:35:05,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 6: [2023-02-03 16:35:05,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 13: [2023-02-03 16:35:05,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 6: [2023-02-03 16:35:05,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 6: [2023-02-03 16:35:05,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 13: [2023-02-03 16:35:05,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 13: [2023-02-03 16:35:05,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 13: [2023-02-03 16:35:05,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,353] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 0: [2023-02-03 16:35:05,353] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 0: [2023-02-03 16:35:05,353] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 13: [2023-02-03 16:35:05,354] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 10: [2023-02-03 16:35:05,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 10: [2023-02-03 16:35:05,357] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 10: [2023-02-03 16:35:05,358] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 10: [2023-02-03 16:35:05,358] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,359] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,359] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 7: [2023-02-03 16:35:05,359] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 7: [2023-02-03 16:35:05,359] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 10: [2023-02-03 16:35:05,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 0: [2023-02-03 16:35:05,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 0: [2023-02-03 16:35:05,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 7: [2023-02-03 16:35:05,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 7: [2023-02-03 16:35:05,369] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 7: [2023-02-03 16:35:05,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 7: [2023-02-03 16:35:05,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 7: [2023-02-03 16:35:05,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 0: [2023-02-03 16:35:05,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 7: [2023-02-03 16:35:05,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 7: [2023-02-03 16:35:05,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 5: [2023-02-03 16:35:05,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 5: [2023-02-03 16:35:05,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 5: [2023-02-03 16:35:05,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 5: [2023-02-03 16:35:05,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 5: [2023-02-03 16:35:05,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 5: [2023-02-03 16:35:05,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 5: [2023-02-03 16:35:05,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 5: [2023-02-03 16:35:05,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 5: [2023-02-03 16:35:05,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 5: [2023-02-03 16:35:05,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 5: [2023-02-03 16:35:05,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 5: [2023-02-03 16:35:05,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 5: [2023-02-03 16:35:05,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 5: [2023-02-03 16:35:05,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 5: [2023-02-03 16:35:05,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 5: [2023-02-03 16:35:05,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 0: [2023-02-03 16:35:05,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 7: [2023-02-03 16:35:05,383] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 7: [2023-02-03 16:35:05,383] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,383] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 7: [2023-02-03 16:35:05,384] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 7: [2023-02-03 16:35:05,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 7: [2023-02-03 16:35:05,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 12: [2023-02-03 16:35:05,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 7: [2023-02-03 16:35:05,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 12: [2023-02-03 16:35:05,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 12: [2023-02-03 16:35:05,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 12: [2023-02-03 16:35:05,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 12: [2023-02-03 16:35:05,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 12: [2023-02-03 16:35:05,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 12: [2023-02-03 16:35:05,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 12: [2023-02-03 16:35:05,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 12: [2023-02-03 16:35:05,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 12: [2023-02-03 16:35:05,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 12: [2023-02-03 16:35:05,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 12: [2023-02-03 16:35:05,397] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 12: [2023-02-03 16:35:05,397] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 12: [2023-02-03 16:35:05,397] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 12: [2023-02-03 16:35:05,398] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 12: [2023-02-03 16:35:05,398] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt... 7: [2023-02-03 16:35:05,399] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 5: [2023-02-03 16:35:05,417] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 5: [2023-02-03 16:35:05,417] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 5: [2023-02-03 16:35:05,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 5: [2023-02-03 16:35:05,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 5: [2023-02-03 16:35:05,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 5: [2023-02-03 16:35:05,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 5: [2023-02-03 16:35:05,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 5: [2023-02-03 16:35:05,425] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 12: [2023-02-03 16:35:05,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 12: [2023-02-03 16:35:05,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 12: [2023-02-03 16:35:05,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 12: [2023-02-03 16:35:05,434] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 5: [2023-02-03 16:35:05,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 5: [2023-02-03 16:35:05,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 5: [2023-02-03 16:35:05,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 5: [2023-02-03 16:35:05,437] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 5: [2023-02-03 16:35:05,437] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 5: [2023-02-03 16:35:05,437] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 12: [2023-02-03 16:35:05,437] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 12: [2023-02-03 16:35:05,437] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 12: [2023-02-03 16:35:05,437] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 5: [2023-02-03 16:35:05,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 12: [2023-02-03 16:35:05,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 5: [2023-02-03 16:35:05,443] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 12: [2023-02-03 16:35:05,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 12: [2023-02-03 16:35:05,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 12: [2023-02-03 16:35:05,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 12: [2023-02-03 16:35:05,453] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_05-model_00-model_states.pt. 12: [2023-02-03 16:35:05,453] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 12: [2023-02-03 16:35:05,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 12: [2023-02-03 16:35:05,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 12: [2023-02-03 16:35:05,466] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 1: [2023-02-03 16:35:05,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 1: [2023-02-03 16:35:05,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 1: [2023-02-03 16:35:05,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 1: [2023-02-03 16:35:05,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 1: [2023-02-03 16:35:05,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 1: [2023-02-03 16:35:05,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 1: [2023-02-03 16:35:05,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 1: [2023-02-03 16:35:05,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 1: [2023-02-03 16:35:05,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 1: [2023-02-03 16:35:05,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 1: [2023-02-03 16:35:05,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 1: [2023-02-03 16:35:05,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 1: [2023-02-03 16:35:05,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 1: [2023-02-03 16:35:05,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 1: [2023-02-03 16:35:05,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 1: [2023-02-03 16:35:05,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 1: [2023-02-03 16:35:05,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 1: [2023-02-03 16:35:05,565] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 1: [2023-02-03 16:35:05,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 1: [2023-02-03 16:35:05,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 1: [2023-02-03 16:35:05,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 1: [2023-02-03 16:35:05,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 1: [2023-02-03 16:35:05,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 1: [2023-02-03 16:35:05,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 1: [2023-02-03 16:35:05,571] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 1: [2023-02-03 16:35:05,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 1: [2023-02-03 16:35:05,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 8: [2023-02-03 16:35:05,583] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 8: [2023-02-03 16:35:05,583] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 8: [2023-02-03 16:35:05,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 1: [2023-02-03 16:35:05,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 8: [2023-02-03 16:35:05,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 8: [2023-02-03 16:35:05,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 8: [2023-02-03 16:35:05,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 8: [2023-02-03 16:35:05,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 8: [2023-02-03 16:35:05,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 8: [2023-02-03 16:35:05,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 8: [2023-02-03 16:35:05,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 1: [2023-02-03 16:35:05,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 1: [2023-02-03 16:35:05,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 1: [2023-02-03 16:35:05,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 1: [2023-02-03 16:35:05,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 8: [2023-02-03 16:35:05,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 8: [2023-02-03 16:35:05,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 8: [2023-02-03 16:35:05,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 8: [2023-02-03 16:35:05,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 8: [2023-02-03 16:35:05,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 8: [2023-02-03 16:35:05,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 3: [2023-02-03 16:35:05,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 3: [2023-02-03 16:35:05,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 3: [2023-02-03 16:35:05,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 3: [2023-02-03 16:35:05,601] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 3: [2023-02-03 16:35:05,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 3: [2023-02-03 16:35:05,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 3: [2023-02-03 16:35:05,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 3: [2023-02-03 16:35:05,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 3: [2023-02-03 16:35:05,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 3: [2023-02-03 16:35:05,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 3: [2023-02-03 16:35:05,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 3: [2023-02-03 16:35:05,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 3: [2023-02-03 16:35:05,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 3: [2023-02-03 16:35:05,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 3: [2023-02-03 16:35:05,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 3: [2023-02-03 16:35:05,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 7: [2023-02-03 16:35:05,610] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 7: [2023-02-03 16:35:05,610] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 7: [2023-02-03 16:35:05,610] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 7: [2023-02-03 16:35:05,610] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 7: [2023-02-03 16:35:05,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 7: [2023-02-03 16:35:05,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 7: [2023-02-03 16:35:05,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 7: [2023-02-03 16:35:05,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 4: [2023-02-03 16:35:05,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 7: [2023-02-03 16:35:05,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 7: [2023-02-03 16:35:05,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 4: [2023-02-03 16:35:05,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 4: [2023-02-03 16:35:05,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 4: [2023-02-03 16:35:05,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 4: [2023-02-03 16:35:05,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 4: [2023-02-03 16:35:05,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 4: [2023-02-03 16:35:05,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 7: [2023-02-03 16:35:05,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 4: [2023-02-03 16:35:05,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 13: [2023-02-03 16:35:05,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 13: [2023-02-03 16:35:05,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 13: [2023-02-03 16:35:05,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 13: [2023-02-03 16:35:05,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 7: [2023-02-03 16:35:05,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 13: [2023-02-03 16:35:05,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 13: [2023-02-03 16:35:05,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 13: [2023-02-03 16:35:05,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 13: [2023-02-03 16:35:05,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 7: [2023-02-03 16:35:05,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 7: [2023-02-03 16:35:05,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 7: [2023-02-03 16:35:05,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 7: [2023-02-03 16:35:05,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 0: [2023-02-03 16:35:05,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 0: [2023-02-03 16:35:05,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 0: [2023-02-03 16:35:05,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 0: [2023-02-03 16:35:05,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 4: [2023-02-03 16:35:05,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 0: [2023-02-03 16:35:05,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 0: [2023-02-03 16:35:05,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 13: [2023-02-03 16:35:05,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 4: [2023-02-03 16:35:05,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 4: [2023-02-03 16:35:05,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 4: [2023-02-03 16:35:05,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 4: [2023-02-03 16:35:05,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 4: [2023-02-03 16:35:05,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 4: [2023-02-03 16:35:05,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 13: [2023-02-03 16:35:05,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 4: [2023-02-03 16:35:05,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 8: [2023-02-03 16:35:05,618] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 8: [2023-02-03 16:35:05,618] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 0: [2023-02-03 16:35:05,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 9: [2023-02-03 16:35:05,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 9: [2023-02-03 16:35:05,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 13: [2023-02-03 16:35:05,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 13: [2023-02-03 16:35:05,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 9: [2023-02-03 16:35:05,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 13: [2023-02-03 16:35:05,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 13: [2023-02-03 16:35:05,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 9: [2023-02-03 16:35:05,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 13: [2023-02-03 16:35:05,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 13: [2023-02-03 16:35:05,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 9: [2023-02-03 16:35:05,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 9: [2023-02-03 16:35:05,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 9: [2023-02-03 16:35:05,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 9: [2023-02-03 16:35:05,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 9: [2023-02-03 16:35:05,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 9: [2023-02-03 16:35:05,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 9: [2023-02-03 16:35:05,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 9: [2023-02-03 16:35:05,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 9: [2023-02-03 16:35:05,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 9: [2023-02-03 16:35:05,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 9: [2023-02-03 16:35:05,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 9: [2023-02-03 16:35:05,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 3: [2023-02-03 16:35:05,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 8: [2023-02-03 16:35:05,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 8: [2023-02-03 16:35:05,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 8: [2023-02-03 16:35:05,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 8: [2023-02-03 16:35:05,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 8: [2023-02-03 16:35:05,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 8: [2023-02-03 16:35:05,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 8: [2023-02-03 16:35:05,639] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 3: [2023-02-03 16:35:05,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 3: [2023-02-03 16:35:05,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 3: [2023-02-03 16:35:05,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 7: [2023-02-03 16:35:05,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 7: [2023-02-03 16:35:05,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 7: [2023-02-03 16:35:05,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 3: [2023-02-03 16:35:05,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 3: [2023-02-03 16:35:05,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 3: [2023-02-03 16:35:05,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 3: [2023-02-03 16:35:05,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 13: [2023-02-03 16:35:05,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 3: [2023-02-03 16:35:05,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 13: [2023-02-03 16:35:05,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 8: [2023-02-03 16:35:05,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 9: [2023-02-03 16:35:05,651] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 4: [2023-02-03 16:35:05,651] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 7: [2023-02-03 16:35:05,653] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 4: [2023-02-03 16:35:05,653] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 4: [2023-02-03 16:35:05,653] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 4: [2023-02-03 16:35:05,653] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 9: [2023-02-03 16:35:05,654] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 4: [2023-02-03 16:35:05,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 4: [2023-02-03 16:35:05,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 4: [2023-02-03 16:35:05,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 4: [2023-02-03 16:35:05,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 8: [2023-02-03 16:35:05,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 8: [2023-02-03 16:35:05,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 3: [2023-02-03 16:35:05,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 3: [2023-02-03 16:35:05,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 8: [2023-02-03 16:35:05,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 8: [2023-02-03 16:35:05,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 8: [2023-02-03 16:35:05,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 7: [2023-02-03 16:35:05,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 7: [2023-02-03 16:35:05,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 7: [2023-02-03 16:35:05,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 7: [2023-02-03 16:35:05,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 7: [2023-02-03 16:35:05,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 7: [2023-02-03 16:35:05,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 7: [2023-02-03 16:35:05,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 0: [2023-02-03 16:35:05,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 13: [2023-02-03 16:35:05,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 3: [2023-02-03 16:35:05,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 13: [2023-02-03 16:35:05,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 13: [2023-02-03 16:35:05,663] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 13: [2023-02-03 16:35:05,663] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 3: [2023-02-03 16:35:05,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 9: [2023-02-03 16:35:05,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 3: [2023-02-03 16:35:05,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 3: [2023-02-03 16:35:05,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 3: [2023-02-03 16:35:05,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 13: [2023-02-03 16:35:05,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 13: [2023-02-03 16:35:05,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 13: [2023-02-03 16:35:05,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 4: [2023-02-03 16:35:05,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 8: [2023-02-03 16:35:05,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 9: [2023-02-03 16:35:05,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 7: [2023-02-03 16:35:05,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 9: [2023-02-03 16:35:05,671] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 9: [2023-02-03 16:35:05,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 4: [2023-02-03 16:35:05,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 4: [2023-02-03 16:35:05,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 9: [2023-02-03 16:35:05,674] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 4: [2023-02-03 16:35:05,674] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 9: [2023-02-03 16:35:05,674] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 0: [2023-02-03 16:35:05,674] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 9: [2023-02-03 16:35:05,674] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 4: [2023-02-03 16:35:05,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 0: [2023-02-03 16:35:05,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 4: [2023-02-03 16:35:05,677] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 4: [2023-02-03 16:35:05,677] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 13: [2023-02-03 16:35:05,678] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 7: [2023-02-03 16:35:05,678] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 4: [2023-02-03 16:35:05,678] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 13: [2023-02-03 16:35:05,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 13: [2023-02-03 16:35:05,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 7: [2023-02-03 16:35:05,680] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 7: [2023-02-03 16:35:05,680] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 7: [2023-02-03 16:35:05,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 13: [2023-02-03 16:35:05,684] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 13: [2023-02-03 16:35:05,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 0: [2023-02-03 16:35:05,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 0: [2023-02-03 16:35:05,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 0: [2023-02-03 16:35:05,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 0: [2023-02-03 16:35:05,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 0: [2023-02-03 16:35:05,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 0: [2023-02-03 16:35:05,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 11: [2023-02-03 16:35:05,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 11: [2023-02-03 16:35:05,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 11: [2023-02-03 16:35:05,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 11: [2023-02-03 16:35:05,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 11: [2023-02-03 16:35:05,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 11: [2023-02-03 16:35:05,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 11: [2023-02-03 16:35:05,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 13: [2023-02-03 16:35:05,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 11: [2023-02-03 16:35:05,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 14: [2023-02-03 16:35:05,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 11: [2023-02-03 16:35:05,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 11: [2023-02-03 16:35:05,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 11: [2023-02-03 16:35:05,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 14: [2023-02-03 16:35:05,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 14: [2023-02-03 16:35:05,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 14: [2023-02-03 16:35:05,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 14: [2023-02-03 16:35:05,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 0: [2023-02-03 16:35:05,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 14: [2023-02-03 16:35:05,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 14: [2023-02-03 16:35:05,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 6: [2023-02-03 16:35:05,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 14: [2023-02-03 16:35:05,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 6: [2023-02-03 16:35:05,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 11: [2023-02-03 16:35:05,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 6: [2023-02-03 16:35:05,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 6: [2023-02-03 16:35:05,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 6: [2023-02-03 16:35:05,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 6: [2023-02-03 16:35:05,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 6: [2023-02-03 16:35:05,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 11: [2023-02-03 16:35:05,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 11: [2023-02-03 16:35:05,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 11: [2023-02-03 16:35:05,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 11: [2023-02-03 16:35:05,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 6: [2023-02-03 16:35:05,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 14: [2023-02-03 16:35:05,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 2: [2023-02-03 16:35:05,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 2: [2023-02-03 16:35:05,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 2: [2023-02-03 16:35:05,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 2: [2023-02-03 16:35:05,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 2: [2023-02-03 16:35:05,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 2: [2023-02-03 16:35:05,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 2: [2023-02-03 16:35:05,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 2: [2023-02-03 16:35:05,691] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 9: [2023-02-03 16:35:05,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 6: [2023-02-03 16:35:05,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 9: [2023-02-03 16:35:05,691] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 9: [2023-02-03 16:35:05,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 6: [2023-02-03 16:35:05,692] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 6: [2023-02-03 16:35:05,692] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 6: [2023-02-03 16:35:05,692] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 2: [2023-02-03 16:35:05,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 2: [2023-02-03 16:35:05,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 2: [2023-02-03 16:35:05,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 2: [2023-02-03 16:35:05,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 13: [2023-02-03 16:35:05,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 6: [2023-02-03 16:35:05,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 6: [2023-02-03 16:35:05,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 2: [2023-02-03 16:35:05,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 2: [2023-02-03 16:35:05,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 2: [2023-02-03 16:35:05,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 9: [2023-02-03 16:35:05,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 9: [2023-02-03 16:35:05,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 6: [2023-02-03 16:35:05,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 6: [2023-02-03 16:35:05,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 14: [2023-02-03 16:35:05,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 9: [2023-02-03 16:35:05,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 14: [2023-02-03 16:35:05,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 14: [2023-02-03 16:35:05,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 14: [2023-02-03 16:35:05,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 14: [2023-02-03 16:35:05,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 14: [2023-02-03 16:35:05,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 14: [2023-02-03 16:35:05,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 2: [2023-02-03 16:35:05,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 10: [2023-02-03 16:35:05,698] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 10: [2023-02-03 16:35:05,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 10: [2023-02-03 16:35:05,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 10: [2023-02-03 16:35:05,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 10: [2023-02-03 16:35:05,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 10: [2023-02-03 16:35:05,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 5: [2023-02-03 16:35:05,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 10: [2023-02-03 16:35:05,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 10: [2023-02-03 16:35:05,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 10: [2023-02-03 16:35:05,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 15: [2023-02-03 16:35:05,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 5: [2023-02-03 16:35:05,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 5: [2023-02-03 16:35:05,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 5: [2023-02-03 16:35:05,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 5: [2023-02-03 16:35:05,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 5: [2023-02-03 16:35:05,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 5: [2023-02-03 16:35:05,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 5: [2023-02-03 16:35:05,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 10: [2023-02-03 16:35:05,703] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 15: [2023-02-03 16:35:05,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 15: [2023-02-03 16:35:05,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 15: [2023-02-03 16:35:05,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 15: [2023-02-03 16:35:05,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 15: [2023-02-03 16:35:05,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 15: [2023-02-03 16:35:05,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 15: [2023-02-03 16:35:05,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 5: [2023-02-03 16:35:05,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 5: [2023-02-03 16:35:05,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 10: [2023-02-03 16:35:05,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 5: [2023-02-03 16:35:05,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 5: [2023-02-03 16:35:05,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 5: [2023-02-03 16:35:05,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 5: [2023-02-03 16:35:05,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 5: [2023-02-03 16:35:05,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 5: [2023-02-03 16:35:05,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 10: [2023-02-03 16:35:05,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 10: [2023-02-03 16:35:05,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 10: [2023-02-03 16:35:05,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 9: [2023-02-03 16:35:05,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 10: [2023-02-03 16:35:05,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 10: [2023-02-03 16:35:05,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 15: [2023-02-03 16:35:05,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 15: [2023-02-03 16:35:05,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 15: [2023-02-03 16:35:05,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 15: [2023-02-03 16:35:05,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 15: [2023-02-03 16:35:05,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 12: [2023-02-03 16:35:05,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 12: [2023-02-03 16:35:05,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 12: [2023-02-03 16:35:05,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 12: [2023-02-03 16:35:05,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 15: [2023-02-03 16:35:05,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 15: [2023-02-03 16:35:05,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 15: [2023-02-03 16:35:05,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 12: [2023-02-03 16:35:05,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 12: [2023-02-03 16:35:05,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 12: [2023-02-03 16:35:05,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 12: [2023-02-03 16:35:05,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 0: [2023-02-03 16:35:05,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 0: [2023-02-03 16:35:05,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 12: [2023-02-03 16:35:05,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 12: [2023-02-03 16:35:05,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 12: [2023-02-03 16:35:05,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 0: [2023-02-03 16:35:05,710] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 0: [2023-02-03 16:35:05,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 0: [2023-02-03 16:35:05,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 12: [2023-02-03 16:35:05,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 12: [2023-02-03 16:35:05,712] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 12: [2023-02-03 16:35:05,712] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 12: [2023-02-03 16:35:05,712] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 12: [2023-02-03 16:35:05,712] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt... 6: [2023-02-03 16:35:05,719] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 14: [2023-02-03 16:35:05,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 11: [2023-02-03 16:35:05,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 11: [2023-02-03 16:35:05,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 11: [2023-02-03 16:35:05,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 6: [2023-02-03 16:35:05,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 6: [2023-02-03 16:35:05,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 2: [2023-02-03 16:35:05,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 2: [2023-02-03 16:35:05,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 2: [2023-02-03 16:35:05,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 2: [2023-02-03 16:35:05,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 2: [2023-02-03 16:35:05,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 2: [2023-02-03 16:35:05,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 2: [2023-02-03 16:35:05,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 11: [2023-02-03 16:35:05,733] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 11: [2023-02-03 16:35:05,733] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 11: [2023-02-03 16:35:05,733] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 10: [2023-02-03 16:35:05,733] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 10: [2023-02-03 16:35:05,733] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 6: [2023-02-03 16:35:05,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 11: [2023-02-03 16:35:05,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 11: [2023-02-03 16:35:05,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 11: [2023-02-03 16:35:05,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 11: [2023-02-03 16:35:05,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 14: [2023-02-03 16:35:05,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 6: [2023-02-03 16:35:05,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 5: [2023-02-03 16:35:05,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 6: [2023-02-03 16:35:05,737] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 5: [2023-02-03 16:35:05,737] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 6: [2023-02-03 16:35:05,737] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 6: [2023-02-03 16:35:05,737] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 6: [2023-02-03 16:35:05,737] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 11: [2023-02-03 16:35:05,738] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 5: [2023-02-03 16:35:05,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 15: [2023-02-03 16:35:05,741] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 2: [2023-02-03 16:35:05,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 15: [2023-02-03 16:35:05,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 15: [2023-02-03 16:35:05,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 15: [2023-02-03 16:35:05,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 12: [2023-02-03 16:35:05,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 12: [2023-02-03 16:35:05,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 15: [2023-02-03 16:35:05,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 12: [2023-02-03 16:35:05,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 5: [2023-02-03 16:35:05,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 10: [2023-02-03 16:35:05,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 15: [2023-02-03 16:35:05,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 15: [2023-02-03 16:35:05,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 14: [2023-02-03 16:35:05,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 14: [2023-02-03 16:35:05,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 14: [2023-02-03 16:35:05,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 15: [2023-02-03 16:35:05,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 14: [2023-02-03 16:35:05,746] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 2: [2023-02-03 16:35:05,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 14: [2023-02-03 16:35:05,746] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 14: [2023-02-03 16:35:05,746] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 14: [2023-02-03 16:35:05,746] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 10: [2023-02-03 16:35:05,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 2: [2023-02-03 16:35:05,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 10: [2023-02-03 16:35:05,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 5: [2023-02-03 16:35:05,746] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 5: [2023-02-03 16:35:05,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 5: [2023-02-03 16:35:05,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 2: [2023-02-03 16:35:05,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 2: [2023-02-03 16:35:05,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 6: [2023-02-03 16:35:05,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 6: [2023-02-03 16:35:05,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 2: [2023-02-03 16:35:05,749] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 5: [2023-02-03 16:35:05,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 2: [2023-02-03 16:35:05,750] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 12: [2023-02-03 16:35:05,750] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 5: [2023-02-03 16:35:05,751] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 11: [2023-02-03 16:35:05,751] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 11: [2023-02-03 16:35:05,751] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 2: [2023-02-03 16:35:05,751] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 6: [2023-02-03 16:35:05,751] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 11: [2023-02-03 16:35:05,752] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 11: [2023-02-03 16:35:05,752] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 12: [2023-02-03 16:35:05,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 12: [2023-02-03 16:35:05,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 12: [2023-02-03 16:35:05,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 12: [2023-02-03 16:35:05,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 6: [2023-02-03 16:35:05,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 11: [2023-02-03 16:35:05,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 6: [2023-02-03 16:35:05,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 12: [2023-02-03 16:35:05,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 5: [2023-02-03 16:35:05,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 10: [2023-02-03 16:35:05,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 10: [2023-02-03 16:35:05,757] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 10: [2023-02-03 16:35:05,757] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 10: [2023-02-03 16:35:05,757] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 10: [2023-02-03 16:35:05,757] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 5: [2023-02-03 16:35:05,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 10: [2023-02-03 16:35:05,757] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_06-model_00-model_states.pt. 12: [2023-02-03 16:35:05,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 12: [2023-02-03 16:35:05,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 6: [2023-02-03 16:35:05,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 15: [2023-02-03 16:35:05,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 2: [2023-02-03 16:35:05,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 6: [2023-02-03 16:35:05,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 15: [2023-02-03 16:35:05,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 15: [2023-02-03 16:35:05,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 15: [2023-02-03 16:35:05,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 15: [2023-02-03 16:35:05,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 15: [2023-02-03 16:35:05,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 15: [2023-02-03 16:35:05,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 5: [2023-02-03 16:35:05,763] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 14: [2023-02-03 16:35:05,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 15: [2023-02-03 16:35:05,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 5: [2023-02-03 16:35:05,766] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 5: [2023-02-03 16:35:05,767] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 14: [2023-02-03 16:35:05,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 5: [2023-02-03 16:35:05,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 12: [2023-02-03 16:35:05,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 5: [2023-02-03 16:35:05,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 14: [2023-02-03 16:35:05,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 14: [2023-02-03 16:35:05,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 14: [2023-02-03 16:35:05,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 14: [2023-02-03 16:35:05,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 14: [2023-02-03 16:35:05,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 12: [2023-02-03 16:35:05,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 12: [2023-02-03 16:35:05,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 12: [2023-02-03 16:35:05,773] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 12: [2023-02-03 16:35:05,773] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 10: [2023-02-03 16:35:05,774] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 10: [2023-02-03 16:35:05,774] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 10: [2023-02-03 16:35:05,774] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 10: [2023-02-03 16:35:05,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 10: [2023-02-03 16:35:05,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 1: [2023-02-03 16:35:05,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 1: [2023-02-03 16:35:05,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 1: [2023-02-03 16:35:05,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 1: [2023-02-03 16:35:05,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 1: [2023-02-03 16:35:05,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 1: [2023-02-03 16:35:05,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 1: [2023-02-03 16:35:05,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 1: [2023-02-03 16:35:05,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 1: [2023-02-03 16:35:05,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 1: [2023-02-03 16:35:05,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 1: [2023-02-03 16:35:05,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 1: [2023-02-03 16:35:05,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 1: [2023-02-03 16:35:05,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 1: [2023-02-03 16:35:05,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 1: [2023-02-03 16:35:05,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 1: [2023-02-03 16:35:05,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 4: [2023-02-03 16:35:05,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 4: [2023-02-03 16:35:05,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 4: [2023-02-03 16:35:05,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 4: [2023-02-03 16:35:05,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 4: [2023-02-03 16:35:05,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 4: [2023-02-03 16:35:05,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 4: [2023-02-03 16:35:05,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 4: [2023-02-03 16:35:05,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 4: [2023-02-03 16:35:05,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 4: [2023-02-03 16:35:05,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 4: [2023-02-03 16:35:05,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 4: [2023-02-03 16:35:05,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 4: [2023-02-03 16:35:05,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 4: [2023-02-03 16:35:05,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 4: [2023-02-03 16:35:05,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 4: [2023-02-03 16:35:05,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 1: [2023-02-03 16:35:05,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 1: [2023-02-03 16:35:05,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 1: [2023-02-03 16:35:05,890] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 1: [2023-02-03 16:35:05,890] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 1: [2023-02-03 16:35:05,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 1: [2023-02-03 16:35:05,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 1: [2023-02-03 16:35:05,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 1: [2023-02-03 16:35:05,903] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 1: [2023-02-03 16:35:05,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 1: [2023-02-03 16:35:05,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 6: [2023-02-03 16:35:05,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 6: [2023-02-03 16:35:05,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 6: [2023-02-03 16:35:05,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 6: [2023-02-03 16:35:05,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 6: [2023-02-03 16:35:05,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 6: [2023-02-03 16:35:05,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 6: [2023-02-03 16:35:05,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 6: [2023-02-03 16:35:05,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 1: [2023-02-03 16:35:05,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 1: [2023-02-03 16:35:05,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 6: [2023-02-03 16:35:05,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 6: [2023-02-03 16:35:05,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 6: [2023-02-03 16:35:05,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 1: [2023-02-03 16:35:05,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 1: [2023-02-03 16:35:05,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 6: [2023-02-03 16:35:05,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 6: [2023-02-03 16:35:05,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 6: [2023-02-03 16:35:05,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 6: [2023-02-03 16:35:05,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 6: [2023-02-03 16:35:05,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 1: [2023-02-03 16:35:05,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 0: [2023-02-03 16:35:05,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 0: [2023-02-03 16:35:05,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 0: [2023-02-03 16:35:05,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 0: [2023-02-03 16:35:05,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 0: [2023-02-03 16:35:05,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 0: [2023-02-03 16:35:05,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 0: [2023-02-03 16:35:05,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 0: [2023-02-03 16:35:05,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 8: [2023-02-03 16:35:05,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 8: [2023-02-03 16:35:05,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 8: [2023-02-03 16:35:05,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 8: [2023-02-03 16:35:05,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 8: [2023-02-03 16:35:05,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 8: [2023-02-03 16:35:05,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 8: [2023-02-03 16:35:05,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 8: [2023-02-03 16:35:05,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 8: [2023-02-03 16:35:05,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 1: [2023-02-03 16:35:05,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 8: [2023-02-03 16:35:05,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 0: [2023-02-03 16:35:05,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 0: [2023-02-03 16:35:05,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 0: [2023-02-03 16:35:05,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 0: [2023-02-03 16:35:05,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 0: [2023-02-03 16:35:05,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 0: [2023-02-03 16:35:05,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 0: [2023-02-03 16:35:05,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 0: [2023-02-03 16:35:05,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 8: [2023-02-03 16:35:05,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 4: [2023-02-03 16:35:05,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 4: [2023-02-03 16:35:05,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 4: [2023-02-03 16:35:05,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 4: [2023-02-03 16:35:05,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 8: [2023-02-03 16:35:05,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 8: [2023-02-03 16:35:05,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 8: [2023-02-03 16:35:05,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 8: [2023-02-03 16:35:05,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 8: [2023-02-03 16:35:05,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 4: [2023-02-03 16:35:05,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 4: [2023-02-03 16:35:05,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 4: [2023-02-03 16:35:05,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 4: [2023-02-03 16:35:05,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 11: [2023-02-03 16:35:05,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 11: [2023-02-03 16:35:05,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 11: [2023-02-03 16:35:05,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 11: [2023-02-03 16:35:05,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 11: [2023-02-03 16:35:05,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 11: [2023-02-03 16:35:05,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 11: [2023-02-03 16:35:05,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 11: [2023-02-03 16:35:05,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 11: [2023-02-03 16:35:05,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 11: [2023-02-03 16:35:05,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 11: [2023-02-03 16:35:05,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 11: [2023-02-03 16:35:05,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 11: [2023-02-03 16:35:05,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 11: [2023-02-03 16:35:05,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 11: [2023-02-03 16:35:05,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 11: [2023-02-03 16:35:05,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 6: [2023-02-03 16:35:05,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 6: [2023-02-03 16:35:05,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 14: [2023-02-03 16:35:05,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 4: [2023-02-03 16:35:05,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 14: [2023-02-03 16:35:05,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 14: [2023-02-03 16:35:05,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 14: [2023-02-03 16:35:05,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 14: [2023-02-03 16:35:05,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 14: [2023-02-03 16:35:05,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 14: [2023-02-03 16:35:05,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 14: [2023-02-03 16:35:05,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 14: [2023-02-03 16:35:05,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 4: [2023-02-03 16:35:05,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 4: [2023-02-03 16:35:05,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 4: [2023-02-03 16:35:05,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 4: [2023-02-03 16:35:05,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 4: [2023-02-03 16:35:05,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 4: [2023-02-03 16:35:05,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 4: [2023-02-03 16:35:05,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 6: [2023-02-03 16:35:05,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 6: [2023-02-03 16:35:05,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 8: [2023-02-03 16:35:05,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 8: [2023-02-03 16:35:05,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 14: [2023-02-03 16:35:05,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 14: [2023-02-03 16:35:05,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 14: [2023-02-03 16:35:05,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 14: [2023-02-03 16:35:05,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 14: [2023-02-03 16:35:05,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 14: [2023-02-03 16:35:05,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 14: [2023-02-03 16:35:05,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 6: [2023-02-03 16:35:05,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 6: [2023-02-03 16:35:05,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 6: [2023-02-03 16:35:05,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 6: [2023-02-03 16:35:05,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 6: [2023-02-03 16:35:05,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 6: [2023-02-03 16:35:05,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 6: [2023-02-03 16:35:05,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 9: [2023-02-03 16:35:05,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 8: [2023-02-03 16:35:05,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 8: [2023-02-03 16:35:05,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 9: [2023-02-03 16:35:05,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 14: [2023-02-03 16:35:05,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 8: [2023-02-03 16:35:05,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 8: [2023-02-03 16:35:05,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 9: [2023-02-03 16:35:05,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 9: [2023-02-03 16:35:05,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 9: [2023-02-03 16:35:05,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 9: [2023-02-03 16:35:05,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 9: [2023-02-03 16:35:05,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 6: [2023-02-03 16:35:05,970] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 9: [2023-02-03 16:35:05,970] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 9: [2023-02-03 16:35:05,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 6: [2023-02-03 16:35:05,970] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 11: [2023-02-03 16:35:05,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 11: [2023-02-03 16:35:05,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 9: [2023-02-03 16:35:05,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 0: [2023-02-03 16:35:05,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 0: [2023-02-03 16:35:05,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 6: [2023-02-03 16:35:05,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 6: [2023-02-03 16:35:05,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 6: [2023-02-03 16:35:05,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 11: [2023-02-03 16:35:05,974] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 7: [2023-02-03 16:35:05,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 7: [2023-02-03 16:35:05,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 7: [2023-02-03 16:35:05,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 7: [2023-02-03 16:35:05,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 7: [2023-02-03 16:35:05,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 7: [2023-02-03 16:35:05,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 9: [2023-02-03 16:35:05,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 7: [2023-02-03 16:35:05,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 7: [2023-02-03 16:35:05,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 3: [2023-02-03 16:35:05,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 9: [2023-02-03 16:35:05,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 9: [2023-02-03 16:35:05,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 9: [2023-02-03 16:35:05,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 9: [2023-02-03 16:35:05,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 9: [2023-02-03 16:35:05,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 3: [2023-02-03 16:35:05,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 3: [2023-02-03 16:35:05,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 7: [2023-02-03 16:35:05,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 8: [2023-02-03 16:35:05,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 8: [2023-02-03 16:35:05,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 8: [2023-02-03 16:35:05,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 3: [2023-02-03 16:35:05,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 3: [2023-02-03 16:35:05,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 8: [2023-02-03 16:35:05,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 3: [2023-02-03 16:35:05,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 3: [2023-02-03 16:35:05,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 7: [2023-02-03 16:35:05,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 3: [2023-02-03 16:35:05,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 15: [2023-02-03 16:35:05,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 7: [2023-02-03 16:35:05,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 7: [2023-02-03 16:35:05,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 15: [2023-02-03 16:35:05,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 15: [2023-02-03 16:35:05,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 15: [2023-02-03 16:35:05,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 15: [2023-02-03 16:35:05,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 15: [2023-02-03 16:35:05,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 15: [2023-02-03 16:35:05,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 15: [2023-02-03 16:35:05,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 3: [2023-02-03 16:35:05,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 3: [2023-02-03 16:35:05,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 3: [2023-02-03 16:35:05,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 7: [2023-02-03 16:35:05,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 11: [2023-02-03 16:35:05,980] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 11: [2023-02-03 16:35:05,980] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 11: [2023-02-03 16:35:05,980] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 11: [2023-02-03 16:35:05,980] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 7: [2023-02-03 16:35:05,980] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 7: [2023-02-03 16:35:05,980] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 7: [2023-02-03 16:35:05,980] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 15: [2023-02-03 16:35:05,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 15: [2023-02-03 16:35:05,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 15: [2023-02-03 16:35:05,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 15: [2023-02-03 16:35:05,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 15: [2023-02-03 16:35:05,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 3: [2023-02-03 16:35:05,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 15: [2023-02-03 16:35:05,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 15: [2023-02-03 16:35:05,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 15: [2023-02-03 16:35:05,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 3: [2023-02-03 16:35:05,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 3: [2023-02-03 16:35:05,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 3: [2023-02-03 16:35:05,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 3: [2023-02-03 16:35:05,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 8: [2023-02-03 16:35:05,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 11: [2023-02-03 16:35:05,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 8: [2023-02-03 16:35:05,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 14: [2023-02-03 16:35:05,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 0: [2023-02-03 16:35:05,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 0: [2023-02-03 16:35:05,987] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 0: [2023-02-03 16:35:05,987] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 0: [2023-02-03 16:35:05,987] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 0: [2023-02-03 16:35:05,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 0: [2023-02-03 16:35:05,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 0: [2023-02-03 16:35:05,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 0: [2023-02-03 16:35:05,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 11: [2023-02-03 16:35:05,990] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 11: [2023-02-03 16:35:05,990] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 11: [2023-02-03 16:35:05,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 8: [2023-02-03 16:35:05,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 8: [2023-02-03 16:35:05,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 8: [2023-02-03 16:35:05,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 8: [2023-02-03 16:35:05,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 11: [2023-02-03 16:35:05,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 11: [2023-02-03 16:35:05,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 11: [2023-02-03 16:35:05,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 2: [2023-02-03 16:35:05,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 2: [2023-02-03 16:35:05,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 2: [2023-02-03 16:35:05,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 2: [2023-02-03 16:35:05,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 2: [2023-02-03 16:35:05,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 2: [2023-02-03 16:35:05,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 11: [2023-02-03 16:35:05,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 2: [2023-02-03 16:35:05,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 2: [2023-02-03 16:35:06,000] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 9: [2023-02-03 16:35:06,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 10: [2023-02-03 16:35:06,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 2: [2023-02-03 16:35:06,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 2: [2023-02-03 16:35:06,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 2: [2023-02-03 16:35:06,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 2: [2023-02-03 16:35:06,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 2: [2023-02-03 16:35:06,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 2: [2023-02-03 16:35:06,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 10: [2023-02-03 16:35:06,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 2: [2023-02-03 16:35:06,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 2: [2023-02-03 16:35:06,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 10: [2023-02-03 16:35:06,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 7: [2023-02-03 16:35:06,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 10: [2023-02-03 16:35:06,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 10: [2023-02-03 16:35:06,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 10: [2023-02-03 16:35:06,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 14: [2023-02-03 16:35:06,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 10: [2023-02-03 16:35:06,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 10: [2023-02-03 16:35:06,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 13: [2023-02-03 16:35:06,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 14: [2023-02-03 16:35:06,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 14: [2023-02-03 16:35:06,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 10: [2023-02-03 16:35:06,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 14: [2023-02-03 16:35:06,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 10: [2023-02-03 16:35:06,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 14: [2023-02-03 16:35:06,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 14: [2023-02-03 16:35:06,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 14: [2023-02-03 16:35:06,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 13: [2023-02-03 16:35:06,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 11: [2023-02-03 16:35:06,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 13: [2023-02-03 16:35:06,006] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 13: [2023-02-03 16:35:06,006] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 13: [2023-02-03 16:35:06,006] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 13: [2023-02-03 16:35:06,006] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 13: [2023-02-03 16:35:06,006] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 13: [2023-02-03 16:35:06,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 0: [2023-02-03 16:35:06,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 0: [2023-02-03 16:35:06,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 13: [2023-02-03 16:35:06,006] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 13: [2023-02-03 16:35:06,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 9: [2023-02-03 16:35:06,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 10: [2023-02-03 16:35:06,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 10: [2023-02-03 16:35:06,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 10: [2023-02-03 16:35:06,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 10: [2023-02-03 16:35:06,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 10: [2023-02-03 16:35:06,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 10: [2023-02-03 16:35:06,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 13: [2023-02-03 16:35:06,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 0: [2023-02-03 16:35:06,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 0: [2023-02-03 16:35:06,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 13: [2023-02-03 16:35:06,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 0: [2023-02-03 16:35:06,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 0: [2023-02-03 16:35:06,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 13: [2023-02-03 16:35:06,012] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 13: [2023-02-03 16:35:06,012] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 13: [2023-02-03 16:35:06,012] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 13: [2023-02-03 16:35:06,012] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 3: [2023-02-03 16:35:06,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 3: [2023-02-03 16:35:06,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 3: [2023-02-03 16:35:06,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 7: [2023-02-03 16:35:06,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 7: [2023-02-03 16:35:06,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 7: [2023-02-03 16:35:06,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 7: [2023-02-03 16:35:06,016] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 15: [2023-02-03 16:35:06,017] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 9: [2023-02-03 16:35:06,018] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 15: [2023-02-03 16:35:06,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 15: [2023-02-03 16:35:06,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 15: [2023-02-03 16:35:06,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 15: [2023-02-03 16:35:06,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 15: [2023-02-03 16:35:06,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 15: [2023-02-03 16:35:06,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 7: [2023-02-03 16:35:06,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 9: [2023-02-03 16:35:06,021] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 15: [2023-02-03 16:35:06,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 3: [2023-02-03 16:35:06,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 9: [2023-02-03 16:35:06,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 9: [2023-02-03 16:35:06,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 3: [2023-02-03 16:35:06,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 3: [2023-02-03 16:35:06,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 3: [2023-02-03 16:35:06,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 3: [2023-02-03 16:35:06,024] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 14: [2023-02-03 16:35:06,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 9: [2023-02-03 16:35:06,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 9: [2023-02-03 16:35:06,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 9: [2023-02-03 16:35:06,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 9: [2023-02-03 16:35:06,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 3: [2023-02-03 16:35:06,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 14: [2023-02-03 16:35:06,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 14: [2023-02-03 16:35:06,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 14: [2023-02-03 16:35:06,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 7: [2023-02-03 16:35:06,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 7: [2023-02-03 16:35:06,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 7: [2023-02-03 16:35:06,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 3: [2023-02-03 16:35:06,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 3: [2023-02-03 16:35:06,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 14: [2023-02-03 16:35:06,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 7: [2023-02-03 16:35:06,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 7: [2023-02-03 16:35:06,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 14: [2023-02-03 16:35:06,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 14: [2023-02-03 16:35:06,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 7: [2023-02-03 16:35:06,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 15: [2023-02-03 16:35:06,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 10: [2023-02-03 16:35:06,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 15: [2023-02-03 16:35:06,035] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 15: [2023-02-03 16:35:06,035] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 15: [2023-02-03 16:35:06,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 7: [2023-02-03 16:35:06,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 13: [2023-02-03 16:35:06,037] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 15: [2023-02-03 16:35:06,037] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 10: [2023-02-03 16:35:06,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 15: [2023-02-03 16:35:06,039] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 15: [2023-02-03 16:35:06,039] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 15: [2023-02-03 16:35:06,039] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,040] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 2: [2023-02-03 16:35:06,040] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 2: [2023-02-03 16:35:06,040] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 2: [2023-02-03 16:35:06,040] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 5: [2023-02-03 16:35:06,040] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 5: [2023-02-03 16:35:06,040] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 5: [2023-02-03 16:35:06,040] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 2: [2023-02-03 16:35:06,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 2: [2023-02-03 16:35:06,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 5: [2023-02-03 16:35:06,040] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 5: [2023-02-03 16:35:06,040] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 5: [2023-02-03 16:35:06,040] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 5: [2023-02-03 16:35:06,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 2: [2023-02-03 16:35:06,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 2: [2023-02-03 16:35:06,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 13: [2023-02-03 16:35:06,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 2: [2023-02-03 16:35:06,042] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 9: [2023-02-03 16:35:06,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 9: [2023-02-03 16:35:06,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 9: [2023-02-03 16:35:06,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 3: [2023-02-03 16:35:06,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 7: [2023-02-03 16:35:06,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 5: [2023-02-03 16:35:06,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 5: [2023-02-03 16:35:06,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 5: [2023-02-03 16:35:06,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 5: [2023-02-03 16:35:06,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 5: [2023-02-03 16:35:06,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 5: [2023-02-03 16:35:06,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 7: [2023-02-03 16:35:06,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 3: [2023-02-03 16:35:06,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 3: [2023-02-03 16:35:06,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 7: [2023-02-03 16:35:06,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 9: [2023-02-03 16:35:06,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 9: [2023-02-03 16:35:06,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 9: [2023-02-03 16:35:06,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 3: [2023-02-03 16:35:06,047] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 3: [2023-02-03 16:35:06,047] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 10: [2023-02-03 16:35:06,047] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 10: [2023-02-03 16:35:06,047] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 12: [2023-02-03 16:35:06,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 12: [2023-02-03 16:35:06,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 12: [2023-02-03 16:35:06,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 13: [2023-02-03 16:35:06,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 12: [2023-02-03 16:35:06,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 12: [2023-02-03 16:35:06,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 12: [2023-02-03 16:35:06,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 12: [2023-02-03 16:35:06,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 12: [2023-02-03 16:35:06,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 10: [2023-02-03 16:35:06,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 12: [2023-02-03 16:35:06,051] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 12: [2023-02-03 16:35:06,051] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 12: [2023-02-03 16:35:06,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 12: [2023-02-03 16:35:06,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 12: [2023-02-03 16:35:06,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 12: [2023-02-03 16:35:06,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 13: [2023-02-03 16:35:06,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 12: [2023-02-03 16:35:06,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 12: [2023-02-03 16:35:06,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt... 10: [2023-02-03 16:35:06,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 10: [2023-02-03 16:35:06,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 10: [2023-02-03 16:35:06,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 10: [2023-02-03 16:35:06,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 13: [2023-02-03 16:35:06,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 13: [2023-02-03 16:35:06,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 13: [2023-02-03 16:35:06,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 13: [2023-02-03 16:35:06,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 2: [2023-02-03 16:35:06,059] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 13: [2023-02-03 16:35:06,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 10: [2023-02-03 16:35:06,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 2: [2023-02-03 16:35:06,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 2: [2023-02-03 16:35:06,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 2: [2023-02-03 16:35:06,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 2: [2023-02-03 16:35:06,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 13: [2023-02-03 16:35:06,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 2: [2023-02-03 16:35:06,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 2: [2023-02-03 16:35:06,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 2: [2023-02-03 16:35:06,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 10: [2023-02-03 16:35:06,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 10: [2023-02-03 16:35:06,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 10: [2023-02-03 16:35:06,075] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 10: [2023-02-03 16:35:06,076] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 10: [2023-02-03 16:35:06,076] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 13: [2023-02-03 16:35:06,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 13: [2023-02-03 16:35:06,079] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 13: [2023-02-03 16:35:06,079] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 13: [2023-02-03 16:35:06,079] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,081] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 5: [2023-02-03 16:35:06,081] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 5: [2023-02-03 16:35:06,081] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 5: [2023-02-03 16:35:06,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 5: [2023-02-03 16:35:06,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 13: [2023-02-03 16:35:06,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 5: [2023-02-03 16:35:06,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 13: [2023-02-03 16:35:06,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 12: [2023-02-03 16:35:06,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 12: [2023-02-03 16:35:06,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 12: [2023-02-03 16:35:06,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 10: [2023-02-03 16:35:06,088] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 12: [2023-02-03 16:35:06,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 12: [2023-02-03 16:35:06,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 12: [2023-02-03 16:35:06,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 5: [2023-02-03 16:35:06,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 12: [2023-02-03 16:35:06,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 12: [2023-02-03 16:35:06,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_07-model_00-model_states.pt. 12: [2023-02-03 16:35:06,097] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 12: [2023-02-03 16:35:06,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 12: [2023-02-03 16:35:06,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 12: [2023-02-03 16:35:06,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 12: [2023-02-03 16:35:06,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 12: [2023-02-03 16:35:06,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 12: [2023-02-03 16:35:06,117] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 12: [2023-02-03 16:35:06,117] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 1: [2023-02-03 16:35:06,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 1: [2023-02-03 16:35:06,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 1: [2023-02-03 16:35:06,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 1: [2023-02-03 16:35:06,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 1: [2023-02-03 16:35:06,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 1: [2023-02-03 16:35:06,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 1: [2023-02-03 16:35:06,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 1: [2023-02-03 16:35:06,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 1: [2023-02-03 16:35:06,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 1: [2023-02-03 16:35:06,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 1: [2023-02-03 16:35:06,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 1: [2023-02-03 16:35:06,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 1: [2023-02-03 16:35:06,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 1: [2023-02-03 16:35:06,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 1: [2023-02-03 16:35:06,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 1: [2023-02-03 16:35:06,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 6: [2023-02-03 16:35:06,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 6: [2023-02-03 16:35:06,132] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 6: [2023-02-03 16:35:06,132] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 6: [2023-02-03 16:35:06,132] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 6: [2023-02-03 16:35:06,132] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 6: [2023-02-03 16:35:06,132] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 6: [2023-02-03 16:35:06,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 6: [2023-02-03 16:35:06,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 6: [2023-02-03 16:35:06,134] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 6: [2023-02-03 16:35:06,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 6: [2023-02-03 16:35:06,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 6: [2023-02-03 16:35:06,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 6: [2023-02-03 16:35:06,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 6: [2023-02-03 16:35:06,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 6: [2023-02-03 16:35:06,138] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 6: [2023-02-03 16:35:06,138] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 6: [2023-02-03 16:35:06,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 1: [2023-02-03 16:35:06,162] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 1: [2023-02-03 16:35:06,162] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 1: [2023-02-03 16:35:06,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 1: [2023-02-03 16:35:06,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 1: [2023-02-03 16:35:06,164] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 1: [2023-02-03 16:35:06,164] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 1: [2023-02-03 16:35:06,164] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 1: [2023-02-03 16:35:06,165] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 6: [2023-02-03 16:35:06,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 6: [2023-02-03 16:35:06,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 6: [2023-02-03 16:35:06,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 1: [2023-02-03 16:35:06,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 6: [2023-02-03 16:35:06,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 6: [2023-02-03 16:35:06,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 6: [2023-02-03 16:35:06,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 6: [2023-02-03 16:35:06,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 1: [2023-02-03 16:35:06,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 1: [2023-02-03 16:35:06,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 6: [2023-02-03 16:35:06,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 1: [2023-02-03 16:35:06,182] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 1: [2023-02-03 16:35:06,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 1: [2023-02-03 16:35:06,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 1: [2023-02-03 16:35:06,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 1: [2023-02-03 16:35:06,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 0: [2023-02-03 16:35:06,187] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 0: [2023-02-03 16:35:06,187] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 0: [2023-02-03 16:35:06,187] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 0: [2023-02-03 16:35:06,187] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 0: [2023-02-03 16:35:06,187] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 0: [2023-02-03 16:35:06,187] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 0: [2023-02-03 16:35:06,187] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 0: [2023-02-03 16:35:06,187] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 6: [2023-02-03 16:35:06,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 6: [2023-02-03 16:35:06,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 0: [2023-02-03 16:35:06,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 0: [2023-02-03 16:35:06,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 0: [2023-02-03 16:35:06,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 0: [2023-02-03 16:35:06,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 0: [2023-02-03 16:35:06,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 0: [2023-02-03 16:35:06,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 0: [2023-02-03 16:35:06,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 0: [2023-02-03 16:35:06,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 6: [2023-02-03 16:35:06,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 6: [2023-02-03 16:35:06,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 6: [2023-02-03 16:35:06,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 6: [2023-02-03 16:35:06,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 6: [2023-02-03 16:35:06,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 8: [2023-02-03 16:35:06,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 8: [2023-02-03 16:35:06,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 8: [2023-02-03 16:35:06,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 8: [2023-02-03 16:35:06,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 8: [2023-02-03 16:35:06,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 8: [2023-02-03 16:35:06,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 8: [2023-02-03 16:35:06,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 8: [2023-02-03 16:35:06,225] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 8: [2023-02-03 16:35:06,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 8: [2023-02-03 16:35:06,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 0: [2023-02-03 16:35:06,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 8: [2023-02-03 16:35:06,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 8: [2023-02-03 16:35:06,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 8: [2023-02-03 16:35:06,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 8: [2023-02-03 16:35:06,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 8: [2023-02-03 16:35:06,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 8: [2023-02-03 16:35:06,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 4: [2023-02-03 16:35:06,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 4: [2023-02-03 16:35:06,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 4: [2023-02-03 16:35:06,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 4: [2023-02-03 16:35:06,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 4: [2023-02-03 16:35:06,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 4: [2023-02-03 16:35:06,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 4: [2023-02-03 16:35:06,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 4: [2023-02-03 16:35:06,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 0: [2023-02-03 16:35:06,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 0: [2023-02-03 16:35:06,245] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 4: [2023-02-03 16:35:06,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 4: [2023-02-03 16:35:06,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 4: [2023-02-03 16:35:06,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 4: [2023-02-03 16:35:06,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 4: [2023-02-03 16:35:06,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 4: [2023-02-03 16:35:06,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 4: [2023-02-03 16:35:06,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 4: [2023-02-03 16:35:06,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 14: [2023-02-03 16:35:06,250] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 14: [2023-02-03 16:35:06,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 14: [2023-02-03 16:35:06,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 14: [2023-02-03 16:35:06,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 14: [2023-02-03 16:35:06,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 14: [2023-02-03 16:35:06,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 14: [2023-02-03 16:35:06,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 14: [2023-02-03 16:35:06,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 14: [2023-02-03 16:35:06,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 0: [2023-02-03 16:35:06,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 0: [2023-02-03 16:35:06,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 0: [2023-02-03 16:35:06,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 0: [2023-02-03 16:35:06,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 0: [2023-02-03 16:35:06,255] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 8: [2023-02-03 16:35:06,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 8: [2023-02-03 16:35:06,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 14: [2023-02-03 16:35:06,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 14: [2023-02-03 16:35:06,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 14: [2023-02-03 16:35:06,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 14: [2023-02-03 16:35:06,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 14: [2023-02-03 16:35:06,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 14: [2023-02-03 16:35:06,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 14: [2023-02-03 16:35:06,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 0: [2023-02-03 16:35:06,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 9: [2023-02-03 16:35:06,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 9: [2023-02-03 16:35:06,271] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 9: [2023-02-03 16:35:06,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 9: [2023-02-03 16:35:06,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 9: [2023-02-03 16:35:06,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 9: [2023-02-03 16:35:06,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 9: [2023-02-03 16:35:06,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 9: [2023-02-03 16:35:06,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 9: [2023-02-03 16:35:06,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 9: [2023-02-03 16:35:06,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 8: [2023-02-03 16:35:06,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 8: [2023-02-03 16:35:06,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 0: [2023-02-03 16:35:06,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 0: [2023-02-03 16:35:06,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 8: [2023-02-03 16:35:06,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 8: [2023-02-03 16:35:06,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 0: [2023-02-03 16:35:06,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 0: [2023-02-03 16:35:06,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 0: [2023-02-03 16:35:06,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 9: [2023-02-03 16:35:06,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 9: [2023-02-03 16:35:06,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 8: [2023-02-03 16:35:06,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 9: [2023-02-03 16:35:06,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 9: [2023-02-03 16:35:06,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 8: [2023-02-03 16:35:06,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 9: [2023-02-03 16:35:06,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 9: [2023-02-03 16:35:06,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 8: [2023-02-03 16:35:06,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 8: [2023-02-03 16:35:06,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 0: [2023-02-03 16:35:06,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 14: [2023-02-03 16:35:06,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 4: [2023-02-03 16:35:06,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 4: [2023-02-03 16:35:06,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 4: [2023-02-03 16:35:06,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 4: [2023-02-03 16:35:06,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 4: [2023-02-03 16:35:06,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 4: [2023-02-03 16:35:06,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 4: [2023-02-03 16:35:06,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 4: [2023-02-03 16:35:06,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 11: [2023-02-03 16:35:06,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 11: [2023-02-03 16:35:06,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 11: [2023-02-03 16:35:06,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 11: [2023-02-03 16:35:06,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 11: [2023-02-03 16:35:06,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 11: [2023-02-03 16:35:06,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 11: [2023-02-03 16:35:06,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 11: [2023-02-03 16:35:06,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 8: [2023-02-03 16:35:06,292] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 11: [2023-02-03 16:35:06,293] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 0: [2023-02-03 16:35:06,293] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 11: [2023-02-03 16:35:06,293] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 8: [2023-02-03 16:35:06,293] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 11: [2023-02-03 16:35:06,293] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 11: [2023-02-03 16:35:06,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 11: [2023-02-03 16:35:06,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 11: [2023-02-03 16:35:06,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 11: [2023-02-03 16:35:06,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 11: [2023-02-03 16:35:06,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 8: [2023-02-03 16:35:06,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 14: [2023-02-03 16:35:06,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 3: [2023-02-03 16:35:06,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 3: [2023-02-03 16:35:06,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 8: [2023-02-03 16:35:06,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 8: [2023-02-03 16:35:06,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 3: [2023-02-03 16:35:06,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 3: [2023-02-03 16:35:06,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 3: [2023-02-03 16:35:06,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 3: [2023-02-03 16:35:06,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 3: [2023-02-03 16:35:06,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 3: [2023-02-03 16:35:06,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 3: [2023-02-03 16:35:06,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 8: [2023-02-03 16:35:06,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 3: [2023-02-03 16:35:06,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 3: [2023-02-03 16:35:06,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 9: [2023-02-03 16:35:06,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 15: [2023-02-03 16:35:06,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 15: [2023-02-03 16:35:06,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 15: [2023-02-03 16:35:06,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 15: [2023-02-03 16:35:06,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 15: [2023-02-03 16:35:06,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 15: [2023-02-03 16:35:06,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 15: [2023-02-03 16:35:06,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 15: [2023-02-03 16:35:06,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 3: [2023-02-03 16:35:06,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 3: [2023-02-03 16:35:06,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 3: [2023-02-03 16:35:06,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 3: [2023-02-03 16:35:06,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 3: [2023-02-03 16:35:06,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 5: [2023-02-03 16:35:06,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 5: [2023-02-03 16:35:06,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 5: [2023-02-03 16:35:06,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 5: [2023-02-03 16:35:06,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 5: [2023-02-03 16:35:06,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 5: [2023-02-03 16:35:06,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 15: [2023-02-03 16:35:06,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 15: [2023-02-03 16:35:06,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 15: [2023-02-03 16:35:06,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 15: [2023-02-03 16:35:06,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 15: [2023-02-03 16:35:06,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 15: [2023-02-03 16:35:06,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 15: [2023-02-03 16:35:06,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 15: [2023-02-03 16:35:06,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 4: [2023-02-03 16:35:06,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 5: [2023-02-03 16:35:06,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 4: [2023-02-03 16:35:06,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 14: [2023-02-03 16:35:06,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 14: [2023-02-03 16:35:06,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 5: [2023-02-03 16:35:06,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 14: [2023-02-03 16:35:06,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 14: [2023-02-03 16:35:06,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 14: [2023-02-03 16:35:06,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 14: [2023-02-03 16:35:06,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 14: [2023-02-03 16:35:06,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 4: [2023-02-03 16:35:06,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 4: [2023-02-03 16:35:06,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 9: [2023-02-03 16:35:06,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 4: [2023-02-03 16:35:06,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 4: [2023-02-03 16:35:06,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 4: [2023-02-03 16:35:06,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 4: [2023-02-03 16:35:06,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 9: [2023-02-03 16:35:06,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 9: [2023-02-03 16:35:06,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 7: [2023-02-03 16:35:06,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 7: [2023-02-03 16:35:06,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 7: [2023-02-03 16:35:06,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 7: [2023-02-03 16:35:06,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 9: [2023-02-03 16:35:06,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 7: [2023-02-03 16:35:06,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 7: [2023-02-03 16:35:06,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 7: [2023-02-03 16:35:06,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 7: [2023-02-03 16:35:06,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 11: [2023-02-03 16:35:06,327] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 11: [2023-02-03 16:35:06,327] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 3: [2023-02-03 16:35:06,327] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 9: [2023-02-03 16:35:06,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 7: [2023-02-03 16:35:06,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 7: [2023-02-03 16:35:06,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 7: [2023-02-03 16:35:06,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 7: [2023-02-03 16:35:06,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 9: [2023-02-03 16:35:06,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 7: [2023-02-03 16:35:06,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 7: [2023-02-03 16:35:06,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 7: [2023-02-03 16:35:06,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 7: [2023-02-03 16:35:06,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 14: [2023-02-03 16:35:06,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 9: [2023-02-03 16:35:06,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 9: [2023-02-03 16:35:06,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 11: [2023-02-03 16:35:06,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 14: [2023-02-03 16:35:06,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 14: [2023-02-03 16:35:06,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 14: [2023-02-03 16:35:06,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 14: [2023-02-03 16:35:06,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 14: [2023-02-03 16:35:06,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 14: [2023-02-03 16:35:06,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 11: [2023-02-03 16:35:06,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 3: [2023-02-03 16:35:06,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 3: [2023-02-03 16:35:06,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 5: [2023-02-03 16:35:06,337] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 9: [2023-02-03 16:35:06,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 9: [2023-02-03 16:35:06,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 9: [2023-02-03 16:35:06,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 11: [2023-02-03 16:35:06,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 11: [2023-02-03 16:35:06,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 11: [2023-02-03 16:35:06,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 11: [2023-02-03 16:35:06,341] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 11: [2023-02-03 16:35:06,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 3: [2023-02-03 16:35:06,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 15: [2023-02-03 16:35:06,341] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 11: [2023-02-03 16:35:06,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 12: [2023-02-03 16:35:06,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 15: [2023-02-03 16:35:06,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 15: [2023-02-03 16:35:06,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 15: [2023-02-03 16:35:06,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 15: [2023-02-03 16:35:06,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 12: [2023-02-03 16:35:06,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 12: [2023-02-03 16:35:06,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 12: [2023-02-03 16:35:06,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 12: [2023-02-03 16:35:06,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 12: [2023-02-03 16:35:06,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 12: [2023-02-03 16:35:06,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 15: [2023-02-03 16:35:06,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 15: [2023-02-03 16:35:06,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 15: [2023-02-03 16:35:06,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 12: [2023-02-03 16:35:06,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 3: [2023-02-03 16:35:06,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 3: [2023-02-03 16:35:06,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 5: [2023-02-03 16:35:06,344] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 12: [2023-02-03 16:35:06,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 12: [2023-02-03 16:35:06,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 5: [2023-02-03 16:35:06,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 5: [2023-02-03 16:35:06,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 12: [2023-02-03 16:35:06,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 3: [2023-02-03 16:35:06,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 12: [2023-02-03 16:35:06,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 11: [2023-02-03 16:35:06,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 3: [2023-02-03 16:35:06,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 3: [2023-02-03 16:35:06,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 12: [2023-02-03 16:35:06,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 12: [2023-02-03 16:35:06,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 12: [2023-02-03 16:35:06,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 12: [2023-02-03 16:35:06,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 9: [2023-02-03 16:35:06,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 11: [2023-02-03 16:35:06,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 9: [2023-02-03 16:35:06,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 10: [2023-02-03 16:35:06,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 10: [2023-02-03 16:35:06,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 10: [2023-02-03 16:35:06,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 5: [2023-02-03 16:35:06,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 5: [2023-02-03 16:35:06,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 10: [2023-02-03 16:35:06,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 10: [2023-02-03 16:35:06,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 10: [2023-02-03 16:35:06,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 10: [2023-02-03 16:35:06,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 10: [2023-02-03 16:35:06,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 10: [2023-02-03 16:35:06,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 10: [2023-02-03 16:35:06,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 3: [2023-02-03 16:35:06,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 3: [2023-02-03 16:35:06,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 13: [2023-02-03 16:35:06,353] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 10: [2023-02-03 16:35:06,354] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 10: [2023-02-03 16:35:06,355] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 10: [2023-02-03 16:35:06,355] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 10: [2023-02-03 16:35:06,355] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 10: [2023-02-03 16:35:06,355] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 10: [2023-02-03 16:35:06,355] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 11: [2023-02-03 16:35:06,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 9: [2023-02-03 16:35:06,355] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 9: [2023-02-03 16:35:06,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 13: [2023-02-03 16:35:06,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 13: [2023-02-03 16:35:06,356] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 13: [2023-02-03 16:35:06,357] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 13: [2023-02-03 16:35:06,357] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 13: [2023-02-03 16:35:06,357] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 13: [2023-02-03 16:35:06,357] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 13: [2023-02-03 16:35:06,357] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 13: [2023-02-03 16:35:06,357] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 11: [2023-02-03 16:35:06,358] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 11: [2023-02-03 16:35:06,358] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 11: [2023-02-03 16:35:06,358] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 13: [2023-02-03 16:35:06,358] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 3: [2023-02-03 16:35:06,359] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 3: [2023-02-03 16:35:06,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 7: [2023-02-03 16:35:06,361] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 7: [2023-02-03 16:35:06,361] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 7: [2023-02-03 16:35:06,361] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 15: [2023-02-03 16:35:06,361] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 15: [2023-02-03 16:35:06,361] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 15: [2023-02-03 16:35:06,361] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 15: [2023-02-03 16:35:06,361] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 2: [2023-02-03 16:35:06,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 3: [2023-02-03 16:35:06,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 2: [2023-02-03 16:35:06,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 2: [2023-02-03 16:35:06,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 2: [2023-02-03 16:35:06,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 2: [2023-02-03 16:35:06,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 2: [2023-02-03 16:35:06,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 13: [2023-02-03 16:35:06,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 3: [2023-02-03 16:35:06,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 5: [2023-02-03 16:35:06,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 5: [2023-02-03 16:35:06,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 3: [2023-02-03 16:35:06,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 5: [2023-02-03 16:35:06,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 5: [2023-02-03 16:35:06,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 5: [2023-02-03 16:35:06,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 13: [2023-02-03 16:35:06,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 13: [2023-02-03 16:35:06,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 13: [2023-02-03 16:35:06,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 13: [2023-02-03 16:35:06,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 13: [2023-02-03 16:35:06,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 15: [2023-02-03 16:35:06,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 15: [2023-02-03 16:35:06,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 15: [2023-02-03 16:35:06,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 15: [2023-02-03 16:35:06,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 2: [2023-02-03 16:35:06,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 2: [2023-02-03 16:35:06,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 2: [2023-02-03 16:35:06,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 2: [2023-02-03 16:35:06,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 2: [2023-02-03 16:35:06,367] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 2: [2023-02-03 16:35:06,367] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt... 5: [2023-02-03 16:35:06,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 5: [2023-02-03 16:35:06,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 7: [2023-02-03 16:35:06,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 7: [2023-02-03 16:35:06,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 7: [2023-02-03 16:35:06,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 7: [2023-02-03 16:35:06,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 7: [2023-02-03 16:35:06,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 12: [2023-02-03 16:35:06,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 7: [2023-02-03 16:35:06,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 7: [2023-02-03 16:35:06,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 12: [2023-02-03 16:35:06,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 10: [2023-02-03 16:35:06,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 10: [2023-02-03 16:35:06,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 7: [2023-02-03 16:35:06,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 12: [2023-02-03 16:35:06,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 13: [2023-02-03 16:35:06,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 12: [2023-02-03 16:35:06,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 12: [2023-02-03 16:35:06,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 12: [2023-02-03 16:35:06,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 12: [2023-02-03 16:35:06,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 12: [2023-02-03 16:35:06,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 13: [2023-02-03 16:35:06,390] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 12: [2023-02-03 16:35:06,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 7: [2023-02-03 16:35:06,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 7: [2023-02-03 16:35:06,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 7: [2023-02-03 16:35:06,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 7: [2023-02-03 16:35:06,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 12: [2023-02-03 16:35:06,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 10: [2023-02-03 16:35:06,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 10: [2023-02-03 16:35:06,395] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 12: [2023-02-03 16:35:06,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 10: [2023-02-03 16:35:06,398] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 7: [2023-02-03 16:35:06,399] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 2: [2023-02-03 16:35:06,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 2: [2023-02-03 16:35:06,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 2: [2023-02-03 16:35:06,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 2: [2023-02-03 16:35:06,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 2: [2023-02-03 16:35:06,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 13: [2023-02-03 16:35:06,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 13: [2023-02-03 16:35:06,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 10: [2023-02-03 16:35:06,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 10: [2023-02-03 16:35:06,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 10: [2023-02-03 16:35:06,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 10: [2023-02-03 16:35:06,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 10: [2023-02-03 16:35:06,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 13: [2023-02-03 16:35:06,408] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 10: [2023-02-03 16:35:06,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 12: [2023-02-03 16:35:06,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 12: [2023-02-03 16:35:06,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 12: [2023-02-03 16:35:06,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 12: [2023-02-03 16:35:06,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 12: [2023-02-03 16:35:06,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 13: [2023-02-03 16:35:06,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 13: [2023-02-03 16:35:06,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 13: [2023-02-03 16:35:06,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 13: [2023-02-03 16:35:06,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 13: [2023-02-03 16:35:06,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_08-model_00-model_states.pt. 2: [2023-02-03 16:35:06,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 10: [2023-02-03 16:35:06,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 10: [2023-02-03 16:35:06,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 10: [2023-02-03 16:35:06,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 10: [2023-02-03 16:35:06,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 10: [2023-02-03 16:35:06,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,425] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 13: [2023-02-03 16:35:06,431] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 13: [2023-02-03 16:35:06,431] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 13: [2023-02-03 16:35:06,432] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 13: [2023-02-03 16:35:06,432] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 13: [2023-02-03 16:35:06,433] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 13: [2023-02-03 16:35:06,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 1: [2023-02-03 16:35:06,437] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 1: [2023-02-03 16:35:06,437] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 1: [2023-02-03 16:35:06,437] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 1: [2023-02-03 16:35:06,437] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 1: [2023-02-03 16:35:06,437] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 1: [2023-02-03 16:35:06,437] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 1: [2023-02-03 16:35:06,437] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 1: [2023-02-03 16:35:06,437] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 1: [2023-02-03 16:35:06,439] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 1: [2023-02-03 16:35:06,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 1: [2023-02-03 16:35:06,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 1: [2023-02-03 16:35:06,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 1: [2023-02-03 16:35:06,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 1: [2023-02-03 16:35:06,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 1: [2023-02-03 16:35:06,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 1: [2023-02-03 16:35:06,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 6: [2023-02-03 16:35:06,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 6: [2023-02-03 16:35:06,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 6: [2023-02-03 16:35:06,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 6: [2023-02-03 16:35:06,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 6: [2023-02-03 16:35:06,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 6: [2023-02-03 16:35:06,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 6: [2023-02-03 16:35:06,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 6: [2023-02-03 16:35:06,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 6: [2023-02-03 16:35:06,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 6: [2023-02-03 16:35:06,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 6: [2023-02-03 16:35:06,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 6: [2023-02-03 16:35:06,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 6: [2023-02-03 16:35:06,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 6: [2023-02-03 16:35:06,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 6: [2023-02-03 16:35:06,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 6: [2023-02-03 16:35:06,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 1: [2023-02-03 16:35:06,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 1: [2023-02-03 16:35:06,477] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 1: [2023-02-03 16:35:06,477] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 1: [2023-02-03 16:35:06,477] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 1: [2023-02-03 16:35:06,477] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 1: [2023-02-03 16:35:06,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 1: [2023-02-03 16:35:06,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 1: [2023-02-03 16:35:06,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 1: [2023-02-03 16:35:06,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 1: [2023-02-03 16:35:06,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 1: [2023-02-03 16:35:06,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 1: [2023-02-03 16:35:06,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 1: [2023-02-03 16:35:06,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,496] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 6: [2023-02-03 16:35:06,496] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 6: [2023-02-03 16:35:06,496] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 1: [2023-02-03 16:35:06,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 1: [2023-02-03 16:35:06,499] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 1: [2023-02-03 16:35:06,499] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,501] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 6: [2023-02-03 16:35:06,504] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 6: [2023-02-03 16:35:06,504] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 6: [2023-02-03 16:35:06,504] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 6: [2023-02-03 16:35:06,504] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 6: [2023-02-03 16:35:06,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,522] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,522] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 0: [2023-02-03 16:35:06,542] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 0: [2023-02-03 16:35:06,542] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 0: [2023-02-03 16:35:06,542] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 0: [2023-02-03 16:35:06,543] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 0: [2023-02-03 16:35:06,543] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 0: [2023-02-03 16:35:06,543] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 0: [2023-02-03 16:35:06,543] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 0: [2023-02-03 16:35:06,543] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 0: [2023-02-03 16:35:06,548] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 0: [2023-02-03 16:35:06,548] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 0: [2023-02-03 16:35:06,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 0: [2023-02-03 16:35:06,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 0: [2023-02-03 16:35:06,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 0: [2023-02-03 16:35:06,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 0: [2023-02-03 16:35:06,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 0: [2023-02-03 16:35:06,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 4: [2023-02-03 16:35:06,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 4: [2023-02-03 16:35:06,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 4: [2023-02-03 16:35:06,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 4: [2023-02-03 16:35:06,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 4: [2023-02-03 16:35:06,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 4: [2023-02-03 16:35:06,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 4: [2023-02-03 16:35:06,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 4: [2023-02-03 16:35:06,553] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 9: [2023-02-03 16:35:06,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 9: [2023-02-03 16:35:06,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 8: [2023-02-03 16:35:06,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 8: [2023-02-03 16:35:06,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 9: [2023-02-03 16:35:06,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 9: [2023-02-03 16:35:06,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 9: [2023-02-03 16:35:06,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 9: [2023-02-03 16:35:06,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 9: [2023-02-03 16:35:06,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 9: [2023-02-03 16:35:06,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 4: [2023-02-03 16:35:06,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 9: [2023-02-03 16:35:06,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 9: [2023-02-03 16:35:06,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 4: [2023-02-03 16:35:06,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 4: [2023-02-03 16:35:06,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 4: [2023-02-03 16:35:06,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 8: [2023-02-03 16:35:06,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 4: [2023-02-03 16:35:06,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 8: [2023-02-03 16:35:06,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 4: [2023-02-03 16:35:06,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 4: [2023-02-03 16:35:06,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 4: [2023-02-03 16:35:06,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 8: [2023-02-03 16:35:06,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 8: [2023-02-03 16:35:06,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 8: [2023-02-03 16:35:06,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 8: [2023-02-03 16:35:06,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 8: [2023-02-03 16:35:06,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 8: [2023-02-03 16:35:06,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 9: [2023-02-03 16:35:06,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 9: [2023-02-03 16:35:06,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 9: [2023-02-03 16:35:06,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 9: [2023-02-03 16:35:06,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 9: [2023-02-03 16:35:06,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 9: [2023-02-03 16:35:06,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 8: [2023-02-03 16:35:06,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 8: [2023-02-03 16:35:06,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 8: [2023-02-03 16:35:06,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 8: [2023-02-03 16:35:06,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 8: [2023-02-03 16:35:06,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 8: [2023-02-03 16:35:06,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 14: [2023-02-03 16:35:06,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 14: [2023-02-03 16:35:06,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 14: [2023-02-03 16:35:06,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 14: [2023-02-03 16:35:06,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 14: [2023-02-03 16:35:06,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 14: [2023-02-03 16:35:06,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 14: [2023-02-03 16:35:06,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 14: [2023-02-03 16:35:06,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 14: [2023-02-03 16:35:06,568] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 14: [2023-02-03 16:35:06,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 14: [2023-02-03 16:35:06,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 14: [2023-02-03 16:35:06,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 14: [2023-02-03 16:35:06,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 14: [2023-02-03 16:35:06,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 14: [2023-02-03 16:35:06,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 14: [2023-02-03 16:35:06,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 2: [2023-02-03 16:35:06,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 2: [2023-02-03 16:35:06,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 2: [2023-02-03 16:35:06,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 2: [2023-02-03 16:35:06,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 2: [2023-02-03 16:35:06,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 2: [2023-02-03 16:35:06,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 2: [2023-02-03 16:35:06,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 3: [2023-02-03 16:35:06,578] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 9: [2023-02-03 16:35:06,578] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 2: [2023-02-03 16:35:06,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 3: [2023-02-03 16:35:06,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 3: [2023-02-03 16:35:06,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 3: [2023-02-03 16:35:06,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 3: [2023-02-03 16:35:06,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 3: [2023-02-03 16:35:06,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 3: [2023-02-03 16:35:06,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 3: [2023-02-03 16:35:06,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 3: [2023-02-03 16:35:06,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 3: [2023-02-03 16:35:06,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 3: [2023-02-03 16:35:06,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 9: [2023-02-03 16:35:06,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 5: [2023-02-03 16:35:06,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 5: [2023-02-03 16:35:06,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 5: [2023-02-03 16:35:06,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 5: [2023-02-03 16:35:06,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 5: [2023-02-03 16:35:06,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 5: [2023-02-03 16:35:06,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 5: [2023-02-03 16:35:06,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 5: [2023-02-03 16:35:06,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 3: [2023-02-03 16:35:06,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 3: [2023-02-03 16:35:06,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 3: [2023-02-03 16:35:06,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 3: [2023-02-03 16:35:06,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 3: [2023-02-03 16:35:06,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 5: [2023-02-03 16:35:06,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 5: [2023-02-03 16:35:06,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 8: [2023-02-03 16:35:06,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 8: [2023-02-03 16:35:06,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 5: [2023-02-03 16:35:06,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 5: [2023-02-03 16:35:06,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 5: [2023-02-03 16:35:06,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 4: [2023-02-03 16:35:06,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 5: [2023-02-03 16:35:06,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 5: [2023-02-03 16:35:06,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 5: [2023-02-03 16:35:06,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 14: [2023-02-03 16:35:06,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 9: [2023-02-03 16:35:06,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 4: [2023-02-03 16:35:06,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 4: [2023-02-03 16:35:06,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 4: [2023-02-03 16:35:06,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 4: [2023-02-03 16:35:06,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 4: [2023-02-03 16:35:06,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 4: [2023-02-03 16:35:06,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 4: [2023-02-03 16:35:06,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 0: [2023-02-03 16:35:06,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 9: [2023-02-03 16:35:06,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 0: [2023-02-03 16:35:06,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 8: [2023-02-03 16:35:06,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 4: [2023-02-03 16:35:06,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 8: [2023-02-03 16:35:06,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 8: [2023-02-03 16:35:06,605] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 8: [2023-02-03 16:35:06,605] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 9: [2023-02-03 16:35:06,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 9: [2023-02-03 16:35:06,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 14: [2023-02-03 16:35:06,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 15: [2023-02-03 16:35:06,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 15: [2023-02-03 16:35:06,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 15: [2023-02-03 16:35:06,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 15: [2023-02-03 16:35:06,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 15: [2023-02-03 16:35:06,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 15: [2023-02-03 16:35:06,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 15: [2023-02-03 16:35:06,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 15: [2023-02-03 16:35:06,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 8: [2023-02-03 16:35:06,609] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 8: [2023-02-03 16:35:06,609] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 8: [2023-02-03 16:35:06,609] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 9: [2023-02-03 16:35:06,610] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 4: [2023-02-03 16:35:06,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 9: [2023-02-03 16:35:06,610] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 15: [2023-02-03 16:35:06,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 9: [2023-02-03 16:35:06,610] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 9: [2023-02-03 16:35:06,610] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 4: [2023-02-03 16:35:06,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 4: [2023-02-03 16:35:06,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 15: [2023-02-03 16:35:06,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 7: [2023-02-03 16:35:06,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 15: [2023-02-03 16:35:06,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 15: [2023-02-03 16:35:06,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 15: [2023-02-03 16:35:06,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 15: [2023-02-03 16:35:06,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 15: [2023-02-03 16:35:06,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 15: [2023-02-03 16:35:06,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 7: [2023-02-03 16:35:06,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 11: [2023-02-03 16:35:06,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 11: [2023-02-03 16:35:06,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 7: [2023-02-03 16:35:06,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 11: [2023-02-03 16:35:06,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 7: [2023-02-03 16:35:06,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 11: [2023-02-03 16:35:06,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 11: [2023-02-03 16:35:06,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 11: [2023-02-03 16:35:06,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 11: [2023-02-03 16:35:06,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 12: [2023-02-03 16:35:06,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 11: [2023-02-03 16:35:06,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 7: [2023-02-03 16:35:06,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 7: [2023-02-03 16:35:06,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 7: [2023-02-03 16:35:06,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 7: [2023-02-03 16:35:06,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 10: [2023-02-03 16:35:06,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 7: [2023-02-03 16:35:06,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 0: [2023-02-03 16:35:06,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 0: [2023-02-03 16:35:06,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 0: [2023-02-03 16:35:06,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 0: [2023-02-03 16:35:06,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 0: [2023-02-03 16:35:06,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 0: [2023-02-03 16:35:06,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 12: [2023-02-03 16:35:06,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 12: [2023-02-03 16:35:06,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 12: [2023-02-03 16:35:06,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 7: [2023-02-03 16:35:06,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 4: [2023-02-03 16:35:06,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 12: [2023-02-03 16:35:06,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 12: [2023-02-03 16:35:06,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 12: [2023-02-03 16:35:06,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 12: [2023-02-03 16:35:06,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 7: [2023-02-03 16:35:06,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 11: [2023-02-03 16:35:06,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 3: [2023-02-03 16:35:06,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 0: [2023-02-03 16:35:06,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 12: [2023-02-03 16:35:06,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 11: [2023-02-03 16:35:06,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 11: [2023-02-03 16:35:06,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 10: [2023-02-03 16:35:06,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 10: [2023-02-03 16:35:06,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 10: [2023-02-03 16:35:06,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 3: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 3: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 10: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 10: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 10: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 10: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 10: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 7: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 11: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 4: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 11: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 12: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 4: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 4: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 11: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 11: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 12: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 11: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 2: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 2: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 2: [2023-02-03 16:35:06,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 0: [2023-02-03 16:35:06,617] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 7: [2023-02-03 16:35:06,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 7: [2023-02-03 16:35:06,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 7: [2023-02-03 16:35:06,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 7: [2023-02-03 16:35:06,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 10: [2023-02-03 16:35:06,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 5: [2023-02-03 16:35:06,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 12: [2023-02-03 16:35:06,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 12: [2023-02-03 16:35:06,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 12: [2023-02-03 16:35:06,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 12: [2023-02-03 16:35:06,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 12: [2023-02-03 16:35:06,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 10: [2023-02-03 16:35:06,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 13: [2023-02-03 16:35:06,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 10: [2023-02-03 16:35:06,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 10: [2023-02-03 16:35:06,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 10: [2023-02-03 16:35:06,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 10: [2023-02-03 16:35:06,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 10: [2023-02-03 16:35:06,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 2: [2023-02-03 16:35:06,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 2: [2023-02-03 16:35:06,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 2: [2023-02-03 16:35:06,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 2: [2023-02-03 16:35:06,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 13: [2023-02-03 16:35:06,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 13: [2023-02-03 16:35:06,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 8: [2023-02-03 16:35:06,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 8: [2023-02-03 16:35:06,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 13: [2023-02-03 16:35:06,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 13: [2023-02-03 16:35:06,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 13: [2023-02-03 16:35:06,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 13: [2023-02-03 16:35:06,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 13: [2023-02-03 16:35:06,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 9: [2023-02-03 16:35:06,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 9: [2023-02-03 16:35:06,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 8: [2023-02-03 16:35:06,624] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 14: [2023-02-03 16:35:06,624] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 3: [2023-02-03 16:35:06,624] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 14: [2023-02-03 16:35:06,624] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 14: [2023-02-03 16:35:06,624] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 3: [2023-02-03 16:35:06,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 5: [2023-02-03 16:35:06,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 14: [2023-02-03 16:35:06,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 14: [2023-02-03 16:35:06,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 14: [2023-02-03 16:35:06,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 14: [2023-02-03 16:35:06,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 13: [2023-02-03 16:35:06,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 3: [2023-02-03 16:35:06,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 3: [2023-02-03 16:35:06,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 8: [2023-02-03 16:35:06,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 5: [2023-02-03 16:35:06,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 5: [2023-02-03 16:35:06,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 8: [2023-02-03 16:35:06,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 5: [2023-02-03 16:35:06,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 9: [2023-02-03 16:35:06,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 3: [2023-02-03 16:35:06,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 13: [2023-02-03 16:35:06,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 13: [2023-02-03 16:35:06,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 13: [2023-02-03 16:35:06,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 13: [2023-02-03 16:35:06,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt... 8: [2023-02-03 16:35:06,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 9: [2023-02-03 16:35:06,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 3: [2023-02-03 16:35:06,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 3: [2023-02-03 16:35:06,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 5: [2023-02-03 16:35:06,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 5: [2023-02-03 16:35:06,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 5: [2023-02-03 16:35:06,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 5: [2023-02-03 16:35:06,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 9: [2023-02-03 16:35:06,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 9: [2023-02-03 16:35:06,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 2: [2023-02-03 16:35:06,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 2: [2023-02-03 16:35:06,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 2: [2023-02-03 16:35:06,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 2: [2023-02-03 16:35:06,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 0: [2023-02-03 16:35:06,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 0: [2023-02-03 16:35:06,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 0: [2023-02-03 16:35:06,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 0: [2023-02-03 16:35:06,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 0: [2023-02-03 16:35:06,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 0: [2023-02-03 16:35:06,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 8: [2023-02-03 16:35:06,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 3: [2023-02-03 16:35:06,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 15: [2023-02-03 16:35:06,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 3: [2023-02-03 16:35:06,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 3: [2023-02-03 16:35:06,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 3: [2023-02-03 16:35:06,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 5: [2023-02-03 16:35:06,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 3: [2023-02-03 16:35:06,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 2: [2023-02-03 16:35:06,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 2: [2023-02-03 16:35:06,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 2: [2023-02-03 16:35:06,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 2: [2023-02-03 16:35:06,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 14: [2023-02-03 16:35:06,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 14: [2023-02-03 16:35:06,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 5: [2023-02-03 16:35:06,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 5: [2023-02-03 16:35:06,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 15: [2023-02-03 16:35:06,645] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 10: [2023-02-03 16:35:06,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 14: [2023-02-03 16:35:06,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 5: [2023-02-03 16:35:06,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 7: [2023-02-03 16:35:06,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 7: [2023-02-03 16:35:06,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 7: [2023-02-03 16:35:06,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 5: [2023-02-03 16:35:06,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 14: [2023-02-03 16:35:06,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 10: [2023-02-03 16:35:06,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 14: [2023-02-03 16:35:06,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 5: [2023-02-03 16:35:06,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 5: [2023-02-03 16:35:06,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 11: [2023-02-03 16:35:06,649] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 11: [2023-02-03 16:35:06,649] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 15: [2023-02-03 16:35:06,649] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 15: [2023-02-03 16:35:06,649] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 15: [2023-02-03 16:35:06,649] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 15: [2023-02-03 16:35:06,649] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 12: [2023-02-03 16:35:06,649] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 14: [2023-02-03 16:35:06,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 14: [2023-02-03 16:35:06,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 12: [2023-02-03 16:35:06,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 12: [2023-02-03 16:35:06,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 12: [2023-02-03 16:35:06,651] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 15: [2023-02-03 16:35:06,652] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 11: [2023-02-03 16:35:06,652] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 15: [2023-02-03 16:35:06,652] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 13: [2023-02-03 16:35:06,653] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 3: [2023-02-03 16:35:06,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 15: [2023-02-03 16:35:06,655] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 7: [2023-02-03 16:35:06,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 7: [2023-02-03 16:35:06,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 7: [2023-02-03 16:35:06,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 10: [2023-02-03 16:35:06,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 11: [2023-02-03 16:35:06,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 7: [2023-02-03 16:35:06,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 11: [2023-02-03 16:35:06,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 7: [2023-02-03 16:35:06,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 10: [2023-02-03 16:35:06,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 7: [2023-02-03 16:35:06,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 7: [2023-02-03 16:35:06,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 10: [2023-02-03 16:35:06,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 7: [2023-02-03 16:35:06,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 15: [2023-02-03 16:35:06,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 11: [2023-02-03 16:35:06,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 12: [2023-02-03 16:35:06,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 11: [2023-02-03 16:35:06,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 11: [2023-02-03 16:35:06,663] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 11: [2023-02-03 16:35:06,663] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 12: [2023-02-03 16:35:06,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 12: [2023-02-03 16:35:06,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 12: [2023-02-03 16:35:06,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 12: [2023-02-03 16:35:06,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 12: [2023-02-03 16:35:06,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 12: [2023-02-03 16:35:06,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 12: [2023-02-03 16:35:06,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 11: [2023-02-03 16:35:06,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 15: [2023-02-03 16:35:06,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 15: [2023-02-03 16:35:06,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 15: [2023-02-03 16:35:06,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 11: [2023-02-03 16:35:06,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 15: [2023-02-03 16:35:06,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 10: [2023-02-03 16:35:06,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 10: [2023-02-03 16:35:06,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 10: [2023-02-03 16:35:06,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 10: [2023-02-03 16:35:06,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 10: [2023-02-03 16:35:06,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 15: [2023-02-03 16:35:06,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 15: [2023-02-03 16:35:06,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,673] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 13: [2023-02-03 16:35:06,674] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 10: [2023-02-03 16:35:06,674] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 11: [2023-02-03 16:35:06,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,676] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 13: [2023-02-03 16:35:06,676] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 13: [2023-02-03 16:35:06,676] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 11: [2023-02-03 16:35:06,677] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 7: [2023-02-03 16:35:06,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 11: [2023-02-03 16:35:06,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 7: [2023-02-03 16:35:06,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 12: [2023-02-03 16:35:06,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 7: [2023-02-03 16:35:06,680] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 11: [2023-02-03 16:35:06,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 11: [2023-02-03 16:35:06,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 7: [2023-02-03 16:35:06,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 12: [2023-02-03 16:35:06,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 12: [2023-02-03 16:35:06,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 12: [2023-02-03 16:35:06,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 7: [2023-02-03 16:35:06,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_09-model_00-model_states.pt. 10: [2023-02-03 16:35:06,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 10: [2023-02-03 16:35:06,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 10: [2023-02-03 16:35:06,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 10: [2023-02-03 16:35:06,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 10: [2023-02-03 16:35:06,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 1: [2023-02-03 16:35:06,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 1: [2023-02-03 16:35:06,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 1: [2023-02-03 16:35:06,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 1: [2023-02-03 16:35:06,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 1: [2023-02-03 16:35:06,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 1: [2023-02-03 16:35:06,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 1: [2023-02-03 16:35:06,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 1: [2023-02-03 16:35:06,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 1: [2023-02-03 16:35:06,745] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 1: [2023-02-03 16:35:06,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 1: [2023-02-03 16:35:06,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 1: [2023-02-03 16:35:06,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 1: [2023-02-03 16:35:06,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 1: [2023-02-03 16:35:06,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 1: [2023-02-03 16:35:06,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 1: [2023-02-03 16:35:06,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 1: [2023-02-03 16:35:06,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 6: [2023-02-03 16:35:06,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 6: [2023-02-03 16:35:06,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 1: [2023-02-03 16:35:06,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 6: [2023-02-03 16:35:06,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 6: [2023-02-03 16:35:06,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 6: [2023-02-03 16:35:06,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 6: [2023-02-03 16:35:06,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 6: [2023-02-03 16:35:06,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 6: [2023-02-03 16:35:06,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 1: [2023-02-03 16:35:06,783] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 6: [2023-02-03 16:35:06,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 1: [2023-02-03 16:35:06,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 6: [2023-02-03 16:35:06,785] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 1: [2023-02-03 16:35:06,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 1: [2023-02-03 16:35:06,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 1: [2023-02-03 16:35:06,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 1: [2023-02-03 16:35:06,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 6: [2023-02-03 16:35:06,786] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 1: [2023-02-03 16:35:06,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 1: [2023-02-03 16:35:06,796] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 1: [2023-02-03 16:35:06,802] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 1: [2023-02-03 16:35:06,802] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 1: [2023-02-03 16:35:06,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 1: [2023-02-03 16:35:06,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 1: [2023-02-03 16:35:06,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 1: [2023-02-03 16:35:06,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 6: [2023-02-03 16:35:06,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 4: [2023-02-03 16:35:06,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 4: [2023-02-03 16:35:06,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 4: [2023-02-03 16:35:06,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 4: [2023-02-03 16:35:06,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 4: [2023-02-03 16:35:06,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 4: [2023-02-03 16:35:06,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 4: [2023-02-03 16:35:06,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 4: [2023-02-03 16:35:06,818] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 6: [2023-02-03 16:35:06,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 4: [2023-02-03 16:35:06,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 4: [2023-02-03 16:35:06,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 4: [2023-02-03 16:35:06,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 4: [2023-02-03 16:35:06,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 4: [2023-02-03 16:35:06,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 4: [2023-02-03 16:35:06,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 4: [2023-02-03 16:35:06,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 4: [2023-02-03 16:35:06,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 9: [2023-02-03 16:35:06,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 6: [2023-02-03 16:35:06,826] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 9: [2023-02-03 16:35:06,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,828] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,828] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,828] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,828] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,828] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 6: [2023-02-03 16:35:06,828] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,831] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 9: [2023-02-03 16:35:06,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 9: [2023-02-03 16:35:06,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 9: [2023-02-03 16:35:06,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 9: [2023-02-03 16:35:06,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 6: [2023-02-03 16:35:06,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 6: [2023-02-03 16:35:06,837] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 2: [2023-02-03 16:35:06,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 2: [2023-02-03 16:35:06,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 2: [2023-02-03 16:35:06,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 2: [2023-02-03 16:35:06,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 2: [2023-02-03 16:35:06,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 2: [2023-02-03 16:35:06,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 2: [2023-02-03 16:35:06,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 2: [2023-02-03 16:35:06,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 5: [2023-02-03 16:35:06,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 5: [2023-02-03 16:35:06,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 5: [2023-02-03 16:35:06,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 5: [2023-02-03 16:35:06,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 5: [2023-02-03 16:35:06,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 5: [2023-02-03 16:35:06,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 5: [2023-02-03 16:35:06,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 5: [2023-02-03 16:35:06,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 5: [2023-02-03 16:35:06,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 2: [2023-02-03 16:35:06,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 2: [2023-02-03 16:35:06,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 2: [2023-02-03 16:35:06,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 2: [2023-02-03 16:35:06,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 2: [2023-02-03 16:35:06,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 2: [2023-02-03 16:35:06,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 2: [2023-02-03 16:35:06,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 2: [2023-02-03 16:35:06,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 5: [2023-02-03 16:35:06,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 5: [2023-02-03 16:35:06,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 5: [2023-02-03 16:35:06,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 5: [2023-02-03 16:35:06,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 5: [2023-02-03 16:35:06,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 5: [2023-02-03 16:35:06,845] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 5: [2023-02-03 16:35:06,845] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 6: [2023-02-03 16:35:06,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 6: [2023-02-03 16:35:06,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 6: [2023-02-03 16:35:06,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 4: [2023-02-03 16:35:06,855] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 6: [2023-02-03 16:35:06,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 4: [2023-02-03 16:35:06,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 4: [2023-02-03 16:35:06,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 4: [2023-02-03 16:35:06,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 0: [2023-02-03 16:35:06,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 0: [2023-02-03 16:35:06,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 0: [2023-02-03 16:35:06,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 0: [2023-02-03 16:35:06,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 0: [2023-02-03 16:35:06,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 0: [2023-02-03 16:35:06,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 0: [2023-02-03 16:35:06,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 4: [2023-02-03 16:35:06,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 4: [2023-02-03 16:35:06,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 4: [2023-02-03 16:35:06,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 4: [2023-02-03 16:35:06,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 0: [2023-02-03 16:35:06,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 0: [2023-02-03 16:35:06,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 0: [2023-02-03 16:35:06,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 0: [2023-02-03 16:35:06,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 0: [2023-02-03 16:35:06,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 0: [2023-02-03 16:35:06,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 0: [2023-02-03 16:35:06,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 0: [2023-02-03 16:35:06,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 0: [2023-02-03 16:35:06,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 3: [2023-02-03 16:35:06,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 3: [2023-02-03 16:35:06,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 3: [2023-02-03 16:35:06,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 3: [2023-02-03 16:35:06,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 3: [2023-02-03 16:35:06,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 3: [2023-02-03 16:35:06,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 3: [2023-02-03 16:35:06,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 3: [2023-02-03 16:35:06,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 3: [2023-02-03 16:35:06,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 4: [2023-02-03 16:35:06,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 3: [2023-02-03 16:35:06,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 3: [2023-02-03 16:35:06,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 8: [2023-02-03 16:35:06,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 8: [2023-02-03 16:35:06,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 5: [2023-02-03 16:35:06,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 4: [2023-02-03 16:35:06,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 9: [2023-02-03 16:35:06,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 3: [2023-02-03 16:35:06,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 4: [2023-02-03 16:35:06,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 4: [2023-02-03 16:35:06,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 3: [2023-02-03 16:35:06,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 3: [2023-02-03 16:35:06,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 3: [2023-02-03 16:35:06,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 3: [2023-02-03 16:35:06,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 8: [2023-02-03 16:35:06,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 8: [2023-02-03 16:35:06,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 8: [2023-02-03 16:35:06,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 8: [2023-02-03 16:35:06,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 8: [2023-02-03 16:35:06,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 8: [2023-02-03 16:35:06,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 8: [2023-02-03 16:35:06,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 8: [2023-02-03 16:35:06,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 9: [2023-02-03 16:35:06,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 2: [2023-02-03 16:35:06,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 2: [2023-02-03 16:35:06,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 2: [2023-02-03 16:35:06,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 5: [2023-02-03 16:35:06,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 4: [2023-02-03 16:35:06,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 2: [2023-02-03 16:35:06,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 8: [2023-02-03 16:35:06,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 8: [2023-02-03 16:35:06,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 8: [2023-02-03 16:35:06,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 8: [2023-02-03 16:35:06,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 8: [2023-02-03 16:35:06,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 8: [2023-02-03 16:35:06,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 4: [2023-02-03 16:35:06,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 4: [2023-02-03 16:35:06,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 4: [2023-02-03 16:35:06,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 5: [2023-02-03 16:35:06,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 5: [2023-02-03 16:35:06,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 5: [2023-02-03 16:35:06,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 5: [2023-02-03 16:35:06,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 5: [2023-02-03 16:35:06,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 5: [2023-02-03 16:35:06,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 2: [2023-02-03 16:35:06,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 2: [2023-02-03 16:35:06,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 2: [2023-02-03 16:35:06,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 2: [2023-02-03 16:35:06,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,890] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 5: [2023-02-03 16:35:06,892] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 2: [2023-02-03 16:35:06,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 2: [2023-02-03 16:35:06,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 15: [2023-02-03 16:35:06,897] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 15: [2023-02-03 16:35:06,897] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 15: [2023-02-03 16:35:06,897] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 15: [2023-02-03 16:35:06,897] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 15: [2023-02-03 16:35:06,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 15: [2023-02-03 16:35:06,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 15: [2023-02-03 16:35:06,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 15: [2023-02-03 16:35:06,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 2: [2023-02-03 16:35:06,898] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 2: [2023-02-03 16:35:06,898] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 3: [2023-02-03 16:35:06,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,899] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 5: [2023-02-03 16:35:06,899] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 14: [2023-02-03 16:35:06,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 14: [2023-02-03 16:35:06,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 5: [2023-02-03 16:35:06,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 14: [2023-02-03 16:35:06,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 5: [2023-02-03 16:35:06,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 14: [2023-02-03 16:35:06,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 14: [2023-02-03 16:35:06,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 14: [2023-02-03 16:35:06,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 14: [2023-02-03 16:35:06,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 5: [2023-02-03 16:35:06,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 15: [2023-02-03 16:35:06,901] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 5: [2023-02-03 16:35:06,901] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 14: [2023-02-03 16:35:06,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 9: [2023-02-03 16:35:06,901] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 15: [2023-02-03 16:35:06,901] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 15: [2023-02-03 16:35:06,901] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 15: [2023-02-03 16:35:06,901] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 15: [2023-02-03 16:35:06,901] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 15: [2023-02-03 16:35:06,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 15: [2023-02-03 16:35:06,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 9: [2023-02-03 16:35:06,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 14: [2023-02-03 16:35:06,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 15: [2023-02-03 16:35:06,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 5: [2023-02-03 16:35:06,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 5: [2023-02-03 16:35:06,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 9: [2023-02-03 16:35:06,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 9: [2023-02-03 16:35:06,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 2: [2023-02-03 16:35:06,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 14: [2023-02-03 16:35:06,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 8: [2023-02-03 16:35:06,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 8: [2023-02-03 16:35:06,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 14: [2023-02-03 16:35:06,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 14: [2023-02-03 16:35:06,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 14: [2023-02-03 16:35:06,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 14: [2023-02-03 16:35:06,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 14: [2023-02-03 16:35:06,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 14: [2023-02-03 16:35:06,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 3: [2023-02-03 16:35:06,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 3: [2023-02-03 16:35:06,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 2: [2023-02-03 16:35:06,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 2: [2023-02-03 16:35:06,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 2: [2023-02-03 16:35:06,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 3: [2023-02-03 16:35:06,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 10: [2023-02-03 16:35:06,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 10: [2023-02-03 16:35:06,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 10: [2023-02-03 16:35:06,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 10: [2023-02-03 16:35:06,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 10: [2023-02-03 16:35:06,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 10: [2023-02-03 16:35:06,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 10: [2023-02-03 16:35:06,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 10: [2023-02-03 16:35:06,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 3: [2023-02-03 16:35:06,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 10: [2023-02-03 16:35:06,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 10: [2023-02-03 16:35:06,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 3: [2023-02-03 16:35:06,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 3: [2023-02-03 16:35:06,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 3: [2023-02-03 16:35:06,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 7: [2023-02-03 16:35:06,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 12: [2023-02-03 16:35:06,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 12: [2023-02-03 16:35:06,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 7: [2023-02-03 16:35:06,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 7: [2023-02-03 16:35:06,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 7: [2023-02-03 16:35:06,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 7: [2023-02-03 16:35:06,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 7: [2023-02-03 16:35:06,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 12: [2023-02-03 16:35:06,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 12: [2023-02-03 16:35:06,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 12: [2023-02-03 16:35:06,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 7: [2023-02-03 16:35:06,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 12: [2023-02-03 16:35:06,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 12: [2023-02-03 16:35:06,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 7: [2023-02-03 16:35:06,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 12: [2023-02-03 16:35:06,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 12: [2023-02-03 16:35:06,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 13: [2023-02-03 16:35:06,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 10: [2023-02-03 16:35:06,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 0: [2023-02-03 16:35:06,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 12: [2023-02-03 16:35:06,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 7: [2023-02-03 16:35:06,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 7: [2023-02-03 16:35:06,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 7: [2023-02-03 16:35:06,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 10: [2023-02-03 16:35:06,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 10: [2023-02-03 16:35:06,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 10: [2023-02-03 16:35:06,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 7: [2023-02-03 16:35:06,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 10: [2023-02-03 16:35:06,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 12: [2023-02-03 16:35:06,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 10: [2023-02-03 16:35:06,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 3: [2023-02-03 16:35:06,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 13: [2023-02-03 16:35:06,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 13: [2023-02-03 16:35:06,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 13: [2023-02-03 16:35:06,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 13: [2023-02-03 16:35:06,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 13: [2023-02-03 16:35:06,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 0: [2023-02-03 16:35:06,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 13: [2023-02-03 16:35:06,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 7: [2023-02-03 16:35:06,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 7: [2023-02-03 16:35:06,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 12: [2023-02-03 16:35:06,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 7: [2023-02-03 16:35:06,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 7: [2023-02-03 16:35:06,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 12: [2023-02-03 16:35:06,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 12: [2023-02-03 16:35:06,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 12: [2023-02-03 16:35:06,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 12: [2023-02-03 16:35:06,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 8: [2023-02-03 16:35:06,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 8: [2023-02-03 16:35:06,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 3: [2023-02-03 16:35:06,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 3: [2023-02-03 16:35:06,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 13: [2023-02-03 16:35:06,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 8: [2023-02-03 16:35:06,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 8: [2023-02-03 16:35:06,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 8: [2023-02-03 16:35:06,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 8: [2023-02-03 16:35:06,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 8: [2023-02-03 16:35:06,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 0: [2023-02-03 16:35:06,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 0: [2023-02-03 16:35:06,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 0: [2023-02-03 16:35:06,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 0: [2023-02-03 16:35:06,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 0: [2023-02-03 16:35:06,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 0: [2023-02-03 16:35:06,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 3: [2023-02-03 16:35:06,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 14: [2023-02-03 16:35:06,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 3: [2023-02-03 16:35:06,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 3: [2023-02-03 16:35:06,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 0: [2023-02-03 16:35:06,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 0: [2023-02-03 16:35:06,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 3: [2023-02-03 16:35:06,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 15: [2023-02-03 16:35:06,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 3: [2023-02-03 16:35:06,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 15: [2023-02-03 16:35:06,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 15: [2023-02-03 16:35:06,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 15: [2023-02-03 16:35:06,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 15: [2023-02-03 16:35:06,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 15: [2023-02-03 16:35:06,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 15: [2023-02-03 16:35:06,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 15: [2023-02-03 16:35:06,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 8: [2023-02-03 16:35:06,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 12: [2023-02-03 16:35:06,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 8: [2023-02-03 16:35:06,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 10: [2023-02-03 16:35:06,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 10: [2023-02-03 16:35:06,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 8: [2023-02-03 16:35:06,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 14: [2023-02-03 16:35:06,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 8: [2023-02-03 16:35:06,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 0: [2023-02-03 16:35:06,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 0: [2023-02-03 16:35:06,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:06,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 8: [2023-02-03 16:35:06,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:06,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 7: [2023-02-03 16:35:06,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 15: [2023-02-03 16:35:06,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 8: [2023-02-03 16:35:06,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 0: [2023-02-03 16:35:06,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 0: [2023-02-03 16:35:06,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:06,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 14: [2023-02-03 16:35:06,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 14: [2023-02-03 16:35:06,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 14: [2023-02-03 16:35:06,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 0: [2023-02-03 16:35:06,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 0: [2023-02-03 16:35:06,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 13: [2023-02-03 16:35:06,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 12: [2023-02-03 16:35:06,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 12: [2023-02-03 16:35:06,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 15: [2023-02-03 16:35:06,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 15: [2023-02-03 16:35:06,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 12: [2023-02-03 16:35:06,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 13: [2023-02-03 16:35:06,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 14: [2023-02-03 16:35:06,958] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 14: [2023-02-03 16:35:06,958] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 14: [2023-02-03 16:35:06,958] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 7: [2023-02-03 16:35:06,959] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 10: [2023-02-03 16:35:06,959] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 10: [2023-02-03 16:35:06,959] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 10: [2023-02-03 16:35:06,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 15: [2023-02-03 16:35:06,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 15: [2023-02-03 16:35:06,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 15: [2023-02-03 16:35:06,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 15: [2023-02-03 16:35:06,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 15: [2023-02-03 16:35:06,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 8: [2023-02-03 16:35:06,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 12: [2023-02-03 16:35:06,963] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 12: [2023-02-03 16:35:06,963] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 12: [2023-02-03 16:35:06,963] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 12: [2023-02-03 16:35:06,963] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 10: [2023-02-03 16:35:06,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 10: [2023-02-03 16:35:06,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 10: [2023-02-03 16:35:06,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 10: [2023-02-03 16:35:06,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 10: [2023-02-03 16:35:06,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 12: [2023-02-03 16:35:06,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 7: [2023-02-03 16:35:06,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 12: [2023-02-03 16:35:06,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:06,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:06,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:06,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 13: [2023-02-03 16:35:06,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:06,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 7: [2023-02-03 16:35:06,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 7: [2023-02-03 16:35:06,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 13: [2023-02-03 16:35:06,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 12: [2023-02-03 16:35:06,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 10: [2023-02-03 16:35:06,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 13: [2023-02-03 16:35:06,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 13: [2023-02-03 16:35:06,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 13: [2023-02-03 16:35:06,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 14: [2023-02-03 16:35:06,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 14: [2023-02-03 16:35:06,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 13: [2023-02-03 16:35:06,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 13: [2023-02-03 16:35:06,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 14: [2023-02-03 16:35:06,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 14: [2023-02-03 16:35:06,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:06,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 14: [2023-02-03 16:35:06,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 14: [2023-02-03 16:35:06,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 14: [2023-02-03 16:35:06,980] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 13: [2023-02-03 16:35:06,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 12: [2023-02-03 16:35:06,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 12: [2023-02-03 16:35:06,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 12: [2023-02-03 16:35:06,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 10: [2023-02-03 16:35:06,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 12: [2023-02-03 16:35:06,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 12: [2023-02-03 16:35:06,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 10: [2023-02-03 16:35:06,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 10: [2023-02-03 16:35:06,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:06,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 10: [2023-02-03 16:35:06,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 10: [2023-02-03 16:35:06,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:06,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:06,987] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 14: [2023-02-03 16:35:06,990] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 13: [2023-02-03 16:35:06,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 13: [2023-02-03 16:35:06,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 13: [2023-02-03 16:35:06,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 13: [2023-02-03 16:35:06,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 13: [2023-02-03 16:35:06,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 11: [2023-02-03 16:35:06,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 11: [2023-02-03 16:35:06,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 11: [2023-02-03 16:35:06,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 11: [2023-02-03 16:35:06,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 11: [2023-02-03 16:35:06,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 11: [2023-02-03 16:35:06,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 11: [2023-02-03 16:35:06,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 11: [2023-02-03 16:35:06,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 11: [2023-02-03 16:35:06,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 13: [2023-02-03 16:35:06,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 11: [2023-02-03 16:35:06,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 11: [2023-02-03 16:35:06,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 11: [2023-02-03 16:35:06,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 11: [2023-02-03 16:35:07,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 11: [2023-02-03 16:35:07,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 11: [2023-02-03 16:35:07,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 11: [2023-02-03 16:35:07,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt... 11: [2023-02-03 16:35:07,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 11: [2023-02-03 16:35:07,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 11: [2023-02-03 16:35:07,035] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 11: [2023-02-03 16:35:07,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 11: [2023-02-03 16:35:07,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 11: [2023-02-03 16:35:07,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 11: [2023-02-03 16:35:07,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 11: [2023-02-03 16:35:07,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 11: [2023-02-03 16:35:07,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 11: [2023-02-03 16:35:07,048] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_10-model_00-model_states.pt. 11: [2023-02-03 16:35:07,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 1: [2023-02-03 16:35:07,052] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 1: [2023-02-03 16:35:07,052] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 1: [2023-02-03 16:35:07,052] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 1: [2023-02-03 16:35:07,052] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 1: [2023-02-03 16:35:07,052] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 1: [2023-02-03 16:35:07,052] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 1: [2023-02-03 16:35:07,052] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 1: [2023-02-03 16:35:07,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 1: [2023-02-03 16:35:07,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 1: [2023-02-03 16:35:07,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 1: [2023-02-03 16:35:07,056] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 1: [2023-02-03 16:35:07,056] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 1: [2023-02-03 16:35:07,057] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 1: [2023-02-03 16:35:07,057] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 1: [2023-02-03 16:35:07,057] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 1: [2023-02-03 16:35:07,057] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 11: [2023-02-03 16:35:07,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 11: [2023-02-03 16:35:07,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 11: [2023-02-03 16:35:07,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 11: [2023-02-03 16:35:07,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 11: [2023-02-03 16:35:07,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 1: [2023-02-03 16:35:07,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 1: [2023-02-03 16:35:07,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 1: [2023-02-03 16:35:07,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 1: [2023-02-03 16:35:07,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 1: [2023-02-03 16:35:07,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 6: [2023-02-03 16:35:07,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 1: [2023-02-03 16:35:07,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 1: [2023-02-03 16:35:07,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 6: [2023-02-03 16:35:07,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 1: [2023-02-03 16:35:07,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 6: [2023-02-03 16:35:07,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 6: [2023-02-03 16:35:07,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 6: [2023-02-03 16:35:07,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 6: [2023-02-03 16:35:07,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 6: [2023-02-03 16:35:07,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 6: [2023-02-03 16:35:07,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 6: [2023-02-03 16:35:07,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 6: [2023-02-03 16:35:07,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 6: [2023-02-03 16:35:07,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 6: [2023-02-03 16:35:07,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 6: [2023-02-03 16:35:07,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 6: [2023-02-03 16:35:07,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 6: [2023-02-03 16:35:07,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 6: [2023-02-03 16:35:07,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 1: [2023-02-03 16:35:07,107] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 1: [2023-02-03 16:35:07,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 1: [2023-02-03 16:35:07,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 1: [2023-02-03 16:35:07,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 1: [2023-02-03 16:35:07,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 1: [2023-02-03 16:35:07,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 1: [2023-02-03 16:35:07,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 1: [2023-02-03 16:35:07,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 6: [2023-02-03 16:35:07,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 6: [2023-02-03 16:35:07,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 6: [2023-02-03 16:35:07,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 6: [2023-02-03 16:35:07,139] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 6: [2023-02-03 16:35:07,144] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 6: [2023-02-03 16:35:07,144] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 6: [2023-02-03 16:35:07,144] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 6: [2023-02-03 16:35:07,144] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 6: [2023-02-03 16:35:07,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 6: [2023-02-03 16:35:07,153] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 6: [2023-02-03 16:35:07,153] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 8: [2023-02-03 16:35:07,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 8: [2023-02-03 16:35:07,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 8: [2023-02-03 16:35:07,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 8: [2023-02-03 16:35:07,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 8: [2023-02-03 16:35:07,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 8: [2023-02-03 16:35:07,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 8: [2023-02-03 16:35:07,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 8: [2023-02-03 16:35:07,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 8: [2023-02-03 16:35:07,158] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 8: [2023-02-03 16:35:07,158] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 6: [2023-02-03 16:35:07,159] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 6: [2023-02-03 16:35:07,161] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 8: [2023-02-03 16:35:07,163] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 8: [2023-02-03 16:35:07,163] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 8: [2023-02-03 16:35:07,163] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 8: [2023-02-03 16:35:07,163] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 8: [2023-02-03 16:35:07,163] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 8: [2023-02-03 16:35:07,163] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 6: [2023-02-03 16:35:07,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 6: [2023-02-03 16:35:07,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 6: [2023-02-03 16:35:07,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 4: [2023-02-03 16:35:07,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 4: [2023-02-03 16:35:07,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 4: [2023-02-03 16:35:07,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 4: [2023-02-03 16:35:07,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 4: [2023-02-03 16:35:07,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 4: [2023-02-03 16:35:07,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 4: [2023-02-03 16:35:07,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 4: [2023-02-03 16:35:07,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 4: [2023-02-03 16:35:07,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 4: [2023-02-03 16:35:07,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 4: [2023-02-03 16:35:07,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 4: [2023-02-03 16:35:07,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 4: [2023-02-03 16:35:07,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 4: [2023-02-03 16:35:07,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 4: [2023-02-03 16:35:07,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 4: [2023-02-03 16:35:07,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 9: [2023-02-03 16:35:07,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 3: [2023-02-03 16:35:07,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 3: [2023-02-03 16:35:07,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 3: [2023-02-03 16:35:07,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 3: [2023-02-03 16:35:07,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 3: [2023-02-03 16:35:07,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 9: [2023-02-03 16:35:07,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 3: [2023-02-03 16:35:07,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 3: [2023-02-03 16:35:07,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 3: [2023-02-03 16:35:07,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 9: [2023-02-03 16:35:07,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 9: [2023-02-03 16:35:07,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 9: [2023-02-03 16:35:07,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 9: [2023-02-03 16:35:07,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 9: [2023-02-03 16:35:07,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 9: [2023-02-03 16:35:07,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 9: [2023-02-03 16:35:07,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 9: [2023-02-03 16:35:07,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 3: [2023-02-03 16:35:07,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 3: [2023-02-03 16:35:07,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 3: [2023-02-03 16:35:07,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 3: [2023-02-03 16:35:07,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 3: [2023-02-03 16:35:07,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 3: [2023-02-03 16:35:07,187] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 3: [2023-02-03 16:35:07,187] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 3: [2023-02-03 16:35:07,187] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 12: [2023-02-03 16:35:07,187] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 12: [2023-02-03 16:35:07,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 12: [2023-02-03 16:35:07,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 12: [2023-02-03 16:35:07,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 12: [2023-02-03 16:35:07,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 12: [2023-02-03 16:35:07,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 12: [2023-02-03 16:35:07,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 12: [2023-02-03 16:35:07,189] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 9: [2023-02-03 16:35:07,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 9: [2023-02-03 16:35:07,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 9: [2023-02-03 16:35:07,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 9: [2023-02-03 16:35:07,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 9: [2023-02-03 16:35:07,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 9: [2023-02-03 16:35:07,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 12: [2023-02-03 16:35:07,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 12: [2023-02-03 16:35:07,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 12: [2023-02-03 16:35:07,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 12: [2023-02-03 16:35:07,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 8: [2023-02-03 16:35:07,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 12: [2023-02-03 16:35:07,192] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 12: [2023-02-03 16:35:07,192] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 12: [2023-02-03 16:35:07,192] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 12: [2023-02-03 16:35:07,192] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 0: [2023-02-03 16:35:07,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 0: [2023-02-03 16:35:07,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 8: [2023-02-03 16:35:07,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 2: [2023-02-03 16:35:07,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 0: [2023-02-03 16:35:07,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 0: [2023-02-03 16:35:07,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 2: [2023-02-03 16:35:07,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 0: [2023-02-03 16:35:07,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 2: [2023-02-03 16:35:07,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 2: [2023-02-03 16:35:07,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 2: [2023-02-03 16:35:07,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 2: [2023-02-03 16:35:07,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 2: [2023-02-03 16:35:07,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 2: [2023-02-03 16:35:07,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 0: [2023-02-03 16:35:07,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 0: [2023-02-03 16:35:07,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 0: [2023-02-03 16:35:07,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 5: [2023-02-03 16:35:07,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 5: [2023-02-03 16:35:07,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 5: [2023-02-03 16:35:07,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 5: [2023-02-03 16:35:07,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 5: [2023-02-03 16:35:07,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 5: [2023-02-03 16:35:07,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 5: [2023-02-03 16:35:07,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 5: [2023-02-03 16:35:07,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 2: [2023-02-03 16:35:07,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 0: [2023-02-03 16:35:07,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 2: [2023-02-03 16:35:07,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 2: [2023-02-03 16:35:07,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 2: [2023-02-03 16:35:07,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 2: [2023-02-03 16:35:07,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 5: [2023-02-03 16:35:07,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 2: [2023-02-03 16:35:07,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 2: [2023-02-03 16:35:07,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 2: [2023-02-03 16:35:07,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 5: [2023-02-03 16:35:07,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 5: [2023-02-03 16:35:07,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 5: [2023-02-03 16:35:07,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 5: [2023-02-03 16:35:07,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 5: [2023-02-03 16:35:07,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 5: [2023-02-03 16:35:07,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 5: [2023-02-03 16:35:07,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 0: [2023-02-03 16:35:07,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 0: [2023-02-03 16:35:07,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 0: [2023-02-03 16:35:07,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 0: [2023-02-03 16:35:07,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 0: [2023-02-03 16:35:07,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 0: [2023-02-03 16:35:07,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 0: [2023-02-03 16:35:07,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 15: [2023-02-03 16:35:07,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 9: [2023-02-03 16:35:07,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 15: [2023-02-03 16:35:07,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 15: [2023-02-03 16:35:07,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 15: [2023-02-03 16:35:07,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 15: [2023-02-03 16:35:07,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 15: [2023-02-03 16:35:07,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 15: [2023-02-03 16:35:07,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 8: [2023-02-03 16:35:07,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 15: [2023-02-03 16:35:07,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 13: [2023-02-03 16:35:07,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 15: [2023-02-03 16:35:07,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 8: [2023-02-03 16:35:07,208] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 8: [2023-02-03 16:35:07,208] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 15: [2023-02-03 16:35:07,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:07,208] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 7: [2023-02-03 16:35:07,209] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 7: [2023-02-03 16:35:07,209] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 7: [2023-02-03 16:35:07,209] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 15: [2023-02-03 16:35:07,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 15: [2023-02-03 16:35:07,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 15: [2023-02-03 16:35:07,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 15: [2023-02-03 16:35:07,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:07,209] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 7: [2023-02-03 16:35:07,209] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 15: [2023-02-03 16:35:07,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:07,209] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 15: [2023-02-03 16:35:07,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 13: [2023-02-03 16:35:07,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:07,209] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 13: [2023-02-03 16:35:07,209] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 13: [2023-02-03 16:35:07,210] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 13: [2023-02-03 16:35:07,210] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 13: [2023-02-03 16:35:07,210] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 13: [2023-02-03 16:35:07,210] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 8: [2023-02-03 16:35:07,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 13: [2023-02-03 16:35:07,210] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 13: [2023-02-03 16:35:07,210] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 7: [2023-02-03 16:35:07,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:07,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:07,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 13: [2023-02-03 16:35:07,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 8: [2023-02-03 16:35:07,211] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 8: [2023-02-03 16:35:07,211] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 4: [2023-02-03 16:35:07,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 4: [2023-02-03 16:35:07,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 4: [2023-02-03 16:35:07,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 4: [2023-02-03 16:35:07,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 7: [2023-02-03 16:35:07,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 4: [2023-02-03 16:35:07,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 9: [2023-02-03 16:35:07,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 4: [2023-02-03 16:35:07,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 4: [2023-02-03 16:35:07,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 4: [2023-02-03 16:35:07,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 7: [2023-02-03 16:35:07,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:07,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:07,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:07,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 8: [2023-02-03 16:35:07,214] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 8: [2023-02-03 16:35:07,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 13: [2023-02-03 16:35:07,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 13: [2023-02-03 16:35:07,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 13: [2023-02-03 16:35:07,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 13: [2023-02-03 16:35:07,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 13: [2023-02-03 16:35:07,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 13: [2023-02-03 16:35:07,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 9: [2023-02-03 16:35:07,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 3: [2023-02-03 16:35:07,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 3: [2023-02-03 16:35:07,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 3: [2023-02-03 16:35:07,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 3: [2023-02-03 16:35:07,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 12: [2023-02-03 16:35:07,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 12: [2023-02-03 16:35:07,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 12: [2023-02-03 16:35:07,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 12: [2023-02-03 16:35:07,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 8: [2023-02-03 16:35:07,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 3: [2023-02-03 16:35:07,228] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 8: [2023-02-03 16:35:07,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 8: [2023-02-03 16:35:07,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 4: [2023-02-03 16:35:07,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 3: [2023-02-03 16:35:07,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 3: [2023-02-03 16:35:07,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 3: [2023-02-03 16:35:07,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 4: [2023-02-03 16:35:07,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 4: [2023-02-03 16:35:07,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 9: [2023-02-03 16:35:07,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 3: [2023-02-03 16:35:07,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 4: [2023-02-03 16:35:07,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 4: [2023-02-03 16:35:07,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 3: [2023-02-03 16:35:07,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 4: [2023-02-03 16:35:07,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 4: [2023-02-03 16:35:07,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 3: [2023-02-03 16:35:07,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 4: [2023-02-03 16:35:07,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 14: [2023-02-03 16:35:07,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 9: [2023-02-03 16:35:07,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 9: [2023-02-03 16:35:07,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 3: [2023-02-03 16:35:07,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 8: [2023-02-03 16:35:07,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 8: [2023-02-03 16:35:07,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 8: [2023-02-03 16:35:07,235] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 14: [2023-02-03 16:35:07,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 14: [2023-02-03 16:35:07,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 14: [2023-02-03 16:35:07,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 14: [2023-02-03 16:35:07,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 14: [2023-02-03 16:35:07,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 14: [2023-02-03 16:35:07,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 14: [2023-02-03 16:35:07,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 5: [2023-02-03 16:35:07,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 14: [2023-02-03 16:35:07,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 12: [2023-02-03 16:35:07,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 15: [2023-02-03 16:35:07,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 2: [2023-02-03 16:35:07,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 2: [2023-02-03 16:35:07,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 12: [2023-02-03 16:35:07,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 2: [2023-02-03 16:35:07,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 2: [2023-02-03 16:35:07,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 2: [2023-02-03 16:35:07,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 2: [2023-02-03 16:35:07,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 2: [2023-02-03 16:35:07,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 7: [2023-02-03 16:35:07,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 10: [2023-02-03 16:35:07,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 12: [2023-02-03 16:35:07,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 9: [2023-02-03 16:35:07,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 9: [2023-02-03 16:35:07,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 9: [2023-02-03 16:35:07,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 9: [2023-02-03 16:35:07,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 10: [2023-02-03 16:35:07,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 12: [2023-02-03 16:35:07,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 5: [2023-02-03 16:35:07,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 5: [2023-02-03 16:35:07,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 5: [2023-02-03 16:35:07,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 12: [2023-02-03 16:35:07,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 12: [2023-02-03 16:35:07,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 12: [2023-02-03 16:35:07,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 5: [2023-02-03 16:35:07,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 5: [2023-02-03 16:35:07,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 5: [2023-02-03 16:35:07,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 5: [2023-02-03 16:35:07,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 12: [2023-02-03 16:35:07,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 0: [2023-02-03 16:35:07,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 10: [2023-02-03 16:35:07,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 10: [2023-02-03 16:35:07,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 10: [2023-02-03 16:35:07,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 10: [2023-02-03 16:35:07,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 10: [2023-02-03 16:35:07,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 14: [2023-02-03 16:35:07,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 10: [2023-02-03 16:35:07,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 10: [2023-02-03 16:35:07,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 10: [2023-02-03 16:35:07,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 11: [2023-02-03 16:35:07,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 11: [2023-02-03 16:35:07,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 11: [2023-02-03 16:35:07,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 11: [2023-02-03 16:35:07,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 11: [2023-02-03 16:35:07,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 11: [2023-02-03 16:35:07,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 11: [2023-02-03 16:35:07,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 11: [2023-02-03 16:35:07,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 13: [2023-02-03 16:35:07,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 11: [2023-02-03 16:35:07,241] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 2: [2023-02-03 16:35:07,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 14: [2023-02-03 16:35:07,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 11: [2023-02-03 16:35:07,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 11: [2023-02-03 16:35:07,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 14: [2023-02-03 16:35:07,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 14: [2023-02-03 16:35:07,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 14: [2023-02-03 16:35:07,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 3: [2023-02-03 16:35:07,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 13: [2023-02-03 16:35:07,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 14: [2023-02-03 16:35:07,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 14: [2023-02-03 16:35:07,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 11: [2023-02-03 16:35:07,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 11: [2023-02-03 16:35:07,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 10: [2023-02-03 16:35:07,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 11: [2023-02-03 16:35:07,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 11: [2023-02-03 16:35:07,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 10: [2023-02-03 16:35:07,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 10: [2023-02-03 16:35:07,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 10: [2023-02-03 16:35:07,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 10: [2023-02-03 16:35:07,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 10: [2023-02-03 16:35:07,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 11: [2023-02-03 16:35:07,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt... 7: [2023-02-03 16:35:07,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 7: [2023-02-03 16:35:07,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 15: [2023-02-03 16:35:07,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 15: [2023-02-03 16:35:07,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 15: [2023-02-03 16:35:07,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 3: [2023-02-03 16:35:07,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 15: [2023-02-03 16:35:07,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 3: [2023-02-03 16:35:07,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 3: [2023-02-03 16:35:07,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 15: [2023-02-03 16:35:07,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 15: [2023-02-03 16:35:07,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 2: [2023-02-03 16:35:07,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 2: [2023-02-03 16:35:07,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 7: [2023-02-03 16:35:07,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 9: [2023-02-03 16:35:07,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 2: [2023-02-03 16:35:07,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 13: [2023-02-03 16:35:07,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 9: [2023-02-03 16:35:07,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 9: [2023-02-03 16:35:07,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 12: [2023-02-03 16:35:07,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 5: [2023-02-03 16:35:07,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 5: [2023-02-03 16:35:07,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 7: [2023-02-03 16:35:07,255] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 0: [2023-02-03 16:35:07,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 15: [2023-02-03 16:35:07,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 9: [2023-02-03 16:35:07,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 13: [2023-02-03 16:35:07,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 7: [2023-02-03 16:35:07,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 7: [2023-02-03 16:35:07,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 7: [2023-02-03 16:35:07,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 7: [2023-02-03 16:35:07,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 2: [2023-02-03 16:35:07,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 9: [2023-02-03 16:35:07,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 12: [2023-02-03 16:35:07,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 0: [2023-02-03 16:35:07,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 9: [2023-02-03 16:35:07,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 5: [2023-02-03 16:35:07,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 2: [2023-02-03 16:35:07,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 2: [2023-02-03 16:35:07,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 12: [2023-02-03 16:35:07,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 12: [2023-02-03 16:35:07,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 7: [2023-02-03 16:35:07,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 15: [2023-02-03 16:35:07,259] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 5: [2023-02-03 16:35:07,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 5: [2023-02-03 16:35:07,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 5: [2023-02-03 16:35:07,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 5: [2023-02-03 16:35:07,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 5: [2023-02-03 16:35:07,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 7: [2023-02-03 16:35:07,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 2: [2023-02-03 16:35:07,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 2: [2023-02-03 16:35:07,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 15: [2023-02-03 16:35:07,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 15: [2023-02-03 16:35:07,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 15: [2023-02-03 16:35:07,262] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 15: [2023-02-03 16:35:07,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 13: [2023-02-03 16:35:07,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 15: [2023-02-03 16:35:07,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 13: [2023-02-03 16:35:07,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 13: [2023-02-03 16:35:07,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 13: [2023-02-03 16:35:07,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 13: [2023-02-03 16:35:07,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 0: [2023-02-03 16:35:07,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 0: [2023-02-03 16:35:07,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 0: [2023-02-03 16:35:07,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 0: [2023-02-03 16:35:07,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 0: [2023-02-03 16:35:07,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 0: [2023-02-03 16:35:07,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 14: [2023-02-03 16:35:07,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 13: [2023-02-03 16:35:07,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 10: [2023-02-03 16:35:07,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 10: [2023-02-03 16:35:07,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 15: [2023-02-03 16:35:07,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 7: [2023-02-03 16:35:07,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 0: [2023-02-03 16:35:07,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 15: [2023-02-03 16:35:07,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 7: [2023-02-03 16:35:07,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 7: [2023-02-03 16:35:07,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 11: [2023-02-03 16:35:07,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 11: [2023-02-03 16:35:07,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 7: [2023-02-03 16:35:07,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 7: [2023-02-03 16:35:07,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 11: [2023-02-03 16:35:07,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 14: [2023-02-03 16:35:07,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 14: [2023-02-03 16:35:07,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 10: [2023-02-03 16:35:07,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 10: [2023-02-03 16:35:07,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 10: [2023-02-03 16:35:07,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 13: [2023-02-03 16:35:07,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 13: [2023-02-03 16:35:07,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 13: [2023-02-03 16:35:07,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 13: [2023-02-03 16:35:07,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 0: [2023-02-03 16:35:07,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 11: [2023-02-03 16:35:07,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 11: [2023-02-03 16:35:07,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 13: [2023-02-03 16:35:07,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 0: [2023-02-03 16:35:07,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 0: [2023-02-03 16:35:07,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 11: [2023-02-03 16:35:07,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 0: [2023-02-03 16:35:07,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 13: [2023-02-03 16:35:07,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 0: [2023-02-03 16:35:07,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 0: [2023-02-03 16:35:07,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 11: [2023-02-03 16:35:07,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 10: [2023-02-03 16:35:07,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 10: [2023-02-03 16:35:07,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 10: [2023-02-03 16:35:07,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 10: [2023-02-03 16:35:07,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 11: [2023-02-03 16:35:07,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 11: [2023-02-03 16:35:07,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 11: [2023-02-03 16:35:07,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 14: [2023-02-03 16:35:07,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 14: [2023-02-03 16:35:07,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 14: [2023-02-03 16:35:07,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 14: [2023-02-03 16:35:07,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 14: [2023-02-03 16:35:07,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 14: [2023-02-03 16:35:07,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 14: [2023-02-03 16:35:07,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 10: [2023-02-03 16:35:07,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 11: [2023-02-03 16:35:07,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 11: [2023-02-03 16:35:07,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 11: [2023-02-03 16:35:07,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 10: [2023-02-03 16:35:07,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_11-model_00-model_states.pt. 10: [2023-02-03 16:35:07,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 11: [2023-02-03 16:35:07,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 11: [2023-02-03 16:35:07,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 10: [2023-02-03 16:35:07,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 10: [2023-02-03 16:35:07,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 10: [2023-02-03 16:35:07,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 14: [2023-02-03 16:35:07,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 14: [2023-02-03 16:35:07,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 14: [2023-02-03 16:35:07,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 14: [2023-02-03 16:35:07,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 11: [2023-02-03 16:35:07,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 14: [2023-02-03 16:35:07,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 14: [2023-02-03 16:35:07,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 10: [2023-02-03 16:35:07,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 4: [2023-02-03 16:35:07,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 4: [2023-02-03 16:35:07,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 4: [2023-02-03 16:35:07,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 4: [2023-02-03 16:35:07,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 4: [2023-02-03 16:35:07,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 4: [2023-02-03 16:35:07,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 4: [2023-02-03 16:35:07,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 4: [2023-02-03 16:35:07,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 4: [2023-02-03 16:35:07,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 4: [2023-02-03 16:35:07,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 4: [2023-02-03 16:35:07,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 4: [2023-02-03 16:35:07,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 4: [2023-02-03 16:35:07,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 4: [2023-02-03 16:35:07,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 4: [2023-02-03 16:35:07,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 4: [2023-02-03 16:35:07,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 4: [2023-02-03 16:35:07,501] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 4: [2023-02-03 16:35:07,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 4: [2023-02-03 16:35:07,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 4: [2023-02-03 16:35:07,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 4: [2023-02-03 16:35:07,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 4: [2023-02-03 16:35:07,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 4: [2023-02-03 16:35:07,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 4: [2023-02-03 16:35:07,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 4: [2023-02-03 16:35:07,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,524] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 9: [2023-02-03 16:35:07,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 6: [2023-02-03 16:35:07,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 6: [2023-02-03 16:35:07,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 4: [2023-02-03 16:35:07,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 6: [2023-02-03 16:35:07,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 6: [2023-02-03 16:35:07,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 6: [2023-02-03 16:35:07,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 6: [2023-02-03 16:35:07,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 9: [2023-02-03 16:35:07,527] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 9: [2023-02-03 16:35:07,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 12: [2023-02-03 16:35:07,527] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 9: [2023-02-03 16:35:07,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 9: [2023-02-03 16:35:07,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 9: [2023-02-03 16:35:07,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 9: [2023-02-03 16:35:07,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 9: [2023-02-03 16:35:07,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 9: [2023-02-03 16:35:07,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 4: [2023-02-03 16:35:07,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 4: [2023-02-03 16:35:07,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 12: [2023-02-03 16:35:07,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 9: [2023-02-03 16:35:07,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 12: [2023-02-03 16:35:07,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 12: [2023-02-03 16:35:07,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 12: [2023-02-03 16:35:07,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 12: [2023-02-03 16:35:07,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 12: [2023-02-03 16:35:07,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 12: [2023-02-03 16:35:07,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 12: [2023-02-03 16:35:07,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 12: [2023-02-03 16:35:07,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 12: [2023-02-03 16:35:07,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 9: [2023-02-03 16:35:07,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 15: [2023-02-03 16:35:07,531] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 8: [2023-02-03 16:35:07,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 15: [2023-02-03 16:35:07,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 4: [2023-02-03 16:35:07,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 15: [2023-02-03 16:35:07,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 15: [2023-02-03 16:35:07,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 15: [2023-02-03 16:35:07,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 15: [2023-02-03 16:35:07,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 4: [2023-02-03 16:35:07,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 15: [2023-02-03 16:35:07,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 7: [2023-02-03 16:35:07,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 15: [2023-02-03 16:35:07,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 5: [2023-02-03 16:35:07,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 7: [2023-02-03 16:35:07,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 7: [2023-02-03 16:35:07,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 7: [2023-02-03 16:35:07,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 9: [2023-02-03 16:35:07,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 7: [2023-02-03 16:35:07,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 5: [2023-02-03 16:35:07,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 5: [2023-02-03 16:35:07,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 5: [2023-02-03 16:35:07,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 9: [2023-02-03 16:35:07,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 7: [2023-02-03 16:35:07,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 7: [2023-02-03 16:35:07,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 5: [2023-02-03 16:35:07,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 12: [2023-02-03 16:35:07,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 5: [2023-02-03 16:35:07,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 5: [2023-02-03 16:35:07,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 8: [2023-02-03 16:35:07,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 7: [2023-02-03 16:35:07,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 5: [2023-02-03 16:35:07,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 9: [2023-02-03 16:35:07,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 9: [2023-02-03 16:35:07,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 9: [2023-02-03 16:35:07,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 13: [2023-02-03 16:35:07,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 8: [2023-02-03 16:35:07,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 8: [2023-02-03 16:35:07,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 8: [2023-02-03 16:35:07,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 8: [2023-02-03 16:35:07,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 8: [2023-02-03 16:35:07,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 15: [2023-02-03 16:35:07,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 8: [2023-02-03 16:35:07,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 12: [2023-02-03 16:35:07,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 12: [2023-02-03 16:35:07,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 12: [2023-02-03 16:35:07,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 8: [2023-02-03 16:35:07,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 12: [2023-02-03 16:35:07,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 8: [2023-02-03 16:35:07,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 13: [2023-02-03 16:35:07,535] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 7: [2023-02-03 16:35:07,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 7: [2023-02-03 16:35:07,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 15: [2023-02-03 16:35:07,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 15: [2023-02-03 16:35:07,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 15: [2023-02-03 16:35:07,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 7: [2023-02-03 16:35:07,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 15: [2023-02-03 16:35:07,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 5: [2023-02-03 16:35:07,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 13: [2023-02-03 16:35:07,535] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 0: [2023-02-03 16:35:07,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 0: [2023-02-03 16:35:07,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 0: [2023-02-03 16:35:07,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 0: [2023-02-03 16:35:07,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 0: [2023-02-03 16:35:07,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 13: [2023-02-03 16:35:07,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 4: [2023-02-03 16:35:07,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 0: [2023-02-03 16:35:07,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 13: [2023-02-03 16:35:07,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 13: [2023-02-03 16:35:07,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 13: [2023-02-03 16:35:07,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 0: [2023-02-03 16:35:07,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 13: [2023-02-03 16:35:07,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 0: [2023-02-03 16:35:07,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 15: [2023-02-03 16:35:07,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 5: [2023-02-03 16:35:07,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 15: [2023-02-03 16:35:07,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 15: [2023-02-03 16:35:07,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 13: [2023-02-03 16:35:07,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 5: [2023-02-03 16:35:07,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 5: [2023-02-03 16:35:07,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 5: [2023-02-03 16:35:07,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 5: [2023-02-03 16:35:07,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 7: [2023-02-03 16:35:07,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 5: [2023-02-03 16:35:07,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 1: [2023-02-03 16:35:07,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 5: [2023-02-03 16:35:07,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 4: [2023-02-03 16:35:07,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 13: [2023-02-03 16:35:07,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 7: [2023-02-03 16:35:07,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 1: [2023-02-03 16:35:07,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 1: [2023-02-03 16:35:07,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 1: [2023-02-03 16:35:07,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 1: [2023-02-03 16:35:07,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 1: [2023-02-03 16:35:07,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 1: [2023-02-03 16:35:07,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 14: [2023-02-03 16:35:07,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 7: [2023-02-03 16:35:07,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 7: [2023-02-03 16:35:07,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 7: [2023-02-03 16:35:07,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 1: [2023-02-03 16:35:07,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 14: [2023-02-03 16:35:07,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 14: [2023-02-03 16:35:07,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 14: [2023-02-03 16:35:07,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 14: [2023-02-03 16:35:07,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 14: [2023-02-03 16:35:07,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 14: [2023-02-03 16:35:07,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 14: [2023-02-03 16:35:07,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 14: [2023-02-03 16:35:07,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 8: [2023-02-03 16:35:07,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 8: [2023-02-03 16:35:07,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 8: [2023-02-03 16:35:07,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 1: [2023-02-03 16:35:07,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 8: [2023-02-03 16:35:07,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 8: [2023-02-03 16:35:07,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 8: [2023-02-03 16:35:07,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 2: [2023-02-03 16:35:07,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 2: [2023-02-03 16:35:07,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 2: [2023-02-03 16:35:07,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 1: [2023-02-03 16:35:07,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 1: [2023-02-03 16:35:07,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 1: [2023-02-03 16:35:07,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 13: [2023-02-03 16:35:07,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 2: [2023-02-03 16:35:07,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 2: [2023-02-03 16:35:07,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 2: [2023-02-03 16:35:07,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 2: [2023-02-03 16:35:07,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 2: [2023-02-03 16:35:07,542] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 1: [2023-02-03 16:35:07,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 1: [2023-02-03 16:35:07,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 13: [2023-02-03 16:35:07,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 14: [2023-02-03 16:35:07,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 13: [2023-02-03 16:35:07,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 1: [2023-02-03 16:35:07,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 13: [2023-02-03 16:35:07,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 13: [2023-02-03 16:35:07,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 13: [2023-02-03 16:35:07,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 1: [2023-02-03 16:35:07,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 0: [2023-02-03 16:35:07,543] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 0: [2023-02-03 16:35:07,543] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 10: [2023-02-03 16:35:07,543] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 10: [2023-02-03 16:35:07,543] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 0: [2023-02-03 16:35:07,543] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 0: [2023-02-03 16:35:07,543] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 0: [2023-02-03 16:35:07,543] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 0: [2023-02-03 16:35:07,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 0: [2023-02-03 16:35:07,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 0: [2023-02-03 16:35:07,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 2: [2023-02-03 16:35:07,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 2: [2023-02-03 16:35:07,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 2: [2023-02-03 16:35:07,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 2: [2023-02-03 16:35:07,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 10: [2023-02-03 16:35:07,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 10: [2023-02-03 16:35:07,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 10: [2023-02-03 16:35:07,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 10: [2023-02-03 16:35:07,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 10: [2023-02-03 16:35:07,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 2: [2023-02-03 16:35:07,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 2: [2023-02-03 16:35:07,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 2: [2023-02-03 16:35:07,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 10: [2023-02-03 16:35:07,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 14: [2023-02-03 16:35:07,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 2: [2023-02-03 16:35:07,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 10: [2023-02-03 16:35:07,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 10: [2023-02-03 16:35:07,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 14: [2023-02-03 16:35:07,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 14: [2023-02-03 16:35:07,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 14: [2023-02-03 16:35:07,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 14: [2023-02-03 16:35:07,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 14: [2023-02-03 16:35:07,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 11: [2023-02-03 16:35:07,547] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 11: [2023-02-03 16:35:07,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 11: [2023-02-03 16:35:07,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 11: [2023-02-03 16:35:07,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 11: [2023-02-03 16:35:07,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 11: [2023-02-03 16:35:07,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 11: [2023-02-03 16:35:07,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 11: [2023-02-03 16:35:07,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 3: [2023-02-03 16:35:07,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 3: [2023-02-03 16:35:07,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 3: [2023-02-03 16:35:07,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 3: [2023-02-03 16:35:07,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 3: [2023-02-03 16:35:07,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 3: [2023-02-03 16:35:07,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 3: [2023-02-03 16:35:07,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 3: [2023-02-03 16:35:07,550] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 10: [2023-02-03 16:35:07,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 10: [2023-02-03 16:35:07,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 10: [2023-02-03 16:35:07,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 10: [2023-02-03 16:35:07,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 10: [2023-02-03 16:35:07,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 10: [2023-02-03 16:35:07,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 11: [2023-02-03 16:35:07,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 6: [2023-02-03 16:35:07,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 11: [2023-02-03 16:35:07,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 11: [2023-02-03 16:35:07,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 11: [2023-02-03 16:35:07,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 11: [2023-02-03 16:35:07,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 3: [2023-02-03 16:35:07,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 11: [2023-02-03 16:35:07,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 11: [2023-02-03 16:35:07,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 11: [2023-02-03 16:35:07,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 3: [2023-02-03 16:35:07,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 3: [2023-02-03 16:35:07,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 3: [2023-02-03 16:35:07,554] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 3: [2023-02-03 16:35:07,554] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 3: [2023-02-03 16:35:07,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 3: [2023-02-03 16:35:07,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 3: [2023-02-03 16:35:07,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt... 9: [2023-02-03 16:35:07,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 8: [2023-02-03 16:35:07,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 12: [2023-02-03 16:35:07,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 12: [2023-02-03 16:35:07,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 9: [2023-02-03 16:35:07,560] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,561] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 14: [2023-02-03 16:35:07,562] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,562] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 15: [2023-02-03 16:35:07,565] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 12: [2023-02-03 16:35:07,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 5: [2023-02-03 16:35:07,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 13: [2023-02-03 16:35:07,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 13: [2023-02-03 16:35:07,568] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 7: [2023-02-03 16:35:07,568] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 7: [2023-02-03 16:35:07,568] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 7: [2023-02-03 16:35:07,568] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 6: [2023-02-03 16:35:07,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 8: [2023-02-03 16:35:07,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 1: [2023-02-03 16:35:07,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 9: [2023-02-03 16:35:07,572] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 12: [2023-02-03 16:35:07,572] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 15: [2023-02-03 16:35:07,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 5: [2023-02-03 16:35:07,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 12: [2023-02-03 16:35:07,573] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 15: [2023-02-03 16:35:07,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 15: [2023-02-03 16:35:07,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 15: [2023-02-03 16:35:07,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 12: [2023-02-03 16:35:07,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 8: [2023-02-03 16:35:07,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 9: [2023-02-03 16:35:07,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 5: [2023-02-03 16:35:07,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 5: [2023-02-03 16:35:07,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 5: [2023-02-03 16:35:07,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 5: [2023-02-03 16:35:07,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 5: [2023-02-03 16:35:07,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 5: [2023-02-03 16:35:07,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 15: [2023-02-03 16:35:07,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 15: [2023-02-03 16:35:07,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 15: [2023-02-03 16:35:07,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 10: [2023-02-03 16:35:07,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 10: [2023-02-03 16:35:07,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,576] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 7: [2023-02-03 16:35:07,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 12: [2023-02-03 16:35:07,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 12: [2023-02-03 16:35:07,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 9: [2023-02-03 16:35:07,577] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 12: [2023-02-03 16:35:07,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 12: [2023-02-03 16:35:07,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 1: [2023-02-03 16:35:07,578] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 1: [2023-02-03 16:35:07,578] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 9: [2023-02-03 16:35:07,579] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 15: [2023-02-03 16:35:07,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 9: [2023-02-03 16:35:07,579] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 14: [2023-02-03 16:35:07,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 1: [2023-02-03 16:35:07,579] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 12: [2023-02-03 16:35:07,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 1: [2023-02-03 16:35:07,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 1: [2023-02-03 16:35:07,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 1: [2023-02-03 16:35:07,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 1: [2023-02-03 16:35:07,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 7: [2023-02-03 16:35:07,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 7: [2023-02-03 16:35:07,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 7: [2023-02-03 16:35:07,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 7: [2023-02-03 16:35:07,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 7: [2023-02-03 16:35:07,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 13: [2023-02-03 16:35:07,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 13: [2023-02-03 16:35:07,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 14: [2023-02-03 16:35:07,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 2: [2023-02-03 16:35:07,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 2: [2023-02-03 16:35:07,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 2: [2023-02-03 16:35:07,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 7: [2023-02-03 16:35:07,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 14: [2023-02-03 16:35:07,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 2: [2023-02-03 16:35:07,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 7: [2023-02-03 16:35:07,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 1: [2023-02-03 16:35:07,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 5: [2023-02-03 16:35:07,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 3: [2023-02-03 16:35:07,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 3: [2023-02-03 16:35:07,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 8: [2023-02-03 16:35:07,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 2: [2023-02-03 16:35:07,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 8: [2023-02-03 16:35:07,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 2: [2023-02-03 16:35:07,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 2: [2023-02-03 16:35:07,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 2: [2023-02-03 16:35:07,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 8: [2023-02-03 16:35:07,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 11: [2023-02-03 16:35:07,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 11: [2023-02-03 16:35:07,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 11: [2023-02-03 16:35:07,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 9: [2023-02-03 16:35:07,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 9: [2023-02-03 16:35:07,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 10: [2023-02-03 16:35:07,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 13: [2023-02-03 16:35:07,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 12: [2023-02-03 16:35:07,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 9: [2023-02-03 16:35:07,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 13: [2023-02-03 16:35:07,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 13: [2023-02-03 16:35:07,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 13: [2023-02-03 16:35:07,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 13: [2023-02-03 16:35:07,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 8: [2023-02-03 16:35:07,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 8: [2023-02-03 16:35:07,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 8: [2023-02-03 16:35:07,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 6: [2023-02-03 16:35:07,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 10: [2023-02-03 16:35:07,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 6: [2023-02-03 16:35:07,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 10: [2023-02-03 16:35:07,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 15: [2023-02-03 16:35:07,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 15: [2023-02-03 16:35:07,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 15: [2023-02-03 16:35:07,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 6: [2023-02-03 16:35:07,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 6: [2023-02-03 16:35:07,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 9: [2023-02-03 16:35:07,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 15: [2023-02-03 16:35:07,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 13: [2023-02-03 16:35:07,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 5: [2023-02-03 16:35:07,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 5: [2023-02-03 16:35:07,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 5: [2023-02-03 16:35:07,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 5: [2023-02-03 16:35:07,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 15: [2023-02-03 16:35:07,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 12: [2023-02-03 16:35:07,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 15: [2023-02-03 16:35:07,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 15: [2023-02-03 16:35:07,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 3: [2023-02-03 16:35:07,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 0: [2023-02-03 16:35:07,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 11: [2023-02-03 16:35:07,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 11: [2023-02-03 16:35:07,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 11: [2023-02-03 16:35:07,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 1: [2023-02-03 16:35:07,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 1: [2023-02-03 16:35:07,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 1: [2023-02-03 16:35:07,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 3: [2023-02-03 16:35:07,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 3: [2023-02-03 16:35:07,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 14: [2023-02-03 16:35:07,596] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 14: [2023-02-03 16:35:07,596] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 1: [2023-02-03 16:35:07,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 5: [2023-02-03 16:35:07,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 2: [2023-02-03 16:35:07,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 5: [2023-02-03 16:35:07,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 12: [2023-02-03 16:35:07,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 5: [2023-02-03 16:35:07,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 0: [2023-02-03 16:35:07,597] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 7: [2023-02-03 16:35:07,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 10: [2023-02-03 16:35:07,597] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 12: [2023-02-03 16:35:07,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 9: [2023-02-03 16:35:07,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 9: [2023-02-03 16:35:07,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 1: [2023-02-03 16:35:07,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 7: [2023-02-03 16:35:07,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 2: [2023-02-03 16:35:07,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 12: [2023-02-03 16:35:07,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 1: [2023-02-03 16:35:07,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 14: [2023-02-03 16:35:07,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 2: [2023-02-03 16:35:07,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 1: [2023-02-03 16:35:07,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 2: [2023-02-03 16:35:07,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 14: [2023-02-03 16:35:07,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 14: [2023-02-03 16:35:07,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 14: [2023-02-03 16:35:07,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 14: [2023-02-03 16:35:07,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 10: [2023-02-03 16:35:07,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,601] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,601] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 7: [2023-02-03 16:35:07,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 7: [2023-02-03 16:35:07,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 7: [2023-02-03 16:35:07,601] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 0: [2023-02-03 16:35:07,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 0: [2023-02-03 16:35:07,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 0: [2023-02-03 16:35:07,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 0: [2023-02-03 16:35:07,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 0: [2023-02-03 16:35:07,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 0: [2023-02-03 16:35:07,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 10: [2023-02-03 16:35:07,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 10: [2023-02-03 16:35:07,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 10: [2023-02-03 16:35:07,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 10: [2023-02-03 16:35:07,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 11: [2023-02-03 16:35:07,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 13: [2023-02-03 16:35:07,605] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,605] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 9: [2023-02-03 16:35:07,605] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 9: [2023-02-03 16:35:07,605] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 9: [2023-02-03 16:35:07,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 2: [2023-02-03 16:35:07,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 8: [2023-02-03 16:35:07,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 2: [2023-02-03 16:35:07,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 8: [2023-02-03 16:35:07,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 8: [2023-02-03 16:35:07,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 2: [2023-02-03 16:35:07,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 8: [2023-02-03 16:35:07,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_12-model_00-model_states.pt. 2: [2023-02-03 16:35:07,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 13: [2023-02-03 16:35:07,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 8: [2023-02-03 16:35:07,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 8: [2023-02-03 16:35:07,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 0: [2023-02-03 16:35:07,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 14: [2023-02-03 16:35:07,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 13: [2023-02-03 16:35:07,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 13: [2023-02-03 16:35:07,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 13: [2023-02-03 16:35:07,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 13: [2023-02-03 16:35:07,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 14: [2023-02-03 16:35:07,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 10: [2023-02-03 16:35:07,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 0: [2023-02-03 16:35:07,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 14: [2023-02-03 16:35:07,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 14: [2023-02-03 16:35:07,618] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 14: [2023-02-03 16:35:07,618] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 8: [2023-02-03 16:35:07,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 10: [2023-02-03 16:35:07,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 0: [2023-02-03 16:35:07,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 10: [2023-02-03 16:35:07,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 10: [2023-02-03 16:35:07,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 10: [2023-02-03 16:35:07,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 0: [2023-02-03 16:35:07,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 0: [2023-02-03 16:35:07,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 0: [2023-02-03 16:35:07,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 0: [2023-02-03 16:35:07,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 0: [2023-02-03 16:35:07,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 4: [2023-02-03 16:35:07,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 4: [2023-02-03 16:35:07,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 4: [2023-02-03 16:35:07,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 4: [2023-02-03 16:35:07,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 4: [2023-02-03 16:35:07,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 4: [2023-02-03 16:35:07,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 4: [2023-02-03 16:35:07,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 4: [2023-02-03 16:35:07,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 4: [2023-02-03 16:35:07,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 4: [2023-02-03 16:35:07,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 4: [2023-02-03 16:35:07,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 4: [2023-02-03 16:35:07,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 4: [2023-02-03 16:35:07,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 4: [2023-02-03 16:35:07,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 4: [2023-02-03 16:35:07,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 4: [2023-02-03 16:35:07,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 4: [2023-02-03 16:35:07,852] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 4: [2023-02-03 16:35:07,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 4: [2023-02-03 16:35:07,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 4: [2023-02-03 16:35:07,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 4: [2023-02-03 16:35:07,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 4: [2023-02-03 16:35:07,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 4: [2023-02-03 16:35:07,861] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 14: [2023-02-03 16:35:07,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 9: [2023-02-03 16:35:07,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 9: [2023-02-03 16:35:07,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 14: [2023-02-03 16:35:07,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 9: [2023-02-03 16:35:07,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 9: [2023-02-03 16:35:07,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 9: [2023-02-03 16:35:07,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 9: [2023-02-03 16:35:07,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 9: [2023-02-03 16:35:07,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 8: [2023-02-03 16:35:07,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 14: [2023-02-03 16:35:07,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 9: [2023-02-03 16:35:07,865] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 14: [2023-02-03 16:35:07,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 9: [2023-02-03 16:35:07,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 14: [2023-02-03 16:35:07,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,865] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 8: [2023-02-03 16:35:07,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 14: [2023-02-03 16:35:07,865] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 14: [2023-02-03 16:35:07,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 14: [2023-02-03 16:35:07,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 15: [2023-02-03 16:35:07,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 14: [2023-02-03 16:35:07,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 14: [2023-02-03 16:35:07,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 9: [2023-02-03 16:35:07,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 6: [2023-02-03 16:35:07,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 15: [2023-02-03 16:35:07,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 15: [2023-02-03 16:35:07,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 15: [2023-02-03 16:35:07,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 8: [2023-02-03 16:35:07,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 15: [2023-02-03 16:35:07,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 15: [2023-02-03 16:35:07,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 15: [2023-02-03 16:35:07,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 8: [2023-02-03 16:35:07,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 15: [2023-02-03 16:35:07,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 8: [2023-02-03 16:35:07,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 8: [2023-02-03 16:35:07,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 8: [2023-02-03 16:35:07,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 8: [2023-02-03 16:35:07,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 8: [2023-02-03 16:35:07,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 8: [2023-02-03 16:35:07,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 14: [2023-02-03 16:35:07,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 6: [2023-02-03 16:35:07,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 9: [2023-02-03 16:35:07,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 1: [2023-02-03 16:35:07,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 15: [2023-02-03 16:35:07,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 6: [2023-02-03 16:35:07,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 15: [2023-02-03 16:35:07,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 6: [2023-02-03 16:35:07,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 4: [2023-02-03 16:35:07,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 6: [2023-02-03 16:35:07,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 1: [2023-02-03 16:35:07,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 1: [2023-02-03 16:35:07,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 1: [2023-02-03 16:35:07,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 1: [2023-02-03 16:35:07,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 1: [2023-02-03 16:35:07,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 1: [2023-02-03 16:35:07,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 1: [2023-02-03 16:35:07,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 15: [2023-02-03 16:35:07,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 15: [2023-02-03 16:35:07,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 15: [2023-02-03 16:35:07,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 15: [2023-02-03 16:35:07,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 9: [2023-02-03 16:35:07,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 15: [2023-02-03 16:35:07,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 9: [2023-02-03 16:35:07,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 14: [2023-02-03 16:35:07,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 15: [2023-02-03 16:35:07,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 2: [2023-02-03 16:35:07,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 14: [2023-02-03 16:35:07,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 14: [2023-02-03 16:35:07,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 9: [2023-02-03 16:35:07,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 9: [2023-02-03 16:35:07,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 9: [2023-02-03 16:35:07,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 14: [2023-02-03 16:35:07,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 14: [2023-02-03 16:35:07,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 2: [2023-02-03 16:35:07,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 2: [2023-02-03 16:35:07,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 2: [2023-02-03 16:35:07,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 2: [2023-02-03 16:35:07,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 2: [2023-02-03 16:35:07,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 2: [2023-02-03 16:35:07,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 2: [2023-02-03 16:35:07,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 1: [2023-02-03 16:35:07,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 8: [2023-02-03 16:35:07,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 3: [2023-02-03 16:35:07,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 3: [2023-02-03 16:35:07,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 4: [2023-02-03 16:35:07,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 3: [2023-02-03 16:35:07,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 8: [2023-02-03 16:35:07,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 3: [2023-02-03 16:35:07,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 3: [2023-02-03 16:35:07,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 1: [2023-02-03 16:35:07,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 1: [2023-02-03 16:35:07,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 8: [2023-02-03 16:35:07,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 1: [2023-02-03 16:35:07,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 8: [2023-02-03 16:35:07,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 1: [2023-02-03 16:35:07,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 1: [2023-02-03 16:35:07,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 1: [2023-02-03 16:35:07,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 8: [2023-02-03 16:35:07,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 8: [2023-02-03 16:35:07,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 1: [2023-02-03 16:35:07,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 2: [2023-02-03 16:35:07,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 4: [2023-02-03 16:35:07,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 2: [2023-02-03 16:35:07,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 2: [2023-02-03 16:35:07,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 4: [2023-02-03 16:35:07,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 3: [2023-02-03 16:35:07,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 4: [2023-02-03 16:35:07,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 2: [2023-02-03 16:35:07,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 2: [2023-02-03 16:35:07,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 2: [2023-02-03 16:35:07,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 2: [2023-02-03 16:35:07,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 2: [2023-02-03 16:35:07,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 4: [2023-02-03 16:35:07,879] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 4: [2023-02-03 16:35:07,879] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 4: [2023-02-03 16:35:07,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 4: [2023-02-03 16:35:07,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 9: [2023-02-03 16:35:07,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 14: [2023-02-03 16:35:07,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 14: [2023-02-03 16:35:07,897] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 9: [2023-02-03 16:35:07,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 15: [2023-02-03 16:35:07,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 8: [2023-02-03 16:35:07,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 8: [2023-02-03 16:35:07,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 9: [2023-02-03 16:35:07,901] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 11: [2023-02-03 16:35:07,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 11: [2023-02-03 16:35:07,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 11: [2023-02-03 16:35:07,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 11: [2023-02-03 16:35:07,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 11: [2023-02-03 16:35:07,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 11: [2023-02-03 16:35:07,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 11: [2023-02-03 16:35:07,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 11: [2023-02-03 16:35:07,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 11: [2023-02-03 16:35:07,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 15: [2023-02-03 16:35:07,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 11: [2023-02-03 16:35:07,906] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 1: [2023-02-03 16:35:07,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 1: [2023-02-03 16:35:07,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 14: [2023-02-03 16:35:07,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 14: [2023-02-03 16:35:07,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 3: [2023-02-03 16:35:07,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 1: [2023-02-03 16:35:07,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 1: [2023-02-03 16:35:07,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 15: [2023-02-03 16:35:07,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 15: [2023-02-03 16:35:07,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 15: [2023-02-03 16:35:07,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 15: [2023-02-03 16:35:07,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 3: [2023-02-03 16:35:07,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 3: [2023-02-03 16:35:07,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 3: [2023-02-03 16:35:07,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 2: [2023-02-03 16:35:07,910] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 2: [2023-02-03 16:35:07,910] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 9: [2023-02-03 16:35:07,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 1: [2023-02-03 16:35:07,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 1: [2023-02-03 16:35:07,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 1: [2023-02-03 16:35:07,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 1: [2023-02-03 16:35:07,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 14: [2023-02-03 16:35:07,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 2: [2023-02-03 16:35:07,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 2: [2023-02-03 16:35:07,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 8: [2023-02-03 16:35:07,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 8: [2023-02-03 16:35:07,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 15: [2023-02-03 16:35:07,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 15: [2023-02-03 16:35:07,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 15: [2023-02-03 16:35:07,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 9: [2023-02-03 16:35:07,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 2: [2023-02-03 16:35:07,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 8: [2023-02-03 16:35:07,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 8: [2023-02-03 16:35:07,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 3: [2023-02-03 16:35:07,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 9: [2023-02-03 16:35:07,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 14: [2023-02-03 16:35:07,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 2: [2023-02-03 16:35:07,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 2: [2023-02-03 16:35:07,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 8: [2023-02-03 16:35:07,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 8: [2023-02-03 16:35:07,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 8: [2023-02-03 16:35:07,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 8: [2023-02-03 16:35:07,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 6: [2023-02-03 16:35:07,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 15: [2023-02-03 16:35:07,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 14: [2023-02-03 16:35:07,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 9: [2023-02-03 16:35:07,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 9: [2023-02-03 16:35:07,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 9: [2023-02-03 16:35:07,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 9: [2023-02-03 16:35:07,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 3: [2023-02-03 16:35:07,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 2: [2023-02-03 16:35:07,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 3: [2023-02-03 16:35:07,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 3: [2023-02-03 16:35:07,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 14: [2023-02-03 16:35:07,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 3: [2023-02-03 16:35:07,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 14: [2023-02-03 16:35:07,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 14: [2023-02-03 16:35:07,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 14: [2023-02-03 16:35:07,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 15: [2023-02-03 16:35:07,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 15: [2023-02-03 16:35:07,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 3: [2023-02-03 16:35:07,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 15: [2023-02-03 16:35:07,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 2: [2023-02-03 16:35:07,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 15: [2023-02-03 16:35:07,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 3: [2023-02-03 16:35:07,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 3: [2023-02-03 16:35:07,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 1: [2023-02-03 16:35:07,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 1: [2023-02-03 16:35:07,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 9: [2023-02-03 16:35:07,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 2: [2023-02-03 16:35:07,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 1: [2023-02-03 16:35:07,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 6: [2023-02-03 16:35:07,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 15: [2023-02-03 16:35:07,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 1: [2023-02-03 16:35:07,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 15: [2023-02-03 16:35:07,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 6: [2023-02-03 16:35:07,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 6: [2023-02-03 16:35:07,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 3: [2023-02-03 16:35:07,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 6: [2023-02-03 16:35:07,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 12: [2023-02-03 16:35:07,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 12: [2023-02-03 16:35:07,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 12: [2023-02-03 16:35:07,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 5: [2023-02-03 16:35:07,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 5: [2023-02-03 16:35:07,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 12: [2023-02-03 16:35:07,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 12: [2023-02-03 16:35:07,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 12: [2023-02-03 16:35:07,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 12: [2023-02-03 16:35:07,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 5: [2023-02-03 16:35:07,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 5: [2023-02-03 16:35:07,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 5: [2023-02-03 16:35:07,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 5: [2023-02-03 16:35:07,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 5: [2023-02-03 16:35:07,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 12: [2023-02-03 16:35:07,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 0: [2023-02-03 16:35:07,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 0: [2023-02-03 16:35:07,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 0: [2023-02-03 16:35:07,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 5: [2023-02-03 16:35:07,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 1: [2023-02-03 16:35:07,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 2: [2023-02-03 16:35:07,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 8: [2023-02-03 16:35:07,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 0: [2023-02-03 16:35:07,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 0: [2023-02-03 16:35:07,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 0: [2023-02-03 16:35:07,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 0: [2023-02-03 16:35:07,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 0: [2023-02-03 16:35:07,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 2: [2023-02-03 16:35:07,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 12: [2023-02-03 16:35:07,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 1: [2023-02-03 16:35:07,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 1: [2023-02-03 16:35:07,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 12: [2023-02-03 16:35:07,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 12: [2023-02-03 16:35:07,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 12: [2023-02-03 16:35:07,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 1: [2023-02-03 16:35:07,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 2: [2023-02-03 16:35:07,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 8: [2023-02-03 16:35:07,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 2: [2023-02-03 16:35:07,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 5: [2023-02-03 16:35:07,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 5: [2023-02-03 16:35:07,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 5: [2023-02-03 16:35:07,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 5: [2023-02-03 16:35:07,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 5: [2023-02-03 16:35:07,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 12: [2023-02-03 16:35:07,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 2: [2023-02-03 16:35:07,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 5: [2023-02-03 16:35:07,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 5: [2023-02-03 16:35:07,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 9: [2023-02-03 16:35:07,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 12: [2023-02-03 16:35:07,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 12: [2023-02-03 16:35:07,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 12: [2023-02-03 16:35:07,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 0: [2023-02-03 16:35:07,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 5: [2023-02-03 16:35:07,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 3: [2023-02-03 16:35:07,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 3: [2023-02-03 16:35:07,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 3: [2023-02-03 16:35:07,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 14: [2023-02-03 16:35:07,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 11: [2023-02-03 16:35:07,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 11: [2023-02-03 16:35:07,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 8: [2023-02-03 16:35:07,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 11: [2023-02-03 16:35:07,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 11: [2023-02-03 16:35:07,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 0: [2023-02-03 16:35:07,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 0: [2023-02-03 16:35:07,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 8: [2023-02-03 16:35:07,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 0: [2023-02-03 16:35:07,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 9: [2023-02-03 16:35:07,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 0: [2023-02-03 16:35:07,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 0: [2023-02-03 16:35:07,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 0: [2023-02-03 16:35:07,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 0: [2023-02-03 16:35:07,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 2: [2023-02-03 16:35:07,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 9: [2023-02-03 16:35:07,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 9: [2023-02-03 16:35:07,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 8: [2023-02-03 16:35:07,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 14: [2023-02-03 16:35:07,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 8: [2023-02-03 16:35:07,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 9: [2023-02-03 16:35:07,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 14: [2023-02-03 16:35:07,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:07,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 14: [2023-02-03 16:35:07,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 14: [2023-02-03 16:35:07,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 7: [2023-02-03 16:35:07,948] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 13: [2023-02-03 16:35:07,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 13: [2023-02-03 16:35:07,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 10: [2023-02-03 16:35:07,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 7: [2023-02-03 16:35:07,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 7: [2023-02-03 16:35:07,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 7: [2023-02-03 16:35:07,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 13: [2023-02-03 16:35:07,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 13: [2023-02-03 16:35:07,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 7: [2023-02-03 16:35:07,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 13: [2023-02-03 16:35:07,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 7: [2023-02-03 16:35:07,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 7: [2023-02-03 16:35:07,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 13: [2023-02-03 16:35:07,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 13: [2023-02-03 16:35:07,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 13: [2023-02-03 16:35:07,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 7: [2023-02-03 16:35:07,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 13: [2023-02-03 16:35:07,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 7: [2023-02-03 16:35:07,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 10: [2023-02-03 16:35:07,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 11: [2023-02-03 16:35:07,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 7: [2023-02-03 16:35:07,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 7: [2023-02-03 16:35:07,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 7: [2023-02-03 16:35:07,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 10: [2023-02-03 16:35:07,952] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 10: [2023-02-03 16:35:07,952] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 10: [2023-02-03 16:35:07,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 10: [2023-02-03 16:35:07,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 10: [2023-02-03 16:35:07,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 10: [2023-02-03 16:35:07,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 10: [2023-02-03 16:35:07,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 7: [2023-02-03 16:35:07,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 7: [2023-02-03 16:35:07,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 7: [2023-02-03 16:35:07,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 7: [2023-02-03 16:35:07,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:07,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 13: [2023-02-03 16:35:07,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 13: [2023-02-03 16:35:07,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 11: [2023-02-03 16:35:07,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:07,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 13: [2023-02-03 16:35:07,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 13: [2023-02-03 16:35:07,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 13: [2023-02-03 16:35:07,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 10: [2023-02-03 16:35:07,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 10: [2023-02-03 16:35:07,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 10: [2023-02-03 16:35:07,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 11: [2023-02-03 16:35:07,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:07,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 10: [2023-02-03 16:35:07,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 10: [2023-02-03 16:35:07,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt... 12: [2023-02-03 16:35:07,965] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 11: [2023-02-03 16:35:07,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 11: [2023-02-03 16:35:07,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 11: [2023-02-03 16:35:07,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 12: [2023-02-03 16:35:07,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 12: [2023-02-03 16:35:07,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 12: [2023-02-03 16:35:07,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 5: [2023-02-03 16:35:07,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 5: [2023-02-03 16:35:07,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 5: [2023-02-03 16:35:07,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 5: [2023-02-03 16:35:07,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 5: [2023-02-03 16:35:07,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 5: [2023-02-03 16:35:07,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 5: [2023-02-03 16:35:07,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 12: [2023-02-03 16:35:07,980] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:07,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 11: [2023-02-03 16:35:07,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:07,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 10: [2023-02-03 16:35:07,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 12: [2023-02-03 16:35:07,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 12: [2023-02-03 16:35:07,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 12: [2023-02-03 16:35:07,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 12: [2023-02-03 16:35:07,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 12: [2023-02-03 16:35:07,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 7: [2023-02-03 16:35:07,984] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 7: [2023-02-03 16:35:07,984] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 7: [2023-02-03 16:35:07,984] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 12: [2023-02-03 16:35:07,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 12: [2023-02-03 16:35:07,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:07,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 0: [2023-02-03 16:35:07,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 7: [2023-02-03 16:35:07,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 5: [2023-02-03 16:35:07,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 0: [2023-02-03 16:35:07,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 0: [2023-02-03 16:35:07,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 13: [2023-02-03 16:35:07,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 5: [2023-02-03 16:35:07,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 5: [2023-02-03 16:35:07,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 5: [2023-02-03 16:35:07,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 5: [2023-02-03 16:35:07,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:07,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:07,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 5: [2023-02-03 16:35:07,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 5: [2023-02-03 16:35:07,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 5: [2023-02-03 16:35:07,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 7: [2023-02-03 16:35:07,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 7: [2023-02-03 16:35:07,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 7: [2023-02-03 16:35:07,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 7: [2023-02-03 16:35:07,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 12: [2023-02-03 16:35:07,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 7: [2023-02-03 16:35:07,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 7: [2023-02-03 16:35:07,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:07,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 12: [2023-02-03 16:35:07,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 7: [2023-02-03 16:35:07,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:07,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 12: [2023-02-03 16:35:08,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 12: [2023-02-03 16:35:08,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:08,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 13: [2023-02-03 16:35:08,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 13: [2023-02-03 16:35:08,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 13: [2023-02-03 16:35:08,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 13: [2023-02-03 16:35:08,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 0: [2023-02-03 16:35:08,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 0: [2023-02-03 16:35:08,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 0: [2023-02-03 16:35:08,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 0: [2023-02-03 16:35:08,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 7: [2023-02-03 16:35:08,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 5: [2023-02-03 16:35:08,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:08,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 10: [2023-02-03 16:35:08,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 10: [2023-02-03 16:35:08,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 10: [2023-02-03 16:35:08,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 0: [2023-02-03 16:35:08,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:08,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 13: [2023-02-03 16:35:08,009] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 0: [2023-02-03 16:35:08,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:08,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 7: [2023-02-03 16:35:08,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 0: [2023-02-03 16:35:08,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_13-model_00-model_states.pt. 7: [2023-02-03 16:35:08,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 0: [2023-02-03 16:35:08,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 7: [2023-02-03 16:35:08,018] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 7: [2023-02-03 16:35:08,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:08,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:08,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:08,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:08,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 0: [2023-02-03 16:35:08,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 0: [2023-02-03 16:35:08,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:08,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:08,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:08,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:08,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:08,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:08,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:08,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 0: [2023-02-03 16:35:08,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 0: [2023-02-03 16:35:08,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 0: [2023-02-03 16:35:08,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 5: [2023-02-03 16:35:08,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 5: [2023-02-03 16:35:08,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 5: [2023-02-03 16:35:08,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 5: [2023-02-03 16:35:08,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 5: [2023-02-03 16:35:08,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 5: [2023-02-03 16:35:08,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 5: [2023-02-03 16:35:08,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 5: [2023-02-03 16:35:08,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 5: [2023-02-03 16:35:08,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 5: [2023-02-03 16:35:08,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 5: [2023-02-03 16:35:08,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 5: [2023-02-03 16:35:08,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 5: [2023-02-03 16:35:08,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 5: [2023-02-03 16:35:08,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 5: [2023-02-03 16:35:08,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 5: [2023-02-03 16:35:08,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 15: [2023-02-03 16:35:08,214] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 6: [2023-02-03 16:35:08,214] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 15: [2023-02-03 16:35:08,214] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 15: [2023-02-03 16:35:08,214] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 15: [2023-02-03 16:35:08,214] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 15: [2023-02-03 16:35:08,214] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 15: [2023-02-03 16:35:08,214] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 15: [2023-02-03 16:35:08,214] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 15: [2023-02-03 16:35:08,214] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 6: [2023-02-03 16:35:08,214] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 6: [2023-02-03 16:35:08,214] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 6: [2023-02-03 16:35:08,214] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 6: [2023-02-03 16:35:08,214] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 6: [2023-02-03 16:35:08,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 6: [2023-02-03 16:35:08,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 6: [2023-02-03 16:35:08,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 6: [2023-02-03 16:35:08,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 6: [2023-02-03 16:35:08,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 6: [2023-02-03 16:35:08,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 15: [2023-02-03 16:35:08,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 6: [2023-02-03 16:35:08,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 15: [2023-02-03 16:35:08,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 15: [2023-02-03 16:35:08,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 15: [2023-02-03 16:35:08,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 15: [2023-02-03 16:35:08,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 15: [2023-02-03 16:35:08,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 15: [2023-02-03 16:35:08,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 15: [2023-02-03 16:35:08,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 6: [2023-02-03 16:35:08,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 6: [2023-02-03 16:35:08,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 6: [2023-02-03 16:35:08,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 6: [2023-02-03 16:35:08,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 12: [2023-02-03 16:35:08,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 12: [2023-02-03 16:35:08,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 12: [2023-02-03 16:35:08,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 12: [2023-02-03 16:35:08,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 12: [2023-02-03 16:35:08,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 12: [2023-02-03 16:35:08,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 12: [2023-02-03 16:35:08,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 12: [2023-02-03 16:35:08,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 5: [2023-02-03 16:35:08,232] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 12: [2023-02-03 16:35:08,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 12: [2023-02-03 16:35:08,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 12: [2023-02-03 16:35:08,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 12: [2023-02-03 16:35:08,235] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 12: [2023-02-03 16:35:08,235] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 12: [2023-02-03 16:35:08,235] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 12: [2023-02-03 16:35:08,235] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 12: [2023-02-03 16:35:08,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 5: [2023-02-03 16:35:08,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 5: [2023-02-03 16:35:08,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 5: [2023-02-03 16:35:08,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 5: [2023-02-03 16:35:08,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 5: [2023-02-03 16:35:08,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 6: [2023-02-03 16:35:08,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 5: [2023-02-03 16:35:08,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 5: [2023-02-03 16:35:08,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 15: [2023-02-03 16:35:08,247] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 15: [2023-02-03 16:35:08,250] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 5: [2023-02-03 16:35:08,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 6: [2023-02-03 16:35:08,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 6: [2023-02-03 16:35:08,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 15: [2023-02-03 16:35:08,255] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 15: [2023-02-03 16:35:08,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 6: [2023-02-03 16:35:08,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 5: [2023-02-03 16:35:08,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 5: [2023-02-03 16:35:08,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 5: [2023-02-03 16:35:08,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 5: [2023-02-03 16:35:08,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 5: [2023-02-03 16:35:08,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 15: [2023-02-03 16:35:08,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 15: [2023-02-03 16:35:08,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 15: [2023-02-03 16:35:08,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 12: [2023-02-03 16:35:08,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 6: [2023-02-03 16:35:08,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 6: [2023-02-03 16:35:08,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 6: [2023-02-03 16:35:08,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 6: [2023-02-03 16:35:08,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 6: [2023-02-03 16:35:08,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 15: [2023-02-03 16:35:08,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 5: [2023-02-03 16:35:08,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 5: [2023-02-03 16:35:08,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 15: [2023-02-03 16:35:08,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 15: [2023-02-03 16:35:08,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 12: [2023-02-03 16:35:08,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 12: [2023-02-03 16:35:08,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 15: [2023-02-03 16:35:08,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 12: [2023-02-03 16:35:08,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 15: [2023-02-03 16:35:08,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 6: [2023-02-03 16:35:08,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 15: [2023-02-03 16:35:08,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 15: [2023-02-03 16:35:08,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 15: [2023-02-03 16:35:08,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 6: [2023-02-03 16:35:08,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 12: [2023-02-03 16:35:08,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 12: [2023-02-03 16:35:08,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 12: [2023-02-03 16:35:08,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 12: [2023-02-03 16:35:08,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 6: [2023-02-03 16:35:08,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 6: [2023-02-03 16:35:08,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 12: [2023-02-03 16:35:08,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 2: [2023-02-03 16:35:08,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 2: [2023-02-03 16:35:08,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 2: [2023-02-03 16:35:08,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 2: [2023-02-03 16:35:08,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 2: [2023-02-03 16:35:08,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 2: [2023-02-03 16:35:08,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 2: [2023-02-03 16:35:08,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 2: [2023-02-03 16:35:08,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 6: [2023-02-03 16:35:08,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 6: [2023-02-03 16:35:08,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 6: [2023-02-03 16:35:08,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 15: [2023-02-03 16:35:08,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 2: [2023-02-03 16:35:08,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 12: [2023-02-03 16:35:08,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 2: [2023-02-03 16:35:08,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 2: [2023-02-03 16:35:08,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 2: [2023-02-03 16:35:08,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 2: [2023-02-03 16:35:08,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 12: [2023-02-03 16:35:08,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 2: [2023-02-03 16:35:08,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 2: [2023-02-03 16:35:08,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 2: [2023-02-03 16:35:08,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 12: [2023-02-03 16:35:08,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 12: [2023-02-03 16:35:08,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 12: [2023-02-03 16:35:08,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 12: [2023-02-03 16:35:08,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 12: [2023-02-03 16:35:08,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 0: [2023-02-03 16:35:08,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 0: [2023-02-03 16:35:08,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 0: [2023-02-03 16:35:08,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 0: [2023-02-03 16:35:08,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 0: [2023-02-03 16:35:08,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 0: [2023-02-03 16:35:08,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 0: [2023-02-03 16:35:08,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 0: [2023-02-03 16:35:08,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 0: [2023-02-03 16:35:08,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 0: [2023-02-03 16:35:08,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 0: [2023-02-03 16:35:08,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 0: [2023-02-03 16:35:08,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 0: [2023-02-03 16:35:08,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 0: [2023-02-03 16:35:08,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 0: [2023-02-03 16:35:08,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 0: [2023-02-03 16:35:08,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 2: [2023-02-03 16:35:08,322] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 2: [2023-02-03 16:35:08,322] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 2: [2023-02-03 16:35:08,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 2: [2023-02-03 16:35:08,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 2: [2023-02-03 16:35:08,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 2: [2023-02-03 16:35:08,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 2: [2023-02-03 16:35:08,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 3: [2023-02-03 16:35:08,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 3: [2023-02-03 16:35:08,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 3: [2023-02-03 16:35:08,328] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 3: [2023-02-03 16:35:08,328] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 3: [2023-02-03 16:35:08,328] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 3: [2023-02-03 16:35:08,329] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 3: [2023-02-03 16:35:08,329] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 3: [2023-02-03 16:35:08,329] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 3: [2023-02-03 16:35:08,329] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 2: [2023-02-03 16:35:08,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 3: [2023-02-03 16:35:08,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 3: [2023-02-03 16:35:08,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 3: [2023-02-03 16:35:08,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 3: [2023-02-03 16:35:08,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 3: [2023-02-03 16:35:08,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 3: [2023-02-03 16:35:08,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 3: [2023-02-03 16:35:08,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 2: [2023-02-03 16:35:08,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 11: [2023-02-03 16:35:08,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 11: [2023-02-03 16:35:08,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 11: [2023-02-03 16:35:08,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 11: [2023-02-03 16:35:08,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 11: [2023-02-03 16:35:08,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 11: [2023-02-03 16:35:08,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 11: [2023-02-03 16:35:08,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 11: [2023-02-03 16:35:08,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 2: [2023-02-03 16:35:08,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 7: [2023-02-03 16:35:08,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 2: [2023-02-03 16:35:08,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 2: [2023-02-03 16:35:08,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 8: [2023-02-03 16:35:08,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 7: [2023-02-03 16:35:08,341] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 7: [2023-02-03 16:35:08,341] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 7: [2023-02-03 16:35:08,341] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 7: [2023-02-03 16:35:08,341] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 7: [2023-02-03 16:35:08,341] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 7: [2023-02-03 16:35:08,341] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 7: [2023-02-03 16:35:08,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 14: [2023-02-03 16:35:08,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 11: [2023-02-03 16:35:08,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 11: [2023-02-03 16:35:08,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 11: [2023-02-03 16:35:08,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 2: [2023-02-03 16:35:08,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 2: [2023-02-03 16:35:08,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 8: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 9: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 11: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 4: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 11: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 11: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 11: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 8: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 14: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 8: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 8: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 11: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 8: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 8: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 8: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 8: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 7: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 7: [2023-02-03 16:35:08,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 4: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 4: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 4: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 4: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 4: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 4: [2023-02-03 16:35:08,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 7: [2023-02-03 16:35:08,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 4: [2023-02-03 16:35:08,344] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 8: [2023-02-03 16:35:08,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 14: [2023-02-03 16:35:08,344] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 7: [2023-02-03 16:35:08,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 14: [2023-02-03 16:35:08,344] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 14: [2023-02-03 16:35:08,344] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 14: [2023-02-03 16:35:08,344] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 14: [2023-02-03 16:35:08,344] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 14: [2023-02-03 16:35:08,344] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 14: [2023-02-03 16:35:08,344] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 9: [2023-02-03 16:35:08,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 7: [2023-02-03 16:35:08,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 0: [2023-02-03 16:35:08,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 4: [2023-02-03 16:35:08,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 7: [2023-02-03 16:35:08,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 7: [2023-02-03 16:35:08,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 9: [2023-02-03 16:35:08,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 7: [2023-02-03 16:35:08,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 9: [2023-02-03 16:35:08,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 2: [2023-02-03 16:35:08,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 9: [2023-02-03 16:35:08,346] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 9: [2023-02-03 16:35:08,346] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 9: [2023-02-03 16:35:08,346] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 9: [2023-02-03 16:35:08,346] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 9: [2023-02-03 16:35:08,346] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 10: [2023-02-03 16:35:08,346] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 13: [2023-02-03 16:35:08,346] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 14: [2023-02-03 16:35:08,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:08,346] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 4: [2023-02-03 16:35:08,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:08,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 4: [2023-02-03 16:35:08,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 4: [2023-02-03 16:35:08,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 9: [2023-02-03 16:35:08,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:08,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 10: [2023-02-03 16:35:08,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 10: [2023-02-03 16:35:08,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 10: [2023-02-03 16:35:08,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 4: [2023-02-03 16:35:08,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:08,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 4: [2023-02-03 16:35:08,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 4: [2023-02-03 16:35:08,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 4: [2023-02-03 16:35:08,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 14: [2023-02-03 16:35:08,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 9: [2023-02-03 16:35:08,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:08,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:08,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:08,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:08,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 8: [2023-02-03 16:35:08,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 8: [2023-02-03 16:35:08,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:08,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 8: [2023-02-03 16:35:08,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 8: [2023-02-03 16:35:08,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 8: [2023-02-03 16:35:08,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 8: [2023-02-03 16:35:08,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 2: [2023-02-03 16:35:08,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 13: [2023-02-03 16:35:08,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 13: [2023-02-03 16:35:08,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 13: [2023-02-03 16:35:08,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 13: [2023-02-03 16:35:08,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 13: [2023-02-03 16:35:08,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 13: [2023-02-03 16:35:08,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 14: [2023-02-03 16:35:08,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 14: [2023-02-03 16:35:08,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 14: [2023-02-03 16:35:08,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 14: [2023-02-03 16:35:08,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 14: [2023-02-03 16:35:08,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 9: [2023-02-03 16:35:08,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:08,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 9: [2023-02-03 16:35:08,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 9: [2023-02-03 16:35:08,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 9: [2023-02-03 16:35:08,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:08,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:08,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 9: [2023-02-03 16:35:08,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:08,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:08,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 10: [2023-02-03 16:35:08,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:08,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:08,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:08,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:08,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:08,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 13: [2023-02-03 16:35:08,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 0: [2023-02-03 16:35:08,359] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 0: [2023-02-03 16:35:08,361] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 3: [2023-02-03 16:35:08,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 1: [2023-02-03 16:35:08,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 1: [2023-02-03 16:35:08,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 1: [2023-02-03 16:35:08,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 1: [2023-02-03 16:35:08,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 1: [2023-02-03 16:35:08,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 1: [2023-02-03 16:35:08,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 3: [2023-02-03 16:35:08,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 3: [2023-02-03 16:35:08,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 1: [2023-02-03 16:35:08,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 1: [2023-02-03 16:35:08,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 1: [2023-02-03 16:35:08,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 1: [2023-02-03 16:35:08,367] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 8: [2023-02-03 16:35:08,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 1: [2023-02-03 16:35:08,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 1: [2023-02-03 16:35:08,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 1: [2023-02-03 16:35:08,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 1: [2023-02-03 16:35:08,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 1: [2023-02-03 16:35:08,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 1: [2023-02-03 16:35:08,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt... 14: [2023-02-03 16:35:08,369] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 0: [2023-02-03 16:35:08,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 0: [2023-02-03 16:35:08,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 0: [2023-02-03 16:35:08,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 0: [2023-02-03 16:35:08,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 0: [2023-02-03 16:35:08,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 0: [2023-02-03 16:35:08,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 11: [2023-02-03 16:35:08,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 11: [2023-02-03 16:35:08,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 3: [2023-02-03 16:35:08,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 3: [2023-02-03 16:35:08,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 3: [2023-02-03 16:35:08,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 3: [2023-02-03 16:35:08,376] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 0: [2023-02-03 16:35:08,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 3: [2023-02-03 16:35:08,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 7: [2023-02-03 16:35:08,376] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 8: [2023-02-03 16:35:08,376] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 9: [2023-02-03 16:35:08,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 4: [2023-02-03 16:35:08,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 3: [2023-02-03 16:35:08,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 7: [2023-02-03 16:35:08,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 9: [2023-02-03 16:35:08,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 14: [2023-02-03 16:35:08,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 13: [2023-02-03 16:35:08,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 10: [2023-02-03 16:35:08,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 10: [2023-02-03 16:35:08,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 3: [2023-02-03 16:35:08,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 10: [2023-02-03 16:35:08,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 11: [2023-02-03 16:35:08,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 13: [2023-02-03 16:35:08,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 14: [2023-02-03 16:35:08,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 7: [2023-02-03 16:35:08,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 3: [2023-02-03 16:35:08,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 7: [2023-02-03 16:35:08,383] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 8: [2023-02-03 16:35:08,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 11: [2023-02-03 16:35:08,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 11: [2023-02-03 16:35:08,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 11: [2023-02-03 16:35:08,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 4: [2023-02-03 16:35:08,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 11: [2023-02-03 16:35:08,387] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 9: [2023-02-03 16:35:08,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 4: [2023-02-03 16:35:08,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 4: [2023-02-03 16:35:08,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 4: [2023-02-03 16:35:08,387] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 4: [2023-02-03 16:35:08,387] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 4: [2023-02-03 16:35:08,387] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 7: [2023-02-03 16:35:08,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 7: [2023-02-03 16:35:08,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 9: [2023-02-03 16:35:08,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 7: [2023-02-03 16:35:08,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 7: [2023-02-03 16:35:08,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 11: [2023-02-03 16:35:08,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 11: [2023-02-03 16:35:08,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 7: [2023-02-03 16:35:08,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 0: [2023-02-03 16:35:08,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 0: [2023-02-03 16:35:08,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 3: [2023-02-03 16:35:08,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 13: [2023-02-03 16:35:08,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 0: [2023-02-03 16:35:08,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 3: [2023-02-03 16:35:08,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 3: [2023-02-03 16:35:08,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 10: [2023-02-03 16:35:08,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 8: [2023-02-03 16:35:08,394] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 8: [2023-02-03 16:35:08,394] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 0: [2023-02-03 16:35:08,394] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 10: [2023-02-03 16:35:08,394] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 10: [2023-02-03 16:35:08,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 8: [2023-02-03 16:35:08,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 11: [2023-02-03 16:35:08,395] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 1: [2023-02-03 16:35:08,395] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 0: [2023-02-03 16:35:08,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 0: [2023-02-03 16:35:08,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 9: [2023-02-03 16:35:08,396] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 3: [2023-02-03 16:35:08,396] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 7: [2023-02-03 16:35:08,396] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 3: [2023-02-03 16:35:08,396] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 4: [2023-02-03 16:35:08,396] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 8: [2023-02-03 16:35:08,396] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 8: [2023-02-03 16:35:08,396] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 8: [2023-02-03 16:35:08,396] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 14: [2023-02-03 16:35:08,397] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 4: [2023-02-03 16:35:08,397] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 11: [2023-02-03 16:35:08,397] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 13: [2023-02-03 16:35:08,398] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 14: [2023-02-03 16:35:08,398] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 14: [2023-02-03 16:35:08,398] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 14: [2023-02-03 16:35:08,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 7: [2023-02-03 16:35:08,398] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 7: [2023-02-03 16:35:08,399] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 11: [2023-02-03 16:35:08,401] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 9: [2023-02-03 16:35:08,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 10: [2023-02-03 16:35:08,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 10: [2023-02-03 16:35:08,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 1: [2023-02-03 16:35:08,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 10: [2023-02-03 16:35:08,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 10: [2023-02-03 16:35:08,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 10: [2023-02-03 16:35:08,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 14: [2023-02-03 16:35:08,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 14: [2023-02-03 16:35:08,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 13: [2023-02-03 16:35:08,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 4: [2023-02-03 16:35:08,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 9: [2023-02-03 16:35:08,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 9: [2023-02-03 16:35:08,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 9: [2023-02-03 16:35:08,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 4: [2023-02-03 16:35:08,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 9: [2023-02-03 16:35:08,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 9: [2023-02-03 16:35:08,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 4: [2023-02-03 16:35:08,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 13: [2023-02-03 16:35:08,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 13: [2023-02-03 16:35:08,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 13: [2023-02-03 16:35:08,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 13: [2023-02-03 16:35:08,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 14: [2023-02-03 16:35:08,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 11: [2023-02-03 16:35:08,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 4: [2023-02-03 16:35:08,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 11: [2023-02-03 16:35:08,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 4: [2023-02-03 16:35:08,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 11: [2023-02-03 16:35:08,407] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 4: [2023-02-03 16:35:08,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 1: [2023-02-03 16:35:08,407] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 7: [2023-02-03 16:35:08,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 8: [2023-02-03 16:35:08,408] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 7: [2023-02-03 16:35:08,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 1: [2023-02-03 16:35:08,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 7: [2023-02-03 16:35:08,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 1: [2023-02-03 16:35:08,409] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 1: [2023-02-03 16:35:08,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 1: [2023-02-03 16:35:08,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 11: [2023-02-03 16:35:08,411] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 1: [2023-02-03 16:35:08,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 13: [2023-02-03 16:35:08,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 8: [2023-02-03 16:35:08,411] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 8: [2023-02-03 16:35:08,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 14: [2023-02-03 16:35:08,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 7: [2023-02-03 16:35:08,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 4: [2023-02-03 16:35:08,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 8: [2023-02-03 16:35:08,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 14: [2023-02-03 16:35:08,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 8: [2023-02-03 16:35:08,417] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 8: [2023-02-03 16:35:08,417] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 14: [2023-02-03 16:35:08,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 1: [2023-02-03 16:35:08,418] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_14-model_00-model_states.pt. 10: [2023-02-03 16:35:08,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 14: [2023-02-03 16:35:08,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 1: [2023-02-03 16:35:08,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 9: [2023-02-03 16:35:08,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 10: [2023-02-03 16:35:08,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 10: [2023-02-03 16:35:08,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 1: [2023-02-03 16:35:08,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 13: [2023-02-03 16:35:08,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 14: [2023-02-03 16:35:08,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 10: [2023-02-03 16:35:08,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 10: [2023-02-03 16:35:08,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 14: [2023-02-03 16:35:08,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 13: [2023-02-03 16:35:08,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 13: [2023-02-03 16:35:08,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 13: [2023-02-03 16:35:08,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 9: [2023-02-03 16:35:08,425] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 9: [2023-02-03 16:35:08,425] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 9: [2023-02-03 16:35:08,425] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 8: [2023-02-03 16:35:08,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 1: [2023-02-03 16:35:08,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 9: [2023-02-03 16:35:08,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 13: [2023-02-03 16:35:08,427] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 13: [2023-02-03 16:35:08,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 1: [2023-02-03 16:35:08,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 1: [2023-02-03 16:35:08,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 1: [2023-02-03 16:35:08,431] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 1: [2023-02-03 16:35:08,434] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 6: [2023-02-03 16:35:08,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 6: [2023-02-03 16:35:08,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 6: [2023-02-03 16:35:08,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 6: [2023-02-03 16:35:08,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 6: [2023-02-03 16:35:08,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 6: [2023-02-03 16:35:08,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 6: [2023-02-03 16:35:08,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 6: [2023-02-03 16:35:08,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 6: [2023-02-03 16:35:08,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 6: [2023-02-03 16:35:08,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 6: [2023-02-03 16:35:08,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 6: [2023-02-03 16:35:08,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 6: [2023-02-03 16:35:08,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 6: [2023-02-03 16:35:08,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 6: [2023-02-03 16:35:08,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 6: [2023-02-03 16:35:08,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 15: [2023-02-03 16:35:08,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 15: [2023-02-03 16:35:08,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 15: [2023-02-03 16:35:08,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 15: [2023-02-03 16:35:08,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 15: [2023-02-03 16:35:08,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 15: [2023-02-03 16:35:08,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 15: [2023-02-03 16:35:08,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 15: [2023-02-03 16:35:08,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 6: [2023-02-03 16:35:08,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 15: [2023-02-03 16:35:08,560] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 15: [2023-02-03 16:35:08,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 15: [2023-02-03 16:35:08,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 15: [2023-02-03 16:35:08,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 15: [2023-02-03 16:35:08,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 15: [2023-02-03 16:35:08,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 15: [2023-02-03 16:35:08,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 15: [2023-02-03 16:35:08,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 6: [2023-02-03 16:35:08,562] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 6: [2023-02-03 16:35:08,562] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 6: [2023-02-03 16:35:08,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 6: [2023-02-03 16:35:08,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 6: [2023-02-03 16:35:08,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 6: [2023-02-03 16:35:08,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 6: [2023-02-03 16:35:08,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 6: [2023-02-03 16:35:08,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 6: [2023-02-03 16:35:08,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 5: [2023-02-03 16:35:08,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 2: [2023-02-03 16:35:08,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 2: [2023-02-03 16:35:08,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 2: [2023-02-03 16:35:08,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 2: [2023-02-03 16:35:08,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 2: [2023-02-03 16:35:08,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 5: [2023-02-03 16:35:08,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 2: [2023-02-03 16:35:08,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 5: [2023-02-03 16:35:08,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 2: [2023-02-03 16:35:08,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 5: [2023-02-03 16:35:08,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 5: [2023-02-03 16:35:08,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 5: [2023-02-03 16:35:08,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 5: [2023-02-03 16:35:08,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 2: [2023-02-03 16:35:08,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 12: [2023-02-03 16:35:08,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 5: [2023-02-03 16:35:08,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 12: [2023-02-03 16:35:08,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 12: [2023-02-03 16:35:08,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 12: [2023-02-03 16:35:08,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 12: [2023-02-03 16:35:08,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 12: [2023-02-03 16:35:08,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 12: [2023-02-03 16:35:08,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 12: [2023-02-03 16:35:08,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 12: [2023-02-03 16:35:08,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 5: [2023-02-03 16:35:08,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 6: [2023-02-03 16:35:08,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 2: [2023-02-03 16:35:08,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 12: [2023-02-03 16:35:08,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 12: [2023-02-03 16:35:08,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 2: [2023-02-03 16:35:08,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 2: [2023-02-03 16:35:08,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 2: [2023-02-03 16:35:08,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 5: [2023-02-03 16:35:08,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 2: [2023-02-03 16:35:08,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 5: [2023-02-03 16:35:08,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 5: [2023-02-03 16:35:08,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 5: [2023-02-03 16:35:08,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 5: [2023-02-03 16:35:08,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 5: [2023-02-03 16:35:08,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 5: [2023-02-03 16:35:08,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 2: [2023-02-03 16:35:08,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 2: [2023-02-03 16:35:08,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 2: [2023-02-03 16:35:08,585] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 12: [2023-02-03 16:35:08,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 12: [2023-02-03 16:35:08,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 12: [2023-02-03 16:35:08,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 12: [2023-02-03 16:35:08,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 12: [2023-02-03 16:35:08,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 6: [2023-02-03 16:35:08,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 15: [2023-02-03 16:35:08,591] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 15: [2023-02-03 16:35:08,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 6: [2023-02-03 16:35:08,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 6: [2023-02-03 16:35:08,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 6: [2023-02-03 16:35:08,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 6: [2023-02-03 16:35:08,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 15: [2023-02-03 16:35:08,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 15: [2023-02-03 16:35:08,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 15: [2023-02-03 16:35:08,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 15: [2023-02-03 16:35:08,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 15: [2023-02-03 16:35:08,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 15: [2023-02-03 16:35:08,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 15: [2023-02-03 16:35:08,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 15: [2023-02-03 16:35:08,609] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 12: [2023-02-03 16:35:08,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 12: [2023-02-03 16:35:08,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 15: [2023-02-03 16:35:08,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 8: [2023-02-03 16:35:08,618] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 8: [2023-02-03 16:35:08,618] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 15: [2023-02-03 16:35:08,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 15: [2023-02-03 16:35:08,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 15: [2023-02-03 16:35:08,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 8: [2023-02-03 16:35:08,618] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 8: [2023-02-03 16:35:08,618] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 8: [2023-02-03 16:35:08,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 8: [2023-02-03 16:35:08,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 8: [2023-02-03 16:35:08,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 8: [2023-02-03 16:35:08,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 15: [2023-02-03 16:35:08,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 5: [2023-02-03 16:35:08,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 8: [2023-02-03 16:35:08,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 2: [2023-02-03 16:35:08,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 2: [2023-02-03 16:35:08,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 8: [2023-02-03 16:35:08,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 5: [2023-02-03 16:35:08,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 5: [2023-02-03 16:35:08,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 5: [2023-02-03 16:35:08,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 5: [2023-02-03 16:35:08,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 2: [2023-02-03 16:35:08,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 2: [2023-02-03 16:35:08,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 2: [2023-02-03 16:35:08,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 2: [2023-02-03 16:35:08,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 5: [2023-02-03 16:35:08,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 5: [2023-02-03 16:35:08,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 5: [2023-02-03 16:35:08,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 2: [2023-02-03 16:35:08,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 2: [2023-02-03 16:35:08,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 12: [2023-02-03 16:35:08,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 8: [2023-02-03 16:35:08,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 4: [2023-02-03 16:35:08,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 8: [2023-02-03 16:35:08,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 8: [2023-02-03 16:35:08,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 8: [2023-02-03 16:35:08,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 8: [2023-02-03 16:35:08,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 8: [2023-02-03 16:35:08,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 4: [2023-02-03 16:35:08,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 4: [2023-02-03 16:35:08,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 12: [2023-02-03 16:35:08,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 4: [2023-02-03 16:35:08,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 14: [2023-02-03 16:35:08,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 4: [2023-02-03 16:35:08,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 15: [2023-02-03 16:35:08,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 14: [2023-02-03 16:35:08,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 14: [2023-02-03 16:35:08,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 14: [2023-02-03 16:35:08,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 14: [2023-02-03 16:35:08,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 14: [2023-02-03 16:35:08,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 14: [2023-02-03 16:35:08,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 14: [2023-02-03 16:35:08,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 12: [2023-02-03 16:35:08,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 9: [2023-02-03 16:35:08,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 14: [2023-02-03 16:35:08,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 12: [2023-02-03 16:35:08,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 9: [2023-02-03 16:35:08,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 14: [2023-02-03 16:35:08,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 9: [2023-02-03 16:35:08,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 9: [2023-02-03 16:35:08,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 9: [2023-02-03 16:35:08,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 9: [2023-02-03 16:35:08,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 9: [2023-02-03 16:35:08,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 9: [2023-02-03 16:35:08,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 9: [2023-02-03 16:35:08,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 12: [2023-02-03 16:35:08,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 9: [2023-02-03 16:35:08,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 3: [2023-02-03 16:35:08,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 0: [2023-02-03 16:35:08,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 0: [2023-02-03 16:35:08,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 0: [2023-02-03 16:35:08,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 0: [2023-02-03 16:35:08,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 0: [2023-02-03 16:35:08,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 0: [2023-02-03 16:35:08,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 7: [2023-02-03 16:35:08,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 7: [2023-02-03 16:35:08,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 7: [2023-02-03 16:35:08,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 0: [2023-02-03 16:35:08,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 7: [2023-02-03 16:35:08,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 14: [2023-02-03 16:35:08,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 0: [2023-02-03 16:35:08,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 7: [2023-02-03 16:35:08,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 7: [2023-02-03 16:35:08,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 7: [2023-02-03 16:35:08,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 7: [2023-02-03 16:35:08,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 13: [2023-02-03 16:35:08,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 3: [2023-02-03 16:35:08,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 12: [2023-02-03 16:35:08,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 12: [2023-02-03 16:35:08,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 12: [2023-02-03 16:35:08,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 14: [2023-02-03 16:35:08,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 13: [2023-02-03 16:35:08,633] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 7: [2023-02-03 16:35:08,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 7: [2023-02-03 16:35:08,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 7: [2023-02-03 16:35:08,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 14: [2023-02-03 16:35:08,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 14: [2023-02-03 16:35:08,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 14: [2023-02-03 16:35:08,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 14: [2023-02-03 16:35:08,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 13: [2023-02-03 16:35:08,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 13: [2023-02-03 16:35:08,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 13: [2023-02-03 16:35:08,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 13: [2023-02-03 16:35:08,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 13: [2023-02-03 16:35:08,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 13: [2023-02-03 16:35:08,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,634] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 9: [2023-02-03 16:35:08,634] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 11: [2023-02-03 16:35:08,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 3: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 3: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 7: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 13: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 11: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 11: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 10: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 9: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 11: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 11: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 11: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 11: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 9: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 7: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 11: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 7: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 9: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 7: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 7: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 9: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 9: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 10: [2023-02-03 16:35:08,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 10: [2023-02-03 16:35:08,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 13: [2023-02-03 16:35:08,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 0: [2023-02-03 16:35:08,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 10: [2023-02-03 16:35:08,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 10: [2023-02-03 16:35:08,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 10: [2023-02-03 16:35:08,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 10: [2023-02-03 16:35:08,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 10: [2023-02-03 16:35:08,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 0: [2023-02-03 16:35:08,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 11: [2023-02-03 16:35:08,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 11: [2023-02-03 16:35:08,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 10: [2023-02-03 16:35:08,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 0: [2023-02-03 16:35:08,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 11: [2023-02-03 16:35:08,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 0: [2023-02-03 16:35:08,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 0: [2023-02-03 16:35:08,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 11: [2023-02-03 16:35:08,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 0: [2023-02-03 16:35:08,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 0: [2023-02-03 16:35:08,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 0: [2023-02-03 16:35:08,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 10: [2023-02-03 16:35:08,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 12: [2023-02-03 16:35:08,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 11: [2023-02-03 16:35:08,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 11: [2023-02-03 16:35:08,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 11: [2023-02-03 16:35:08,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 11: [2023-02-03 16:35:08,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 12: [2023-02-03 16:35:08,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 5: [2023-02-03 16:35:08,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 5: [2023-02-03 16:35:08,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 5: [2023-02-03 16:35:08,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 5: [2023-02-03 16:35:08,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 2: [2023-02-03 16:35:08,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 13: [2023-02-03 16:35:08,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 2: [2023-02-03 16:35:08,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 2: [2023-02-03 16:35:08,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 2: [2023-02-03 16:35:08,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 13: [2023-02-03 16:35:08,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 10: [2023-02-03 16:35:08,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 13: [2023-02-03 16:35:08,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 5: [2023-02-03 16:35:08,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 10: [2023-02-03 16:35:08,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 10: [2023-02-03 16:35:08,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 10: [2023-02-03 16:35:08,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 5: [2023-02-03 16:35:08,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 10: [2023-02-03 16:35:08,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 2: [2023-02-03 16:35:08,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 12: [2023-02-03 16:35:08,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 5: [2023-02-03 16:35:08,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 10: [2023-02-03 16:35:08,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 5: [2023-02-03 16:35:08,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 2: [2023-02-03 16:35:08,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 2: [2023-02-03 16:35:08,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 2: [2023-02-03 16:35:08,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 12: [2023-02-03 16:35:08,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 12: [2023-02-03 16:35:08,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 12: [2023-02-03 16:35:08,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 9: [2023-02-03 16:35:08,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 8: [2023-02-03 16:35:08,653] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 14: [2023-02-03 16:35:08,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 8: [2023-02-03 16:35:08,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 14: [2023-02-03 16:35:08,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 9: [2023-02-03 16:35:08,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 9: [2023-02-03 16:35:08,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 1: [2023-02-03 16:35:08,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 1: [2023-02-03 16:35:08,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 1: [2023-02-03 16:35:08,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 7: [2023-02-03 16:35:08,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 7: [2023-02-03 16:35:08,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 1: [2023-02-03 16:35:08,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 1: [2023-02-03 16:35:08,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 1: [2023-02-03 16:35:08,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 1: [2023-02-03 16:35:08,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 1: [2023-02-03 16:35:08,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 8: [2023-02-03 16:35:08,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 8: [2023-02-03 16:35:08,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 10: [2023-02-03 16:35:08,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 10: [2023-02-03 16:35:08,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 1: [2023-02-03 16:35:08,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 1: [2023-02-03 16:35:08,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 1: [2023-02-03 16:35:08,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 1: [2023-02-03 16:35:08,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 1: [2023-02-03 16:35:08,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 8: [2023-02-03 16:35:08,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 4: [2023-02-03 16:35:08,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 1: [2023-02-03 16:35:08,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 1: [2023-02-03 16:35:08,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 1: [2023-02-03 16:35:08,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt... 7: [2023-02-03 16:35:08,671] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 11: [2023-02-03 16:35:08,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 11: [2023-02-03 16:35:08,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 8: [2023-02-03 16:35:08,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 8: [2023-02-03 16:35:08,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 13: [2023-02-03 16:35:08,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 8: [2023-02-03 16:35:08,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 11: [2023-02-03 16:35:08,673] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 14: [2023-02-03 16:35:08,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 8: [2023-02-03 16:35:08,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 14: [2023-02-03 16:35:08,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 14: [2023-02-03 16:35:08,676] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 3: [2023-02-03 16:35:08,676] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,678] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,678] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 4: [2023-02-03 16:35:08,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 7: [2023-02-03 16:35:08,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 7: [2023-02-03 16:35:08,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 7: [2023-02-03 16:35:08,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 7: [2023-02-03 16:35:08,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 7: [2023-02-03 16:35:08,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 7: [2023-02-03 16:35:08,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 9: [2023-02-03 16:35:08,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 3: [2023-02-03 16:35:08,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 7: [2023-02-03 16:35:08,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 10: [2023-02-03 16:35:08,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 11: [2023-02-03 16:35:08,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 14: [2023-02-03 16:35:08,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 14: [2023-02-03 16:35:08,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 10: [2023-02-03 16:35:08,680] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 9: [2023-02-03 16:35:08,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 9: [2023-02-03 16:35:08,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 9: [2023-02-03 16:35:08,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 9: [2023-02-03 16:35:08,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 8: [2023-02-03 16:35:08,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 10: [2023-02-03 16:35:08,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 8: [2023-02-03 16:35:08,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 14: [2023-02-03 16:35:08,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 14: [2023-02-03 16:35:08,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 14: [2023-02-03 16:35:08,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 7: [2023-02-03 16:35:08,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 4: [2023-02-03 16:35:08,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 0: [2023-02-03 16:35:08,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 0: [2023-02-03 16:35:08,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 9: [2023-02-03 16:35:08,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 9: [2023-02-03 16:35:08,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 3: [2023-02-03 16:35:08,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 3: [2023-02-03 16:35:08,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 4: [2023-02-03 16:35:08,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 11: [2023-02-03 16:35:08,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 13: [2023-02-03 16:35:08,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 13: [2023-02-03 16:35:08,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 11: [2023-02-03 16:35:08,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 11: [2023-02-03 16:35:08,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 11: [2023-02-03 16:35:08,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 11: [2023-02-03 16:35:08,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 13: [2023-02-03 16:35:08,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 11: [2023-02-03 16:35:08,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 11: [2023-02-03 16:35:08,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 4: [2023-02-03 16:35:08,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 10: [2023-02-03 16:35:08,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 10: [2023-02-03 16:35:08,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 10: [2023-02-03 16:35:08,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 4: [2023-02-03 16:35:08,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 13: [2023-02-03 16:35:08,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 8: [2023-02-03 16:35:08,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 8: [2023-02-03 16:35:08,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 14: [2023-02-03 16:35:08,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 8: [2023-02-03 16:35:08,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 4: [2023-02-03 16:35:08,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 11: [2023-02-03 16:35:08,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 8: [2023-02-03 16:35:08,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 10: [2023-02-03 16:35:08,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 14: [2023-02-03 16:35:08,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 7: [2023-02-03 16:35:08,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 3: [2023-02-03 16:35:08,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 14: [2023-02-03 16:35:08,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 3: [2023-02-03 16:35:08,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 3: [2023-02-03 16:35:08,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 3: [2023-02-03 16:35:08,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 0: [2023-02-03 16:35:08,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 9: [2023-02-03 16:35:08,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 0: [2023-02-03 16:35:08,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 7: [2023-02-03 16:35:08,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 7: [2023-02-03 16:35:08,702] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 9: [2023-02-03 16:35:08,702] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 14: [2023-02-03 16:35:08,702] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 11: [2023-02-03 16:35:08,703] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 7: [2023-02-03 16:35:08,702] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 7: [2023-02-03 16:35:08,702] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 1: [2023-02-03 16:35:08,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 11: [2023-02-03 16:35:08,703] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 0: [2023-02-03 16:35:08,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 0: [2023-02-03 16:35:08,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 0: [2023-02-03 16:35:08,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 0: [2023-02-03 16:35:08,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 0: [2023-02-03 16:35:08,704] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 0: [2023-02-03 16:35:08,704] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 14: [2023-02-03 16:35:08,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 9: [2023-02-03 16:35:08,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 8: [2023-02-03 16:35:08,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 14: [2023-02-03 16:35:08,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 1: [2023-02-03 16:35:08,704] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 13: [2023-02-03 16:35:08,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 1: [2023-02-03 16:35:08,704] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 1: [2023-02-03 16:35:08,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 9: [2023-02-03 16:35:08,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 9: [2023-02-03 16:35:08,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 10: [2023-02-03 16:35:08,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 9: [2023-02-03 16:35:08,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 10: [2023-02-03 16:35:08,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 11: [2023-02-03 16:35:08,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 11: [2023-02-03 16:35:08,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 10: [2023-02-03 16:35:08,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 10: [2023-02-03 16:35:08,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 10: [2023-02-03 16:35:08,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 1: [2023-02-03 16:35:08,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 1: [2023-02-03 16:35:08,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 1: [2023-02-03 16:35:08,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 1: [2023-02-03 16:35:08,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_15-model_00-model_states.pt. 13: [2023-02-03 16:35:08,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,710] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,710] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 10: [2023-02-03 16:35:08,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 10: [2023-02-03 16:35:08,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 1: [2023-02-03 16:35:08,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 1: [2023-02-03 16:35:08,723] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 1: [2023-02-03 16:35:08,723] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 0: [2023-02-03 16:35:08,724] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 1: [2023-02-03 16:35:08,724] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 0: [2023-02-03 16:35:08,725] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 0: [2023-02-03 16:35:08,725] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 0: [2023-02-03 16:35:08,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 1: [2023-02-03 16:35:08,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 0: [2023-02-03 16:35:08,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 0: [2023-02-03 16:35:08,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 1: [2023-02-03 16:35:08,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 1: [2023-02-03 16:35:08,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 1: [2023-02-03 16:35:08,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 15: [2023-02-03 16:35:08,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 6: [2023-02-03 16:35:08,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 15: [2023-02-03 16:35:08,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 6: [2023-02-03 16:35:08,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 6: [2023-02-03 16:35:08,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 15: [2023-02-03 16:35:08,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 15: [2023-02-03 16:35:08,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 6: [2023-02-03 16:35:08,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 15: [2023-02-03 16:35:08,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 15: [2023-02-03 16:35:08,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 15: [2023-02-03 16:35:08,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 6: [2023-02-03 16:35:08,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 6: [2023-02-03 16:35:08,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 6: [2023-02-03 16:35:08,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 15: [2023-02-03 16:35:08,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 6: [2023-02-03 16:35:08,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 15: [2023-02-03 16:35:08,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 6: [2023-02-03 16:35:08,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 15: [2023-02-03 16:35:08,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 6: [2023-02-03 16:35:08,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 6: [2023-02-03 16:35:08,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 15: [2023-02-03 16:35:08,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 15: [2023-02-03 16:35:08,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 15: [2023-02-03 16:35:08,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 15: [2023-02-03 16:35:08,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 15: [2023-02-03 16:35:08,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 15: [2023-02-03 16:35:08,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 6: [2023-02-03 16:35:08,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 6: [2023-02-03 16:35:08,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 6: [2023-02-03 16:35:08,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 6: [2023-02-03 16:35:08,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 6: [2023-02-03 16:35:08,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 15: [2023-02-03 16:35:08,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 9: [2023-02-03 16:35:08,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 15: [2023-02-03 16:35:08,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 14: [2023-02-03 16:35:08,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 14: [2023-02-03 16:35:08,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 9: [2023-02-03 16:35:08,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 8: [2023-02-03 16:35:08,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 8: [2023-02-03 16:35:08,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 9: [2023-02-03 16:35:08,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 14: [2023-02-03 16:35:08,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 14: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 9: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 14: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 14: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 14: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 14: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 14: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 9: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 9: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 9: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 9: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 9: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 12: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 8: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 8: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 13: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 5: [2023-02-03 16:35:08,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 8: [2023-02-03 16:35:08,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 8: [2023-02-03 16:35:08,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 8: [2023-02-03 16:35:08,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 8: [2023-02-03 16:35:08,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 12: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 12: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 12: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 8: [2023-02-03 16:35:08,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 12: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 12: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 12: [2023-02-03 16:35:08,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 6: [2023-02-03 16:35:08,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 5: [2023-02-03 16:35:08,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 12: [2023-02-03 16:35:08,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 13: [2023-02-03 16:35:08,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 13: [2023-02-03 16:35:08,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 8: [2023-02-03 16:35:08,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 5: [2023-02-03 16:35:08,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 5: [2023-02-03 16:35:08,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 9: [2023-02-03 16:35:08,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 13: [2023-02-03 16:35:08,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 13: [2023-02-03 16:35:08,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 6: [2023-02-03 16:35:08,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 13: [2023-02-03 16:35:08,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 5: [2023-02-03 16:35:08,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 5: [2023-02-03 16:35:08,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 5: [2023-02-03 16:35:08,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 14: [2023-02-03 16:35:08,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 5: [2023-02-03 16:35:08,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 12: [2023-02-03 16:35:08,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 9: [2023-02-03 16:35:08,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 12: [2023-02-03 16:35:08,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 14: [2023-02-03 16:35:08,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 12: [2023-02-03 16:35:08,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 9: [2023-02-03 16:35:08,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 5: [2023-02-03 16:35:08,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 8: [2023-02-03 16:35:08,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 8: [2023-02-03 16:35:08,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 6: [2023-02-03 16:35:08,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 5: [2023-02-03 16:35:08,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 2: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 2: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 2: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 2: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 2: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 2: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 5: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 5: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 5: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 5: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 14: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 5: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 8: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 5: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 2: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 2: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 12: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 9: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 9: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 9: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 9: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 14: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 14: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 14: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 14: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 12: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 12: [2023-02-03 16:35:08,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 12: [2023-02-03 16:35:08,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 12: [2023-02-03 16:35:08,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 8: [2023-02-03 16:35:08,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 15: [2023-02-03 16:35:08,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 15: [2023-02-03 16:35:08,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 15: [2023-02-03 16:35:08,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 8: [2023-02-03 16:35:08,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 8: [2023-02-03 16:35:08,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 15: [2023-02-03 16:35:08,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 13: [2023-02-03 16:35:08,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:08,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 2: [2023-02-03 16:35:08,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 2: [2023-02-03 16:35:08,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 2: [2023-02-03 16:35:08,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 2: [2023-02-03 16:35:08,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 2: [2023-02-03 16:35:08,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 6: [2023-02-03 16:35:08,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 2: [2023-02-03 16:35:08,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 2: [2023-02-03 16:35:08,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 2: [2023-02-03 16:35:08,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 15: [2023-02-03 16:35:08,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 15: [2023-02-03 16:35:08,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 6: [2023-02-03 16:35:08,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 6: [2023-02-03 16:35:08,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 6: [2023-02-03 16:35:08,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 6: [2023-02-03 16:35:08,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 15: [2023-02-03 16:35:08,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 15: [2023-02-03 16:35:08,959] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 8: [2023-02-03 16:35:08,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 15: [2023-02-03 16:35:08,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 6: [2023-02-03 16:35:08,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 6: [2023-02-03 16:35:08,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 15: [2023-02-03 16:35:08,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 15: [2023-02-03 16:35:08,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 15: [2023-02-03 16:35:08,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 6: [2023-02-03 16:35:08,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 6: [2023-02-03 16:35:08,970] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 15: [2023-02-03 16:35:08,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 15: [2023-02-03 16:35:08,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 6: [2023-02-03 16:35:08,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 14: [2023-02-03 16:35:08,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 6: [2023-02-03 16:35:08,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 6: [2023-02-03 16:35:08,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 6: [2023-02-03 16:35:08,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 12: [2023-02-03 16:35:08,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 9: [2023-02-03 16:35:08,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 8: [2023-02-03 16:35:08,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 14: [2023-02-03 16:35:08,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 5: [2023-02-03 16:35:08,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 13: [2023-02-03 16:35:08,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 13: [2023-02-03 16:35:08,980] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 9: [2023-02-03 16:35:08,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 12: [2023-02-03 16:35:08,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 12: [2023-02-03 16:35:08,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 14: [2023-02-03 16:35:08,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 9: [2023-02-03 16:35:08,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 8: [2023-02-03 16:35:08,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 5: [2023-02-03 16:35:08,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 14: [2023-02-03 16:35:08,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 5: [2023-02-03 16:35:08,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 5: [2023-02-03 16:35:08,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 5: [2023-02-03 16:35:08,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:08,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:08,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:08,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 5: [2023-02-03 16:35:08,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 12: [2023-02-03 16:35:08,989] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 1: [2023-02-03 16:35:08,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:08,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:08,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:08,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 5: [2023-02-03 16:35:08,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:08,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 5: [2023-02-03 16:35:08,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 12: [2023-02-03 16:35:08,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 2: [2023-02-03 16:35:08,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 2: [2023-02-03 16:35:08,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:08,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 2: [2023-02-03 16:35:08,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 2: [2023-02-03 16:35:08,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 2: [2023-02-03 16:35:08,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 2: [2023-02-03 16:35:08,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 2: [2023-02-03 16:35:08,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:08,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 1: [2023-02-03 16:35:08,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 1: [2023-02-03 16:35:08,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 12: [2023-02-03 16:35:08,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:08,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 12: [2023-02-03 16:35:08,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 12: [2023-02-03 16:35:08,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:08,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 1: [2023-02-03 16:35:08,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 1: [2023-02-03 16:35:08,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 9: [2023-02-03 16:35:08,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 9: [2023-02-03 16:35:08,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 13: [2023-02-03 16:35:08,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 13: [2023-02-03 16:35:08,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 9: [2023-02-03 16:35:08,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 14: [2023-02-03 16:35:08,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 8: [2023-02-03 16:35:08,995] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 8: [2023-02-03 16:35:08,995] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 0: [2023-02-03 16:35:08,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 0: [2023-02-03 16:35:08,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 2: [2023-02-03 16:35:08,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 12: [2023-02-03 16:35:08,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 0: [2023-02-03 16:35:08,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 0: [2023-02-03 16:35:08,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 0: [2023-02-03 16:35:08,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 8: [2023-02-03 16:35:08,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 0: [2023-02-03 16:35:08,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 0: [2023-02-03 16:35:08,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 13: [2023-02-03 16:35:08,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 0: [2023-02-03 16:35:08,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 5: [2023-02-03 16:35:08,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 12: [2023-02-03 16:35:08,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 14: [2023-02-03 16:35:08,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 8: [2023-02-03 16:35:08,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 9: [2023-02-03 16:35:08,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 13: [2023-02-03 16:35:08,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 13: [2023-02-03 16:35:08,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 13: [2023-02-03 16:35:08,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 3: [2023-02-03 16:35:09,000] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 4: [2023-02-03 16:35:09,000] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 4: [2023-02-03 16:35:09,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 14: [2023-02-03 16:35:09,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 4: [2023-02-03 16:35:09,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 4: [2023-02-03 16:35:09,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 4: [2023-02-03 16:35:09,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 4: [2023-02-03 16:35:09,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 4: [2023-02-03 16:35:09,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 4: [2023-02-03 16:35:09,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 4: [2023-02-03 16:35:09,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 7: [2023-02-03 16:35:09,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 7: [2023-02-03 16:35:09,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 7: [2023-02-03 16:35:09,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 7: [2023-02-03 16:35:09,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 7: [2023-02-03 16:35:09,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 7: [2023-02-03 16:35:09,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 7: [2023-02-03 16:35:09,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 3: [2023-02-03 16:35:09,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 7: [2023-02-03 16:35:09,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 3: [2023-02-03 16:35:09,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 8: [2023-02-03 16:35:09,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 3: [2023-02-03 16:35:09,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 8: [2023-02-03 16:35:09,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 13: [2023-02-03 16:35:09,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 3: [2023-02-03 16:35:09,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 3: [2023-02-03 16:35:09,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 3: [2023-02-03 16:35:09,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 3: [2023-02-03 16:35:09,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 3: [2023-02-03 16:35:09,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 0: [2023-02-03 16:35:09,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 9: [2023-02-03 16:35:09,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 8: [2023-02-03 16:35:09,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 0: [2023-02-03 16:35:09,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 14: [2023-02-03 16:35:09,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 14: [2023-02-03 16:35:09,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 14: [2023-02-03 16:35:09,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 14: [2023-02-03 16:35:09,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 0: [2023-02-03 16:35:09,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 0: [2023-02-03 16:35:09,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 7: [2023-02-03 16:35:09,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 0: [2023-02-03 16:35:09,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 0: [2023-02-03 16:35:09,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 0: [2023-02-03 16:35:09,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 0: [2023-02-03 16:35:09,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 7: [2023-02-03 16:35:09,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 3: [2023-02-03 16:35:09,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 4: [2023-02-03 16:35:09,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 3: [2023-02-03 16:35:09,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 10: [2023-02-03 16:35:09,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 4: [2023-02-03 16:35:09,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 4: [2023-02-03 16:35:09,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 4: [2023-02-03 16:35:09,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 4: [2023-02-03 16:35:09,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 4: [2023-02-03 16:35:09,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 4: [2023-02-03 16:35:09,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 7: [2023-02-03 16:35:09,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 7: [2023-02-03 16:35:09,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 5: [2023-02-03 16:35:09,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 5: [2023-02-03 16:35:09,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 5: [2023-02-03 16:35:09,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 12: [2023-02-03 16:35:09,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 5: [2023-02-03 16:35:09,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 7: [2023-02-03 16:35:09,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 7: [2023-02-03 16:35:09,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 7: [2023-02-03 16:35:09,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 7: [2023-02-03 16:35:09,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 9: [2023-02-03 16:35:09,006] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 9: [2023-02-03 16:35:09,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 12: [2023-02-03 16:35:09,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 12: [2023-02-03 16:35:09,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 5: [2023-02-03 16:35:09,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 9: [2023-02-03 16:35:09,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 10: [2023-02-03 16:35:09,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 3: [2023-02-03 16:35:09,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:09,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 10: [2023-02-03 16:35:09,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 3: [2023-02-03 16:35:09,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 2: [2023-02-03 16:35:09,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 3: [2023-02-03 16:35:09,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 3: [2023-02-03 16:35:09,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 3: [2023-02-03 16:35:09,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 10: [2023-02-03 16:35:09,009] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 5: [2023-02-03 16:35:09,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 10: [2023-02-03 16:35:09,009] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 10: [2023-02-03 16:35:09,009] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 10: [2023-02-03 16:35:09,009] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 10: [2023-02-03 16:35:09,009] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 10: [2023-02-03 16:35:09,009] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 2: [2023-02-03 16:35:09,010] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 2: [2023-02-03 16:35:09,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 2: [2023-02-03 16:35:09,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 10: [2023-02-03 16:35:09,010] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 5: [2023-02-03 16:35:09,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 12: [2023-02-03 16:35:09,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 12: [2023-02-03 16:35:09,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 2: [2023-02-03 16:35:09,013] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 10: [2023-02-03 16:35:09,013] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 9: [2023-02-03 16:35:09,013] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 8: [2023-02-03 16:35:09,013] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 2: [2023-02-03 16:35:09,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 2: [2023-02-03 16:35:09,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 10: [2023-02-03 16:35:09,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 10: [2023-02-03 16:35:09,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 10: [2023-02-03 16:35:09,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 10: [2023-02-03 16:35:09,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 10: [2023-02-03 16:35:09,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 8: [2023-02-03 16:35:09,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 14: [2023-02-03 16:35:09,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 11: [2023-02-03 16:35:09,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 11: [2023-02-03 16:35:09,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 11: [2023-02-03 16:35:09,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 11: [2023-02-03 16:35:09,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 11: [2023-02-03 16:35:09,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 11: [2023-02-03 16:35:09,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 11: [2023-02-03 16:35:09,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 11: [2023-02-03 16:35:09,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 11: [2023-02-03 16:35:09,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 11: [2023-02-03 16:35:09,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 2: [2023-02-03 16:35:09,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:09,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 9: [2023-02-03 16:35:09,018] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,018] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 11: [2023-02-03 16:35:09,018] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 11: [2023-02-03 16:35:09,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 11: [2023-02-03 16:35:09,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt... 13: [2023-02-03 16:35:09,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 14: [2023-02-03 16:35:09,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 8: [2023-02-03 16:35:09,020] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 12: [2023-02-03 16:35:09,021] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 8: [2023-02-03 16:35:09,021] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 13: [2023-02-03 16:35:09,022] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 13: [2023-02-03 16:35:09,022] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 8: [2023-02-03 16:35:09,022] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 13: [2023-02-03 16:35:09,022] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 14: [2023-02-03 16:35:09,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 9: [2023-02-03 16:35:09,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 14: [2023-02-03 16:35:09,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 14: [2023-02-03 16:35:09,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 8: [2023-02-03 16:35:09,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 9: [2023-02-03 16:35:09,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 9: [2023-02-03 16:35:09,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 13: [2023-02-03 16:35:09,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 1: [2023-02-03 16:35:09,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:09,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:09,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:09,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 7: [2023-02-03 16:35:09,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 4: [2023-02-03 16:35:09,031] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:09,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:09,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:09,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:09,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 3: [2023-02-03 16:35:09,037] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 3: [2023-02-03 16:35:09,037] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 3: [2023-02-03 16:35:09,037] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 10: [2023-02-03 16:35:09,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 7: [2023-02-03 16:35:09,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 4: [2023-02-03 16:35:09,040] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 7: [2023-02-03 16:35:09,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 4: [2023-02-03 16:35:09,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 4: [2023-02-03 16:35:09,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:09,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 1: [2023-02-03 16:35:09,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 1: [2023-02-03 16:35:09,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 7: [2023-02-03 16:35:09,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 10: [2023-02-03 16:35:09,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 4: [2023-02-03 16:35:09,047] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 4: [2023-02-03 16:35:09,047] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:09,047] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 3: [2023-02-03 16:35:09,048] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 3: [2023-02-03 16:35:09,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 3: [2023-02-03 16:35:09,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 3: [2023-02-03 16:35:09,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 7: [2023-02-03 16:35:09,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 7: [2023-02-03 16:35:09,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 7: [2023-02-03 16:35:09,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 7: [2023-02-03 16:35:09,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 4: [2023-02-03 16:35:09,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:09,051] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 3: [2023-02-03 16:35:09,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 10: [2023-02-03 16:35:09,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 4: [2023-02-03 16:35:09,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 1: [2023-02-03 16:35:09,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 1: [2023-02-03 16:35:09,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 7: [2023-02-03 16:35:09,052] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 1: [2023-02-03 16:35:09,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 3: [2023-02-03 16:35:09,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 3: [2023-02-03 16:35:09,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 0: [2023-02-03 16:35:09,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 11: [2023-02-03 16:35:09,054] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 11: [2023-02-03 16:35:09,054] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 11: [2023-02-03 16:35:09,054] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 10: [2023-02-03 16:35:09,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 4: [2023-02-03 16:35:09,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 0: [2023-02-03 16:35:09,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 7: [2023-02-03 16:35:09,056] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 3: [2023-02-03 16:35:09,056] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 4: [2023-02-03 16:35:09,056] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,058] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 10: [2023-02-03 16:35:09,059] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 10: [2023-02-03 16:35:09,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 10: [2023-02-03 16:35:09,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 10: [2023-02-03 16:35:09,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 10: [2023-02-03 16:35:09,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 10: [2023-02-03 16:35:09,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 4: [2023-02-03 16:35:09,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 11: [2023-02-03 16:35:09,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 4: [2023-02-03 16:35:09,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 3: [2023-02-03 16:35:09,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 0: [2023-02-03 16:35:09,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 3: [2023-02-03 16:35:09,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 0: [2023-02-03 16:35:09,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 0: [2023-02-03 16:35:09,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 0: [2023-02-03 16:35:09,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 0: [2023-02-03 16:35:09,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 7: [2023-02-03 16:35:09,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 7: [2023-02-03 16:35:09,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 3: [2023-02-03 16:35:09,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 3: [2023-02-03 16:35:09,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 4: [2023-02-03 16:35:09,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 4: [2023-02-03 16:35:09,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 4: [2023-02-03 16:35:09,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 0: [2023-02-03 16:35:09,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 0: [2023-02-03 16:35:09,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 7: [2023-02-03 16:35:09,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 10: [2023-02-03 16:35:09,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 3: [2023-02-03 16:35:09,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 7: [2023-02-03 16:35:09,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 7: [2023-02-03 16:35:09,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 7: [2023-02-03 16:35:09,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 4: [2023-02-03 16:35:09,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 0: [2023-02-03 16:35:09,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_16-model_00-model_states.pt. 10: [2023-02-03 16:35:09,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 10: [2023-02-03 16:35:09,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 10: [2023-02-03 16:35:09,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 10: [2023-02-03 16:35:09,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 10: [2023-02-03 16:35:09,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 0: [2023-02-03 16:35:09,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 0: [2023-02-03 16:35:09,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 0: [2023-02-03 16:35:09,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 0: [2023-02-03 16:35:09,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 0: [2023-02-03 16:35:09,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 0: [2023-02-03 16:35:09,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 5: [2023-02-03 16:35:09,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 5: [2023-02-03 16:35:09,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 5: [2023-02-03 16:35:09,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 5: [2023-02-03 16:35:09,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 5: [2023-02-03 16:35:09,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 5: [2023-02-03 16:35:09,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 5: [2023-02-03 16:35:09,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 5: [2023-02-03 16:35:09,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 5: [2023-02-03 16:35:09,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 5: [2023-02-03 16:35:09,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 5: [2023-02-03 16:35:09,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 5: [2023-02-03 16:35:09,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 5: [2023-02-03 16:35:09,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 5: [2023-02-03 16:35:09,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 5: [2023-02-03 16:35:09,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 5: [2023-02-03 16:35:09,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 1: [2023-02-03 16:35:09,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 1: [2023-02-03 16:35:09,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 1: [2023-02-03 16:35:09,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 1: [2023-02-03 16:35:09,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 1: [2023-02-03 16:35:09,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 1: [2023-02-03 16:35:09,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 1: [2023-02-03 16:35:09,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 1: [2023-02-03 16:35:09,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 1: [2023-02-03 16:35:09,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 1: [2023-02-03 16:35:09,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 1: [2023-02-03 16:35:09,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 1: [2023-02-03 16:35:09,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 1: [2023-02-03 16:35:09,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 1: [2023-02-03 16:35:09,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 1: [2023-02-03 16:35:09,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 1: [2023-02-03 16:35:09,235] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 4: [2023-02-03 16:35:09,247] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 4: [2023-02-03 16:35:09,247] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 4: [2023-02-03 16:35:09,247] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 4: [2023-02-03 16:35:09,247] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 4: [2023-02-03 16:35:09,247] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 4: [2023-02-03 16:35:09,247] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 4: [2023-02-03 16:35:09,247] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 4: [2023-02-03 16:35:09,247] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 4: [2023-02-03 16:35:09,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 4: [2023-02-03 16:35:09,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 4: [2023-02-03 16:35:09,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 4: [2023-02-03 16:35:09,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 4: [2023-02-03 16:35:09,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 4: [2023-02-03 16:35:09,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 11: [2023-02-03 16:35:09,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 4: [2023-02-03 16:35:09,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 4: [2023-02-03 16:35:09,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 11: [2023-02-03 16:35:09,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 11: [2023-02-03 16:35:09,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 11: [2023-02-03 16:35:09,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 11: [2023-02-03 16:35:09,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 11: [2023-02-03 16:35:09,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 14: [2023-02-03 16:35:09,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 8: [2023-02-03 16:35:09,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 11: [2023-02-03 16:35:09,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 8: [2023-02-03 16:35:09,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 11: [2023-02-03 16:35:09,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 8: [2023-02-03 16:35:09,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 8: [2023-02-03 16:35:09,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 8: [2023-02-03 16:35:09,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 8: [2023-02-03 16:35:09,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 8: [2023-02-03 16:35:09,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 11: [2023-02-03 16:35:09,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 14: [2023-02-03 16:35:09,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 8: [2023-02-03 16:35:09,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 14: [2023-02-03 16:35:09,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 11: [2023-02-03 16:35:09,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 2: [2023-02-03 16:35:09,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 14: [2023-02-03 16:35:09,255] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 14: [2023-02-03 16:35:09,255] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 8: [2023-02-03 16:35:09,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 14: [2023-02-03 16:35:09,255] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 8: [2023-02-03 16:35:09,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 14: [2023-02-03 16:35:09,255] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 14: [2023-02-03 16:35:09,255] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 14: [2023-02-03 16:35:09,255] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 14: [2023-02-03 16:35:09,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 15: [2023-02-03 16:35:09,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 2: [2023-02-03 16:35:09,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 2: [2023-02-03 16:35:09,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 15: [2023-02-03 16:35:09,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 15: [2023-02-03 16:35:09,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 15: [2023-02-03 16:35:09,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 2: [2023-02-03 16:35:09,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 2: [2023-02-03 16:35:09,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 2: [2023-02-03 16:35:09,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 15: [2023-02-03 16:35:09,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 15: [2023-02-03 16:35:09,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 15: [2023-02-03 16:35:09,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 15: [2023-02-03 16:35:09,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 12: [2023-02-03 16:35:09,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 12: [2023-02-03 16:35:09,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 12: [2023-02-03 16:35:09,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 12: [2023-02-03 16:35:09,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 12: [2023-02-03 16:35:09,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 12: [2023-02-03 16:35:09,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 12: [2023-02-03 16:35:09,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 14: [2023-02-03 16:35:09,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 12: [2023-02-03 16:35:09,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 15: [2023-02-03 16:35:09,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 8: [2023-02-03 16:35:09,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 3: [2023-02-03 16:35:09,259] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 3: [2023-02-03 16:35:09,259] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 12: [2023-02-03 16:35:09,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 15: [2023-02-03 16:35:09,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 0: [2023-02-03 16:35:09,259] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 0: [2023-02-03 16:35:09,259] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 3: [2023-02-03 16:35:09,259] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 0: [2023-02-03 16:35:09,259] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 0: [2023-02-03 16:35:09,259] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 0: [2023-02-03 16:35:09,259] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 0: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 0: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 3: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 0: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 3: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 3: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 3: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 3: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 12: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 8: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 12: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 3: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 3: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 15: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 12: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 15: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 15: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 14: [2023-02-03 16:35:09,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 15: [2023-02-03 16:35:09,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 8: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 8: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 8: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 8: [2023-02-03 16:35:09,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 15: [2023-02-03 16:35:09,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 15: [2023-02-03 16:35:09,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 5: [2023-02-03 16:35:09,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 14: [2023-02-03 16:35:09,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 14: [2023-02-03 16:35:09,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 14: [2023-02-03 16:35:09,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 14: [2023-02-03 16:35:09,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 3: [2023-02-03 16:35:09,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 5: [2023-02-03 16:35:09,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 5: [2023-02-03 16:35:09,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 5: [2023-02-03 16:35:09,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 3: [2023-02-03 16:35:09,262] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 3: [2023-02-03 16:35:09,262] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 12: [2023-02-03 16:35:09,262] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 13: [2023-02-03 16:35:09,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 0: [2023-02-03 16:35:09,262] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 12: [2023-02-03 16:35:09,262] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 12: [2023-02-03 16:35:09,262] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 12: [2023-02-03 16:35:09,262] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 6: [2023-02-03 16:35:09,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 6: [2023-02-03 16:35:09,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 6: [2023-02-03 16:35:09,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 6: [2023-02-03 16:35:09,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 5: [2023-02-03 16:35:09,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 5: [2023-02-03 16:35:09,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 5: [2023-02-03 16:35:09,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 5: [2023-02-03 16:35:09,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 6: [2023-02-03 16:35:09,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 6: [2023-02-03 16:35:09,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 6: [2023-02-03 16:35:09,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 6: [2023-02-03 16:35:09,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 3: [2023-02-03 16:35:09,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 3: [2023-02-03 16:35:09,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 6: [2023-02-03 16:35:09,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 13: [2023-02-03 16:35:09,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 7: [2023-02-03 16:35:09,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 7: [2023-02-03 16:35:09,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 7: [2023-02-03 16:35:09,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 7: [2023-02-03 16:35:09,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 3: [2023-02-03 16:35:09,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 6: [2023-02-03 16:35:09,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 7: [2023-02-03 16:35:09,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 7: [2023-02-03 16:35:09,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 7: [2023-02-03 16:35:09,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 6: [2023-02-03 16:35:09,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 13: [2023-02-03 16:35:09,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 7: [2023-02-03 16:35:09,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 0: [2023-02-03 16:35:09,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 6: [2023-02-03 16:35:09,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 6: [2023-02-03 16:35:09,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 0: [2023-02-03 16:35:09,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 0: [2023-02-03 16:35:09,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 0: [2023-02-03 16:35:09,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 0: [2023-02-03 16:35:09,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 13: [2023-02-03 16:35:09,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 13: [2023-02-03 16:35:09,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 0: [2023-02-03 16:35:09,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 13: [2023-02-03 16:35:09,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 13: [2023-02-03 16:35:09,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 13: [2023-02-03 16:35:09,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 13: [2023-02-03 16:35:09,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 6: [2023-02-03 16:35:09,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 6: [2023-02-03 16:35:09,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 6: [2023-02-03 16:35:09,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 7: [2023-02-03 16:35:09,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 7: [2023-02-03 16:35:09,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 7: [2023-02-03 16:35:09,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 13: [2023-02-03 16:35:09,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 0: [2023-02-03 16:35:09,268] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 7: [2023-02-03 16:35:09,268] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 1: [2023-02-03 16:35:09,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 1: [2023-02-03 16:35:09,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 7: [2023-02-03 16:35:09,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 7: [2023-02-03 16:35:09,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 7: [2023-02-03 16:35:09,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 1: [2023-02-03 16:35:09,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 1: [2023-02-03 16:35:09,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 7: [2023-02-03 16:35:09,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 13: [2023-02-03 16:35:09,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 1: [2023-02-03 16:35:09,271] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 1: [2023-02-03 16:35:09,271] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 1: [2023-02-03 16:35:09,271] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 13: [2023-02-03 16:35:09,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 13: [2023-02-03 16:35:09,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 13: [2023-02-03 16:35:09,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 13: [2023-02-03 16:35:09,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 13: [2023-02-03 16:35:09,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 1: [2023-02-03 16:35:09,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 4: [2023-02-03 16:35:09,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 4: [2023-02-03 16:35:09,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 5: [2023-02-03 16:35:09,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 5: [2023-02-03 16:35:09,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 5: [2023-02-03 16:35:09,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 5: [2023-02-03 16:35:09,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 1: [2023-02-03 16:35:09,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 1: [2023-02-03 16:35:09,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 1: [2023-02-03 16:35:09,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 14: [2023-02-03 16:35:09,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 5: [2023-02-03 16:35:09,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 8: [2023-02-03 16:35:09,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 8: [2023-02-03 16:35:09,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 5: [2023-02-03 16:35:09,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 5: [2023-02-03 16:35:09,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 5: [2023-02-03 16:35:09,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 11: [2023-02-03 16:35:09,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 1: [2023-02-03 16:35:09,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 14: [2023-02-03 16:35:09,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 1: [2023-02-03 16:35:09,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 11: [2023-02-03 16:35:09,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 4: [2023-02-03 16:35:09,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 1: [2023-02-03 16:35:09,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 1: [2023-02-03 16:35:09,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 3: [2023-02-03 16:35:09,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 11: [2023-02-03 16:35:09,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 15: [2023-02-03 16:35:09,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 4: [2023-02-03 16:35:09,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 4: [2023-02-03 16:35:09,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 6: [2023-02-03 16:35:09,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 4: [2023-02-03 16:35:09,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 4: [2023-02-03 16:35:09,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 4: [2023-02-03 16:35:09,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 9: [2023-02-03 16:35:09,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 9: [2023-02-03 16:35:09,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 12: [2023-02-03 16:35:09,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 12: [2023-02-03 16:35:09,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 11: [2023-02-03 16:35:09,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 9: [2023-02-03 16:35:09,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 3: [2023-02-03 16:35:09,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 9: [2023-02-03 16:35:09,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 9: [2023-02-03 16:35:09,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 9: [2023-02-03 16:35:09,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 9: [2023-02-03 16:35:09,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 9: [2023-02-03 16:35:09,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 9: [2023-02-03 16:35:09,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 9: [2023-02-03 16:35:09,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 12: [2023-02-03 16:35:09,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 4: [2023-02-03 16:35:09,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 15: [2023-02-03 16:35:09,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 15: [2023-02-03 16:35:09,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 1: [2023-02-03 16:35:09,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 2: [2023-02-03 16:35:09,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 13: [2023-02-03 16:35:09,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 6: [2023-02-03 16:35:09,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 15: [2023-02-03 16:35:09,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 8: [2023-02-03 16:35:09,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 10: [2023-02-03 16:35:09,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 10: [2023-02-03 16:35:09,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 10: [2023-02-03 16:35:09,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 12: [2023-02-03 16:35:09,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 8: [2023-02-03 16:35:09,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 7: [2023-02-03 16:35:09,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 7: [2023-02-03 16:35:09,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 10: [2023-02-03 16:35:09,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 10: [2023-02-03 16:35:09,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 10: [2023-02-03 16:35:09,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 10: [2023-02-03 16:35:09,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 9: [2023-02-03 16:35:09,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 9: [2023-02-03 16:35:09,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 9: [2023-02-03 16:35:09,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 9: [2023-02-03 16:35:09,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 9: [2023-02-03 16:35:09,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 9: [2023-02-03 16:35:09,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 10: [2023-02-03 16:35:09,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 14: [2023-02-03 16:35:09,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 14: [2023-02-03 16:35:09,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 3: [2023-02-03 16:35:09,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 6: [2023-02-03 16:35:09,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 15: [2023-02-03 16:35:09,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 15: [2023-02-03 16:35:09,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 15: [2023-02-03 16:35:09,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 0: [2023-02-03 16:35:09,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 6: [2023-02-03 16:35:09,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 11: [2023-02-03 16:35:09,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 11: [2023-02-03 16:35:09,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 10: [2023-02-03 16:35:09,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 10: [2023-02-03 16:35:09,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 4: [2023-02-03 16:35:09,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 11: [2023-02-03 16:35:09,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 12: [2023-02-03 16:35:09,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 15: [2023-02-03 16:35:09,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 7: [2023-02-03 16:35:09,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 8: [2023-02-03 16:35:09,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 11: [2023-02-03 16:35:09,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 10: [2023-02-03 16:35:09,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 14: [2023-02-03 16:35:09,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 6: [2023-02-03 16:35:09,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 11: [2023-02-03 16:35:09,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 10: [2023-02-03 16:35:09,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 10: [2023-02-03 16:35:09,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 10: [2023-02-03 16:35:09,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 10: [2023-02-03 16:35:09,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt... 3: [2023-02-03 16:35:09,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 11: [2023-02-03 16:35:09,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 11: [2023-02-03 16:35:09,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 7: [2023-02-03 16:35:09,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 12: [2023-02-03 16:35:09,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 12: [2023-02-03 16:35:09,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 12: [2023-02-03 16:35:09,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 8: [2023-02-03 16:35:09,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 8: [2023-02-03 16:35:09,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 8: [2023-02-03 16:35:09,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 8: [2023-02-03 16:35:09,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 12: [2023-02-03 16:35:09,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 2: [2023-02-03 16:35:09,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 4: [2023-02-03 16:35:09,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 3: [2023-02-03 16:35:09,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 6: [2023-02-03 16:35:09,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 4: [2023-02-03 16:35:09,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 11: [2023-02-03 16:35:09,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 13: [2023-02-03 16:35:09,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 12: [2023-02-03 16:35:09,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 6: [2023-02-03 16:35:09,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 4: [2023-02-03 16:35:09,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 12: [2023-02-03 16:35:09,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 15: [2023-02-03 16:35:09,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 15: [2023-02-03 16:35:09,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 14: [2023-02-03 16:35:09,312] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 14: [2023-02-03 16:35:09,312] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 14: [2023-02-03 16:35:09,312] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 14: [2023-02-03 16:35:09,312] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 15: [2023-02-03 16:35:09,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 4: [2023-02-03 16:35:09,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 4: [2023-02-03 16:35:09,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 14: [2023-02-03 16:35:09,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 7: [2023-02-03 16:35:09,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 7: [2023-02-03 16:35:09,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 7: [2023-02-03 16:35:09,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 4: [2023-02-03 16:35:09,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 7: [2023-02-03 16:35:09,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 13: [2023-02-03 16:35:09,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 7: [2023-02-03 16:35:09,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 9: [2023-02-03 16:35:09,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 8: [2023-02-03 16:35:09,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 14: [2023-02-03 16:35:09,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 6: [2023-02-03 16:35:09,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 6: [2023-02-03 16:35:09,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 6: [2023-02-03 16:35:09,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 12: [2023-02-03 16:35:09,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 7: [2023-02-03 16:35:09,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 7: [2023-02-03 16:35:09,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 15: [2023-02-03 16:35:09,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 11: [2023-02-03 16:35:09,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 0: [2023-02-03 16:35:09,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 11: [2023-02-03 16:35:09,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 2: [2023-02-03 16:35:09,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 11: [2023-02-03 16:35:09,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 6: [2023-02-03 16:35:09,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 2: [2023-02-03 16:35:09,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 15: [2023-02-03 16:35:09,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 2: [2023-02-03 16:35:09,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 15: [2023-02-03 16:35:09,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 15: [2023-02-03 16:35:09,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 6: [2023-02-03 16:35:09,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 0: [2023-02-03 16:35:09,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 15: [2023-02-03 16:35:09,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 12: [2023-02-03 16:35:09,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 6: [2023-02-03 16:35:09,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 2: [2023-02-03 16:35:09,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 12: [2023-02-03 16:35:09,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 8: [2023-02-03 16:35:09,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 12: [2023-02-03 16:35:09,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 13: [2023-02-03 16:35:09,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 7: [2023-02-03 16:35:09,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 12: [2023-02-03 16:35:09,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 11: [2023-02-03 16:35:09,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 3: [2023-02-03 16:35:09,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 0: [2023-02-03 16:35:09,327] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 0: [2023-02-03 16:35:09,327] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 0: [2023-02-03 16:35:09,327] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 0: [2023-02-03 16:35:09,327] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 0: [2023-02-03 16:35:09,327] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 14: [2023-02-03 16:35:09,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 8: [2023-02-03 16:35:09,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 9: [2023-02-03 16:35:09,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 8: [2023-02-03 16:35:09,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 8: [2023-02-03 16:35:09,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 8: [2023-02-03 16:35:09,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 7: [2023-02-03 16:35:09,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 7: [2023-02-03 16:35:09,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 7: [2023-02-03 16:35:09,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 7: [2023-02-03 16:35:09,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 6: [2023-02-03 16:35:09,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 14: [2023-02-03 16:35:09,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 9: [2023-02-03 16:35:09,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 6: [2023-02-03 16:35:09,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 6: [2023-02-03 16:35:09,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 14: [2023-02-03 16:35:09,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 14: [2023-02-03 16:35:09,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 14: [2023-02-03 16:35:09,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 8: [2023-02-03 16:35:09,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 0: [2023-02-03 16:35:09,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 0: [2023-02-03 16:35:09,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 9: [2023-02-03 16:35:09,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 9: [2023-02-03 16:35:09,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 0: [2023-02-03 16:35:09,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 9: [2023-02-03 16:35:09,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 0: [2023-02-03 16:35:09,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 0: [2023-02-03 16:35:09,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 0: [2023-02-03 16:35:09,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 0: [2023-02-03 16:35:09,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 9: [2023-02-03 16:35:09,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 9: [2023-02-03 16:35:09,351] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 10: [2023-02-03 16:35:09,351] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 10: [2023-02-03 16:35:09,351] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 10: [2023-02-03 16:35:09,351] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 10: [2023-02-03 16:35:09,351] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 10: [2023-02-03 16:35:09,351] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 9: [2023-02-03 16:35:09,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 10: [2023-02-03 16:35:09,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 9: [2023-02-03 16:35:09,354] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_17-model_00-model_states.pt. 0: [2023-02-03 16:35:09,364] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 9: [2023-02-03 16:35:09,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 9: [2023-02-03 16:35:09,367] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 9: [2023-02-03 16:35:09,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 9: [2023-02-03 16:35:09,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,371] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,371] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 9: [2023-02-03 16:35:09,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 9: [2023-02-03 16:35:09,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,373] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 13: [2023-02-03 16:35:09,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,561] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 13: [2023-02-03 16:35:09,561] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 13: [2023-02-03 16:35:09,561] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 13: [2023-02-03 16:35:09,562] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 13: [2023-02-03 16:35:09,562] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 13: [2023-02-03 16:35:09,562] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 13: [2023-02-03 16:35:09,562] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 13: [2023-02-03 16:35:09,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,593] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 13: [2023-02-03 16:35:09,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 13: [2023-02-03 16:35:09,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 13: [2023-02-03 16:35:09,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 13: [2023-02-03 16:35:09,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 13: [2023-02-03 16:35:09,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 13: [2023-02-03 16:35:09,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 8: [2023-02-03 16:35:09,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 8: [2023-02-03 16:35:09,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 13: [2023-02-03 16:35:09,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 13: [2023-02-03 16:35:09,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 8: [2023-02-03 16:35:09,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 8: [2023-02-03 16:35:09,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 8: [2023-02-03 16:35:09,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 13: [2023-02-03 16:35:09,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 8: [2023-02-03 16:35:09,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 8: [2023-02-03 16:35:09,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 8: [2023-02-03 16:35:09,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 8: [2023-02-03 16:35:09,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 8: [2023-02-03 16:35:09,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 8: [2023-02-03 16:35:09,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 1: [2023-02-03 16:35:09,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 1: [2023-02-03 16:35:09,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 1: [2023-02-03 16:35:09,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 8: [2023-02-03 16:35:09,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 1: [2023-02-03 16:35:09,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 1: [2023-02-03 16:35:09,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 1: [2023-02-03 16:35:09,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 1: [2023-02-03 16:35:09,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 8: [2023-02-03 16:35:09,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 8: [2023-02-03 16:35:09,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 8: [2023-02-03 16:35:09,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 8: [2023-02-03 16:35:09,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 2: [2023-02-03 16:35:09,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 2: [2023-02-03 16:35:09,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 2: [2023-02-03 16:35:09,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 2: [2023-02-03 16:35:09,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 2: [2023-02-03 16:35:09,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 2: [2023-02-03 16:35:09,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 2: [2023-02-03 16:35:09,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 2: [2023-02-03 16:35:09,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 4: [2023-02-03 16:35:09,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 6: [2023-02-03 16:35:09,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 6: [2023-02-03 16:35:09,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 6: [2023-02-03 16:35:09,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 6: [2023-02-03 16:35:09,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 5: [2023-02-03 16:35:09,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 4: [2023-02-03 16:35:09,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 4: [2023-02-03 16:35:09,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 4: [2023-02-03 16:35:09,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 4: [2023-02-03 16:35:09,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 4: [2023-02-03 16:35:09,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 4: [2023-02-03 16:35:09,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 6: [2023-02-03 16:35:09,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 4: [2023-02-03 16:35:09,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 6: [2023-02-03 16:35:09,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 2: [2023-02-03 16:35:09,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 5: [2023-02-03 16:35:09,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 5: [2023-02-03 16:35:09,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 5: [2023-02-03 16:35:09,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 6: [2023-02-03 16:35:09,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 5: [2023-02-03 16:35:09,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 5: [2023-02-03 16:35:09,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 5: [2023-02-03 16:35:09,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 6: [2023-02-03 16:35:09,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 5: [2023-02-03 16:35:09,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 2: [2023-02-03 16:35:09,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 2: [2023-02-03 16:35:09,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 2: [2023-02-03 16:35:09,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 4: [2023-02-03 16:35:09,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 5: [2023-02-03 16:35:09,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 4: [2023-02-03 16:35:09,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 2: [2023-02-03 16:35:09,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 5: [2023-02-03 16:35:09,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 2: [2023-02-03 16:35:09,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 2: [2023-02-03 16:35:09,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 2: [2023-02-03 16:35:09,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 6: [2023-02-03 16:35:09,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 4: [2023-02-03 16:35:09,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 6: [2023-02-03 16:35:09,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 6: [2023-02-03 16:35:09,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 5: [2023-02-03 16:35:09,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 5: [2023-02-03 16:35:09,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 4: [2023-02-03 16:35:09,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 5: [2023-02-03 16:35:09,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 5: [2023-02-03 16:35:09,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 4: [2023-02-03 16:35:09,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 7: [2023-02-03 16:35:09,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 8: [2023-02-03 16:35:09,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 7: [2023-02-03 16:35:09,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 7: [2023-02-03 16:35:09,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 7: [2023-02-03 16:35:09,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 4: [2023-02-03 16:35:09,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 4: [2023-02-03 16:35:09,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 4: [2023-02-03 16:35:09,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 5: [2023-02-03 16:35:09,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 5: [2023-02-03 16:35:09,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 7: [2023-02-03 16:35:09,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 6: [2023-02-03 16:35:09,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 7: [2023-02-03 16:35:09,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 7: [2023-02-03 16:35:09,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 7: [2023-02-03 16:35:09,633] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 11: [2023-02-03 16:35:09,633] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 15: [2023-02-03 16:35:09,633] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 6: [2023-02-03 16:35:09,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 6: [2023-02-03 16:35:09,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 6: [2023-02-03 16:35:09,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 6: [2023-02-03 16:35:09,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 15: [2023-02-03 16:35:09,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 15: [2023-02-03 16:35:09,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 15: [2023-02-03 16:35:09,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 11: [2023-02-03 16:35:09,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 11: [2023-02-03 16:35:09,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 11: [2023-02-03 16:35:09,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 11: [2023-02-03 16:35:09,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 11: [2023-02-03 16:35:09,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 11: [2023-02-03 16:35:09,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 15: [2023-02-03 16:35:09,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 15: [2023-02-03 16:35:09,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 15: [2023-02-03 16:35:09,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 11: [2023-02-03 16:35:09,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 15: [2023-02-03 16:35:09,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 7: [2023-02-03 16:35:09,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 7: [2023-02-03 16:35:09,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 7: [2023-02-03 16:35:09,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 11: [2023-02-03 16:35:09,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 15: [2023-02-03 16:35:09,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 3: [2023-02-03 16:35:09,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 3: [2023-02-03 16:35:09,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 15: [2023-02-03 16:35:09,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 7: [2023-02-03 16:35:09,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 15: [2023-02-03 16:35:09,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 3: [2023-02-03 16:35:09,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 15: [2023-02-03 16:35:09,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 3: [2023-02-03 16:35:09,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 11: [2023-02-03 16:35:09,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 11: [2023-02-03 16:35:09,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 11: [2023-02-03 16:35:09,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 7: [2023-02-03 16:35:09,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 7: [2023-02-03 16:35:09,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 7: [2023-02-03 16:35:09,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 7: [2023-02-03 16:35:09,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 15: [2023-02-03 16:35:09,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 15: [2023-02-03 16:35:09,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 15: [2023-02-03 16:35:09,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 15: [2023-02-03 16:35:09,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 11: [2023-02-03 16:35:09,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 11: [2023-02-03 16:35:09,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 11: [2023-02-03 16:35:09,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 11: [2023-02-03 16:35:09,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 13: [2023-02-03 16:35:09,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 13: [2023-02-03 16:35:09,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 3: [2023-02-03 16:35:09,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 13: [2023-02-03 16:35:09,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 3: [2023-02-03 16:35:09,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 8: [2023-02-03 16:35:09,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 2: [2023-02-03 16:35:09,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 5: [2023-02-03 16:35:09,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 8: [2023-02-03 16:35:09,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 4: [2023-02-03 16:35:09,663] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,663] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,663] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,663] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 8: [2023-02-03 16:35:09,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 11: [2023-02-03 16:35:09,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 6: [2023-02-03 16:35:09,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 6: [2023-02-03 16:35:09,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 6: [2023-02-03 16:35:09,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 15: [2023-02-03 16:35:09,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 4: [2023-02-03 16:35:09,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 15: [2023-02-03 16:35:09,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 5: [2023-02-03 16:35:09,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 7: [2023-02-03 16:35:09,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 7: [2023-02-03 16:35:09,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 7: [2023-02-03 16:35:09,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 4: [2023-02-03 16:35:09,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 4: [2023-02-03 16:35:09,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 4: [2023-02-03 16:35:09,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 15: [2023-02-03 16:35:09,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 15: [2023-02-03 16:35:09,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 2: [2023-02-03 16:35:09,670] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 2: [2023-02-03 16:35:09,670] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 5: [2023-02-03 16:35:09,670] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 5: [2023-02-03 16:35:09,670] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 5: [2023-02-03 16:35:09,670] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 5: [2023-02-03 16:35:09,670] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 2: [2023-02-03 16:35:09,671] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 2: [2023-02-03 16:35:09,671] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 6: [2023-02-03 16:35:09,671] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 8: [2023-02-03 16:35:09,671] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 8: [2023-02-03 16:35:09,671] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 2: [2023-02-03 16:35:09,671] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 2: [2023-02-03 16:35:09,671] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 3: [2023-02-03 16:35:09,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 3: [2023-02-03 16:35:09,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 5: [2023-02-03 16:35:09,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 5: [2023-02-03 16:35:09,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 3: [2023-02-03 16:35:09,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 4: [2023-02-03 16:35:09,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 4: [2023-02-03 16:35:09,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 4: [2023-02-03 16:35:09,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 11: [2023-02-03 16:35:09,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 2: [2023-02-03 16:35:09,674] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 9: [2023-02-03 16:35:09,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 7: [2023-02-03 16:35:09,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 6: [2023-02-03 16:35:09,676] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 6: [2023-02-03 16:35:09,676] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,676] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 9: [2023-02-03 16:35:09,677] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 8: [2023-02-03 16:35:09,677] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 2: [2023-02-03 16:35:09,677] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 9: [2023-02-03 16:35:09,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 9: [2023-02-03 16:35:09,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 9: [2023-02-03 16:35:09,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 9: [2023-02-03 16:35:09,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 9: [2023-02-03 16:35:09,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 9: [2023-02-03 16:35:09,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 9: [2023-02-03 16:35:09,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 9: [2023-02-03 16:35:09,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 7: [2023-02-03 16:35:09,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 7: [2023-02-03 16:35:09,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 7: [2023-02-03 16:35:09,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 7: [2023-02-03 16:35:09,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 11: [2023-02-03 16:35:09,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 6: [2023-02-03 16:35:09,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 11: [2023-02-03 16:35:09,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 11: [2023-02-03 16:35:09,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 11: [2023-02-03 16:35:09,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 11: [2023-02-03 16:35:09,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 11: [2023-02-03 16:35:09,680] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 1: [2023-02-03 16:35:09,680] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 15: [2023-02-03 16:35:09,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 15: [2023-02-03 16:35:09,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 15: [2023-02-03 16:35:09,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 11: [2023-02-03 16:35:09,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 3: [2023-02-03 16:35:09,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 3: [2023-02-03 16:35:09,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 9: [2023-02-03 16:35:09,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 4: [2023-02-03 16:35:09,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 5: [2023-02-03 16:35:09,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 8: [2023-02-03 16:35:09,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 9: [2023-02-03 16:35:09,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 6: [2023-02-03 16:35:09,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 3: [2023-02-03 16:35:09,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 3: [2023-02-03 16:35:09,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 3: [2023-02-03 16:35:09,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 8: [2023-02-03 16:35:09,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 7: [2023-02-03 16:35:09,683] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 7: [2023-02-03 16:35:09,683] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 7: [2023-02-03 16:35:09,683] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 9: [2023-02-03 16:35:09,683] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 15: [2023-02-03 16:35:09,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 15: [2023-02-03 16:35:09,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 4: [2023-02-03 16:35:09,683] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 9: [2023-02-03 16:35:09,684] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 9: [2023-02-03 16:35:09,684] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 9: [2023-02-03 16:35:09,684] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 15: [2023-02-03 16:35:09,684] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 6: [2023-02-03 16:35:09,684] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 1: [2023-02-03 16:35:09,684] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 8: [2023-02-03 16:35:09,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 6: [2023-02-03 16:35:09,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 15: [2023-02-03 16:35:09,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 4: [2023-02-03 16:35:09,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 15: [2023-02-03 16:35:09,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 6: [2023-02-03 16:35:09,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 6: [2023-02-03 16:35:09,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 1: [2023-02-03 16:35:09,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 1: [2023-02-03 16:35:09,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 11: [2023-02-03 16:35:09,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 5: [2023-02-03 16:35:09,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 8: [2023-02-03 16:35:09,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 8: [2023-02-03 16:35:09,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 8: [2023-02-03 16:35:09,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 5: [2023-02-03 16:35:09,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 5: [2023-02-03 16:35:09,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 5: [2023-02-03 16:35:09,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 5: [2023-02-03 16:35:09,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 3: [2023-02-03 16:35:09,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 2: [2023-02-03 16:35:09,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 2: [2023-02-03 16:35:09,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 2: [2023-02-03 16:35:09,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 4: [2023-02-03 16:35:09,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 4: [2023-02-03 16:35:09,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 3: [2023-02-03 16:35:09,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 7: [2023-02-03 16:35:09,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 5: [2023-02-03 16:35:09,692] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 5: [2023-02-03 16:35:09,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 6: [2023-02-03 16:35:09,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:09,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 4: [2023-02-03 16:35:09,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 2: [2023-02-03 16:35:09,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 6: [2023-02-03 16:35:09,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 4: [2023-02-03 16:35:09,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 6: [2023-02-03 16:35:09,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 4: [2023-02-03 16:35:09,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 1: [2023-02-03 16:35:09,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 11: [2023-02-03 16:35:09,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 3: [2023-02-03 16:35:09,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:09,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 12: [2023-02-03 16:35:09,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 12: [2023-02-03 16:35:09,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 12: [2023-02-03 16:35:09,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 12: [2023-02-03 16:35:09,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 12: [2023-02-03 16:35:09,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 12: [2023-02-03 16:35:09,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 2: [2023-02-03 16:35:09,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:09,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 15: [2023-02-03 16:35:09,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:09,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 12: [2023-02-03 16:35:09,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 11: [2023-02-03 16:35:09,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 15: [2023-02-03 16:35:09,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:09,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 2: [2023-02-03 16:35:09,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 11: [2023-02-03 16:35:09,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 11: [2023-02-03 16:35:09,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 2: [2023-02-03 16:35:09,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 15: [2023-02-03 16:35:09,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:09,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 12: [2023-02-03 16:35:09,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 12: [2023-02-03 16:35:09,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 12: [2023-02-03 16:35:09,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 3: [2023-02-03 16:35:09,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 11: [2023-02-03 16:35:09,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 6: [2023-02-03 16:35:09,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 3: [2023-02-03 16:35:09,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 15: [2023-02-03 16:35:09,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 8: [2023-02-03 16:35:09,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 7: [2023-02-03 16:35:09,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 7: [2023-02-03 16:35:09,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 7: [2023-02-03 16:35:09,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 11: [2023-02-03 16:35:09,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 7: [2023-02-03 16:35:09,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 9: [2023-02-03 16:35:09,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 8: [2023-02-03 16:35:09,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 8: [2023-02-03 16:35:09,703] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 3: [2023-02-03 16:35:09,703] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 3: [2023-02-03 16:35:09,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 9: [2023-02-03 16:35:09,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 0: [2023-02-03 16:35:09,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 0: [2023-02-03 16:35:09,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 0: [2023-02-03 16:35:09,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 0: [2023-02-03 16:35:09,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 0: [2023-02-03 16:35:09,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 0: [2023-02-03 16:35:09,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 0: [2023-02-03 16:35:09,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 0: [2023-02-03 16:35:09,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 9: [2023-02-03 16:35:09,713] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 14: [2023-02-03 16:35:09,714] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 14: [2023-02-03 16:35:09,714] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 14: [2023-02-03 16:35:09,714] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 14: [2023-02-03 16:35:09,715] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 14: [2023-02-03 16:35:09,715] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 14: [2023-02-03 16:35:09,715] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 14: [2023-02-03 16:35:09,715] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 14: [2023-02-03 16:35:09,715] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 14: [2023-02-03 16:35:09,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 0: [2023-02-03 16:35:09,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 14: [2023-02-03 16:35:09,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 0: [2023-02-03 16:35:09,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 0: [2023-02-03 16:35:09,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 0: [2023-02-03 16:35:09,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 0: [2023-02-03 16:35:09,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 10: [2023-02-03 16:35:09,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 0: [2023-02-03 16:35:09,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 0: [2023-02-03 16:35:09,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 10: [2023-02-03 16:35:09,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 10: [2023-02-03 16:35:09,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 10: [2023-02-03 16:35:09,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 10: [2023-02-03 16:35:09,718] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 0: [2023-02-03 16:35:09,718] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,719] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,719] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 14: [2023-02-03 16:35:09,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 14: [2023-02-03 16:35:09,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 14: [2023-02-03 16:35:09,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 14: [2023-02-03 16:35:09,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 14: [2023-02-03 16:35:09,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 14: [2023-02-03 16:35:09,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 10: [2023-02-03 16:35:09,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt... 9: [2023-02-03 16:35:09,723] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 12: [2023-02-03 16:35:09,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 12: [2023-02-03 16:35:09,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 9: [2023-02-03 16:35:09,727] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 9: [2023-02-03 16:35:09,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 9: [2023-02-03 16:35:09,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 12: [2023-02-03 16:35:09,732] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 9: [2023-02-03 16:35:09,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 9: [2023-02-03 16:35:09,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 12: [2023-02-03 16:35:09,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 9: [2023-02-03 16:35:09,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 12: [2023-02-03 16:35:09,739] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:09,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 12: [2023-02-03 16:35:09,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 12: [2023-02-03 16:35:09,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 9: [2023-02-03 16:35:09,742] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:09,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 12: [2023-02-03 16:35:09,742] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 14: [2023-02-03 16:35:09,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 14: [2023-02-03 16:35:09,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 9: [2023-02-03 16:35:09,749] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 9: [2023-02-03 16:35:09,749] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:09,751] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 10: [2023-02-03 16:35:09,751] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 10: [2023-02-03 16:35:09,751] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 12: [2023-02-03 16:35:09,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 9: [2023-02-03 16:35:09,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 9: [2023-02-03 16:35:09,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 9: [2023-02-03 16:35:09,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:09,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:09,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:09,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:09,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 14: [2023-02-03 16:35:09,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 10: [2023-02-03 16:35:09,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 10: [2023-02-03 16:35:09,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 10: [2023-02-03 16:35:09,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 10: [2023-02-03 16:35:09,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 14: [2023-02-03 16:35:09,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 14: [2023-02-03 16:35:09,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 14: [2023-02-03 16:35:09,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 14: [2023-02-03 16:35:09,768] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 14: [2023-02-03 16:35:09,768] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 14: [2023-02-03 16:35:09,768] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 0: [2023-02-03 16:35:09,770] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 0: [2023-02-03 16:35:09,770] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 10: [2023-02-03 16:35:09,774] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 10: [2023-02-03 16:35:09,774] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 10: [2023-02-03 16:35:09,774] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 0: [2023-02-03 16:35:09,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 0: [2023-02-03 16:35:09,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 0: [2023-02-03 16:35:09,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 0: [2023-02-03 16:35:09,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 0: [2023-02-03 16:35:09,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 10: [2023-02-03 16:35:09,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 14: [2023-02-03 16:35:09,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 10: [2023-02-03 16:35:09,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 14: [2023-02-03 16:35:09,782] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 10: [2023-02-03 16:35:09,782] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 14: [2023-02-03 16:35:09,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:09,785] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 14: [2023-02-03 16:35:09,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 14: [2023-02-03 16:35:09,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 14: [2023-02-03 16:35:09,788] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 10: [2023-02-03 16:35:09,789] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:09,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:09,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_18-model_00-model_states.pt. 10: [2023-02-03 16:35:09,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 10: [2023-02-03 16:35:09,796] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:09,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:09,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 10: [2023-02-03 16:35:09,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:09,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:09,801] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:09,801] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 14: [2023-02-03 16:35:09,801] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:09,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 1: [2023-02-03 16:35:09,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 1: [2023-02-03 16:35:09,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 1: [2023-02-03 16:35:09,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 1: [2023-02-03 16:35:09,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 1: [2023-02-03 16:35:09,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 1: [2023-02-03 16:35:09,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 1: [2023-02-03 16:35:09,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 1: [2023-02-03 16:35:09,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 1: [2023-02-03 16:35:09,860] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 1: [2023-02-03 16:35:09,860] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 1: [2023-02-03 16:35:09,861] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 1: [2023-02-03 16:35:09,861] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 1: [2023-02-03 16:35:09,861] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 1: [2023-02-03 16:35:09,861] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 1: [2023-02-03 16:35:09,862] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 1: [2023-02-03 16:35:09,862] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 5: [2023-02-03 16:35:09,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 5: [2023-02-03 16:35:09,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 5: [2023-02-03 16:35:09,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 5: [2023-02-03 16:35:09,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 5: [2023-02-03 16:35:09,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 5: [2023-02-03 16:35:09,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 5: [2023-02-03 16:35:09,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 5: [2023-02-03 16:35:09,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 5: [2023-02-03 16:35:09,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 5: [2023-02-03 16:35:09,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 5: [2023-02-03 16:35:09,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 5: [2023-02-03 16:35:09,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 5: [2023-02-03 16:35:09,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 5: [2023-02-03 16:35:09,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 5: [2023-02-03 16:35:09,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 5: [2023-02-03 16:35:09,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 1: [2023-02-03 16:35:09,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 1: [2023-02-03 16:35:09,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 1: [2023-02-03 16:35:09,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 1: [2023-02-03 16:35:09,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 1: [2023-02-03 16:35:09,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 1: [2023-02-03 16:35:09,901] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 1: [2023-02-03 16:35:09,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 1: [2023-02-03 16:35:09,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 2: [2023-02-03 16:35:09,910] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 2: [2023-02-03 16:35:09,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 2: [2023-02-03 16:35:09,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 2: [2023-02-03 16:35:09,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 2: [2023-02-03 16:35:09,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 2: [2023-02-03 16:35:09,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 2: [2023-02-03 16:35:09,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 2: [2023-02-03 16:35:09,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 1: [2023-02-03 16:35:09,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 1: [2023-02-03 16:35:09,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 2: [2023-02-03 16:35:09,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 2: [2023-02-03 16:35:09,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 2: [2023-02-03 16:35:09,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 2: [2023-02-03 16:35:09,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 2: [2023-02-03 16:35:09,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 1: [2023-02-03 16:35:09,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 2: [2023-02-03 16:35:09,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 2: [2023-02-03 16:35:09,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 2: [2023-02-03 16:35:09,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 1: [2023-02-03 16:35:09,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 1: [2023-02-03 16:35:09,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 5: [2023-02-03 16:35:09,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 1: [2023-02-03 16:35:09,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 1: [2023-02-03 16:35:09,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 5: [2023-02-03 16:35:09,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 5: [2023-02-03 16:35:09,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 5: [2023-02-03 16:35:09,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 5: [2023-02-03 16:35:09,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 5: [2023-02-03 16:35:09,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 5: [2023-02-03 16:35:09,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 1: [2023-02-03 16:35:09,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 5: [2023-02-03 16:35:09,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 5: [2023-02-03 16:35:09,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 5: [2023-02-03 16:35:09,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 5: [2023-02-03 16:35:09,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 5: [2023-02-03 16:35:09,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 13: [2023-02-03 16:35:09,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 13: [2023-02-03 16:35:09,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 13: [2023-02-03 16:35:09,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 13: [2023-02-03 16:35:09,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 13: [2023-02-03 16:35:09,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 13: [2023-02-03 16:35:09,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 13: [2023-02-03 16:35:09,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 13: [2023-02-03 16:35:09,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 13: [2023-02-03 16:35:09,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 5: [2023-02-03 16:35:09,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 5: [2023-02-03 16:35:09,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 13: [2023-02-03 16:35:09,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 5: [2023-02-03 16:35:09,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 13: [2023-02-03 16:35:09,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 13: [2023-02-03 16:35:09,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 13: [2023-02-03 16:35:09,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 13: [2023-02-03 16:35:09,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 13: [2023-02-03 16:35:09,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 13: [2023-02-03 16:35:09,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 5: [2023-02-03 16:35:09,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 2: [2023-02-03 16:35:09,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 2: [2023-02-03 16:35:09,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 2: [2023-02-03 16:35:09,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 2: [2023-02-03 16:35:09,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 2: [2023-02-03 16:35:09,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 2: [2023-02-03 16:35:09,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 2: [2023-02-03 16:35:09,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 2: [2023-02-03 16:35:09,959] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 13: [2023-02-03 16:35:09,964] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 2: [2023-02-03 16:35:09,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 2: [2023-02-03 16:35:09,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 2: [2023-02-03 16:35:09,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 2: [2023-02-03 16:35:09,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 2: [2023-02-03 16:35:09,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 2: [2023-02-03 16:35:09,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 2: [2023-02-03 16:35:09,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 2: [2023-02-03 16:35:09,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 13: [2023-02-03 16:35:09,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 13: [2023-02-03 16:35:09,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 15: [2023-02-03 16:35:09,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 15: [2023-02-03 16:35:09,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 15: [2023-02-03 16:35:09,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 15: [2023-02-03 16:35:09,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 15: [2023-02-03 16:35:09,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 15: [2023-02-03 16:35:09,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 15: [2023-02-03 16:35:09,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 15: [2023-02-03 16:35:09,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 15: [2023-02-03 16:35:09,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 15: [2023-02-03 16:35:09,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 15: [2023-02-03 16:35:09,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 15: [2023-02-03 16:35:09,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 15: [2023-02-03 16:35:09,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 15: [2023-02-03 16:35:09,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 15: [2023-02-03 16:35:09,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 15: [2023-02-03 16:35:09,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 13: [2023-02-03 16:35:09,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 13: [2023-02-03 16:35:09,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 13: [2023-02-03 16:35:09,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 13: [2023-02-03 16:35:09,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 13: [2023-02-03 16:35:09,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 13: [2023-02-03 16:35:10,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 13: [2023-02-03 16:35:10,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 13: [2023-02-03 16:35:10,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 8: [2023-02-03 16:35:10,006] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 8: [2023-02-03 16:35:10,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 8: [2023-02-03 16:35:10,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 8: [2023-02-03 16:35:10,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 8: [2023-02-03 16:35:10,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 8: [2023-02-03 16:35:10,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 8: [2023-02-03 16:35:10,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 8: [2023-02-03 16:35:10,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 8: [2023-02-03 16:35:10,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 8: [2023-02-03 16:35:10,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 15: [2023-02-03 16:35:10,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 8: [2023-02-03 16:35:10,012] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 8: [2023-02-03 16:35:10,012] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 13: [2023-02-03 16:35:10,012] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 8: [2023-02-03 16:35:10,013] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 8: [2023-02-03 16:35:10,013] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 8: [2023-02-03 16:35:10,013] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 8: [2023-02-03 16:35:10,013] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 13: [2023-02-03 16:35:10,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 13: [2023-02-03 16:35:10,018] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 13: [2023-02-03 16:35:10,018] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 15: [2023-02-03 16:35:10,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 15: [2023-02-03 16:35:10,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 15: [2023-02-03 16:35:10,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 13: [2023-02-03 16:35:10,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 15: [2023-02-03 16:35:10,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 15: [2023-02-03 16:35:10,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 15: [2023-02-03 16:35:10,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 15: [2023-02-03 16:35:10,024] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 15: [2023-02-03 16:35:10,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 15: [2023-02-03 16:35:10,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 8: [2023-02-03 16:35:10,040] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 15: [2023-02-03 16:35:10,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 15: [2023-02-03 16:35:10,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 15: [2023-02-03 16:35:10,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 15: [2023-02-03 16:35:10,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 15: [2023-02-03 16:35:10,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 15: [2023-02-03 16:35:10,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 8: [2023-02-03 16:35:10,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 8: [2023-02-03 16:35:10,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 8: [2023-02-03 16:35:10,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 8: [2023-02-03 16:35:10,056] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 8: [2023-02-03 16:35:10,056] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 8: [2023-02-03 16:35:10,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 8: [2023-02-03 16:35:10,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 8: [2023-02-03 16:35:10,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 8: [2023-02-03 16:35:10,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,061] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,061] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,061] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,061] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,061] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,061] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,061] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,061] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 9: [2023-02-03 16:35:10,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 9: [2023-02-03 16:35:10,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 9: [2023-02-03 16:35:10,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 9: [2023-02-03 16:35:10,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 9: [2023-02-03 16:35:10,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 9: [2023-02-03 16:35:10,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 9: [2023-02-03 16:35:10,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 9: [2023-02-03 16:35:10,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 9: [2023-02-03 16:35:10,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 9: [2023-02-03 16:35:10,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 4: [2023-02-03 16:35:10,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:10,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 0: [2023-02-03 16:35:10,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 4: [2023-02-03 16:35:10,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 4: [2023-02-03 16:35:10,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:10,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 4: [2023-02-03 16:35:10,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:10,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 0: [2023-02-03 16:35:10,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:10,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 0: [2023-02-03 16:35:10,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 0: [2023-02-03 16:35:10,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 9: [2023-02-03 16:35:10,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 11: [2023-02-03 16:35:10,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 9: [2023-02-03 16:35:10,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 11: [2023-02-03 16:35:10,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 11: [2023-02-03 16:35:10,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 11: [2023-02-03 16:35:10,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 11: [2023-02-03 16:35:10,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 11: [2023-02-03 16:35:10,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 11: [2023-02-03 16:35:10,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 11: [2023-02-03 16:35:10,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 0: [2023-02-03 16:35:10,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 9: [2023-02-03 16:35:10,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 9: [2023-02-03 16:35:10,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 7: [2023-02-03 16:35:10,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 9: [2023-02-03 16:35:10,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 9: [2023-02-03 16:35:10,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 7: [2023-02-03 16:35:10,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 11: [2023-02-03 16:35:10,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 7: [2023-02-03 16:35:10,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 7: [2023-02-03 16:35:10,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 11: [2023-02-03 16:35:10,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 11: [2023-02-03 16:35:10,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 7: [2023-02-03 16:35:10,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 3: [2023-02-03 16:35:10,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 3: [2023-02-03 16:35:10,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 7: [2023-02-03 16:35:10,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 7: [2023-02-03 16:35:10,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 7: [2023-02-03 16:35:10,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 3: [2023-02-03 16:35:10,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 3: [2023-02-03 16:35:10,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 3: [2023-02-03 16:35:10,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 3: [2023-02-03 16:35:10,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 3: [2023-02-03 16:35:10,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 3: [2023-02-03 16:35:10,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 11: [2023-02-03 16:35:10,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 11: [2023-02-03 16:35:10,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 11: [2023-02-03 16:35:10,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 11: [2023-02-03 16:35:10,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 11: [2023-02-03 16:35:10,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 3: [2023-02-03 16:35:10,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 3: [2023-02-03 16:35:10,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 7: [2023-02-03 16:35:10,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 7: [2023-02-03 16:35:10,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:10,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 3: [2023-02-03 16:35:10,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:10,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 7: [2023-02-03 16:35:10,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:10,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:10,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:10,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:10,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:10,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 7: [2023-02-03 16:35:10,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 7: [2023-02-03 16:35:10,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 7: [2023-02-03 16:35:10,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 7: [2023-02-03 16:35:10,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 7: [2023-02-03 16:35:10,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 3: [2023-02-03 16:35:10,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 3: [2023-02-03 16:35:10,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 3: [2023-02-03 16:35:10,075] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 3: [2023-02-03 16:35:10,075] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 3: [2023-02-03 16:35:10,075] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 8: [2023-02-03 16:35:10,076] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 8: [2023-02-03 16:35:10,076] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 8: [2023-02-03 16:35:10,077] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 8: [2023-02-03 16:35:10,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 8: [2023-02-03 16:35:10,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 8: [2023-02-03 16:35:10,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 4: [2023-02-03 16:35:10,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 14: [2023-02-03 16:35:10,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 14: [2023-02-03 16:35:10,092] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 6: [2023-02-03 16:35:10,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 6: [2023-02-03 16:35:10,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 14: [2023-02-03 16:35:10,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 6: [2023-02-03 16:35:10,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 14: [2023-02-03 16:35:10,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 14: [2023-02-03 16:35:10,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 6: [2023-02-03 16:35:10,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 14: [2023-02-03 16:35:10,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 6: [2023-02-03 16:35:10,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 6: [2023-02-03 16:35:10,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 6: [2023-02-03 16:35:10,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 14: [2023-02-03 16:35:10,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 14: [2023-02-03 16:35:10,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 14: [2023-02-03 16:35:10,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 6: [2023-02-03 16:35:10,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 9: [2023-02-03 16:35:10,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 14: [2023-02-03 16:35:10,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 6: [2023-02-03 16:35:10,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 6: [2023-02-03 16:35:10,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 6: [2023-02-03 16:35:10,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 6: [2023-02-03 16:35:10,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 9: [2023-02-03 16:35:10,097] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 6: [2023-02-03 16:35:10,097] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 6: [2023-02-03 16:35:10,098] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 6: [2023-02-03 16:35:10,098] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 6: [2023-02-03 16:35:10,098] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:10,098] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 14: [2023-02-03 16:35:10,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 3: [2023-02-03 16:35:10,099] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 14: [2023-02-03 16:35:10,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 14: [2023-02-03 16:35:10,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 14: [2023-02-03 16:35:10,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 3: [2023-02-03 16:35:10,099] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 14: [2023-02-03 16:35:10,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 14: [2023-02-03 16:35:10,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:10,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 12: [2023-02-03 16:35:10,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 12: [2023-02-03 16:35:10,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 12: [2023-02-03 16:35:10,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 12: [2023-02-03 16:35:10,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 12: [2023-02-03 16:35:10,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 12: [2023-02-03 16:35:10,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 12: [2023-02-03 16:35:10,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:10,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:10,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 11: [2023-02-03 16:35:10,103] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 11: [2023-02-03 16:35:10,103] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,103] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 7: [2023-02-03 16:35:10,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 7: [2023-02-03 16:35:10,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 7: [2023-02-03 16:35:10,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 12: [2023-02-03 16:35:10,104] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:10,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:10,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:10,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 12: [2023-02-03 16:35:10,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 4: [2023-02-03 16:35:10,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 9: [2023-02-03 16:35:10,107] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 11: [2023-02-03 16:35:10,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 3: [2023-02-03 16:35:10,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 9: [2023-02-03 16:35:10,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 9: [2023-02-03 16:35:10,111] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 9: [2023-02-03 16:35:10,111] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 0: [2023-02-03 16:35:10,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 9: [2023-02-03 16:35:10,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 11: [2023-02-03 16:35:10,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 11: [2023-02-03 16:35:10,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 11: [2023-02-03 16:35:10,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 11: [2023-02-03 16:35:10,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 11: [2023-02-03 16:35:10,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 3: [2023-02-03 16:35:10,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 3: [2023-02-03 16:35:10,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 3: [2023-02-03 16:35:10,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 3: [2023-02-03 16:35:10,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 11: [2023-02-03 16:35:10,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 7: [2023-02-03 16:35:10,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 7: [2023-02-03 16:35:10,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 7: [2023-02-03 16:35:10,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 3: [2023-02-03 16:35:10,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 3: [2023-02-03 16:35:10,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 3: [2023-02-03 16:35:10,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 11: [2023-02-03 16:35:10,117] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 7: [2023-02-03 16:35:10,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 7: [2023-02-03 16:35:10,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 7: [2023-02-03 16:35:10,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 7: [2023-02-03 16:35:10,119] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 7: [2023-02-03 16:35:10,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 11: [2023-02-03 16:35:10,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 14: [2023-02-03 16:35:10,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 3: [2023-02-03 16:35:10,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 4: [2023-02-03 16:35:10,124] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 0: [2023-02-03 16:35:10,125] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 9: [2023-02-03 16:35:10,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 14: [2023-02-03 16:35:10,126] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 4: [2023-02-03 16:35:10,128] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 9: [2023-02-03 16:35:10,129] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 9: [2023-02-03 16:35:10,129] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,129] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 4: [2023-02-03 16:35:10,129] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 6: [2023-02-03 16:35:10,129] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 6: [2023-02-03 16:35:10,129] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 4: [2023-02-03 16:35:10,129] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 0: [2023-02-03 16:35:10,129] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 9: [2023-02-03 16:35:10,129] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 9: [2023-02-03 16:35:10,130] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 11: [2023-02-03 16:35:10,131] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 6: [2023-02-03 16:35:10,132] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 11: [2023-02-03 16:35:10,132] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 3: [2023-02-03 16:35:10,132] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 11: [2023-02-03 16:35:10,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 11: [2023-02-03 16:35:10,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 4: [2023-02-03 16:35:10,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 9: [2023-02-03 16:35:10,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 3: [2023-02-03 16:35:10,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 7: [2023-02-03 16:35:10,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 7: [2023-02-03 16:35:10,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 11: [2023-02-03 16:35:10,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 3: [2023-02-03 16:35:10,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 7: [2023-02-03 16:35:10,134] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 0: [2023-02-03 16:35:10,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 0: [2023-02-03 16:35:10,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 0: [2023-02-03 16:35:10,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 0: [2023-02-03 16:35:10,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 0: [2023-02-03 16:35:10,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 12: [2023-02-03 16:35:10,135] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 12: [2023-02-03 16:35:10,135] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 3: [2023-02-03 16:35:10,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 6: [2023-02-03 16:35:10,136] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 3: [2023-02-03 16:35:10,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 6: [2023-02-03 16:35:10,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 14: [2023-02-03 16:35:10,138] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 7: [2023-02-03 16:35:10,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 7: [2023-02-03 16:35:10,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 12: [2023-02-03 16:35:10,139] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 0: [2023-02-03 16:35:10,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 9: [2023-02-03 16:35:10,144] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 14: [2023-02-03 16:35:10,144] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 6: [2023-02-03 16:35:10,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 6: [2023-02-03 16:35:10,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 12: [2023-02-03 16:35:10,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 14: [2023-02-03 16:35:10,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 6: [2023-02-03 16:35:10,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 14: [2023-02-03 16:35:10,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 6: [2023-02-03 16:35:10,146] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 14: [2023-02-03 16:35:10,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 14: [2023-02-03 16:35:10,147] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 14: [2023-02-03 16:35:10,147] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 14: [2023-02-03 16:35:10,147] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 12: [2023-02-03 16:35:10,147] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 12: [2023-02-03 16:35:10,147] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 12: [2023-02-03 16:35:10,147] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 12: [2023-02-03 16:35:10,147] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 12: [2023-02-03 16:35:10,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 9: [2023-02-03 16:35:10,148] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 9: [2023-02-03 16:35:10,148] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 12: [2023-02-03 16:35:10,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 6: [2023-02-03 16:35:10,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 6: [2023-02-03 16:35:10,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 0: [2023-02-03 16:35:10,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 12: [2023-02-03 16:35:10,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 10: [2023-02-03 16:35:10,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 10: [2023-02-03 16:35:10,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 6: [2023-02-03 16:35:10,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 6: [2023-02-03 16:35:10,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 0: [2023-02-03 16:35:10,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 0: [2023-02-03 16:35:10,155] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 10: [2023-02-03 16:35:10,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 10: [2023-02-03 16:35:10,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 10: [2023-02-03 16:35:10,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 10: [2023-02-03 16:35:10,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 10: [2023-02-03 16:35:10,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 10: [2023-02-03 16:35:10,156] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 10: [2023-02-03 16:35:10,156] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 10: [2023-02-03 16:35:10,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 0: [2023-02-03 16:35:10,156] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 0: [2023-02-03 16:35:10,156] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 10: [2023-02-03 16:35:10,158] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 0: [2023-02-03 16:35:10,160] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 10: [2023-02-03 16:35:10,160] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 10: [2023-02-03 16:35:10,160] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 10: [2023-02-03 16:35:10,160] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 10: [2023-02-03 16:35:10,160] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 10: [2023-02-03 16:35:10,161] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt... 6: [2023-02-03 16:35:10,161] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 12: [2023-02-03 16:35:10,161] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 6: [2023-02-03 16:35:10,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 6: [2023-02-03 16:35:10,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 14: [2023-02-03 16:35:10,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 14: [2023-02-03 16:35:10,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 14: [2023-02-03 16:35:10,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 12: [2023-02-03 16:35:10,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 12: [2023-02-03 16:35:10,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 14: [2023-02-03 16:35:10,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 12: [2023-02-03 16:35:10,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 0: [2023-02-03 16:35:10,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 12: [2023-02-03 16:35:10,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 14: [2023-02-03 16:35:10,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 14: [2023-02-03 16:35:10,169] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 5: [2023-02-03 16:35:10,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 5: [2023-02-03 16:35:10,176] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 5: [2023-02-03 16:35:10,176] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 5: [2023-02-03 16:35:10,176] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 5: [2023-02-03 16:35:10,176] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 5: [2023-02-03 16:35:10,176] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 5: [2023-02-03 16:35:10,176] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 5: [2023-02-03 16:35:10,176] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 5: [2023-02-03 16:35:10,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 5: [2023-02-03 16:35:10,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 5: [2023-02-03 16:35:10,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 5: [2023-02-03 16:35:10,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 5: [2023-02-03 16:35:10,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 5: [2023-02-03 16:35:10,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 5: [2023-02-03 16:35:10,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 5: [2023-02-03 16:35:10,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 10: [2023-02-03 16:35:10,186] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 10: [2023-02-03 16:35:10,186] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 1: [2023-02-03 16:35:10,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 1: [2023-02-03 16:35:10,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 1: [2023-02-03 16:35:10,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 1: [2023-02-03 16:35:10,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 1: [2023-02-03 16:35:10,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 1: [2023-02-03 16:35:10,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 1: [2023-02-03 16:35:10,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 1: [2023-02-03 16:35:10,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 1: [2023-02-03 16:35:10,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 1: [2023-02-03 16:35:10,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 1: [2023-02-03 16:35:10,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 1: [2023-02-03 16:35:10,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 1: [2023-02-03 16:35:10,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 1: [2023-02-03 16:35:10,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 1: [2023-02-03 16:35:10,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 1: [2023-02-03 16:35:10,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 10: [2023-02-03 16:35:10,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 10: [2023-02-03 16:35:10,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 10: [2023-02-03 16:35:10,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 10: [2023-02-03 16:35:10,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 10: [2023-02-03 16:35:10,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 10: [2023-02-03 16:35:10,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 10: [2023-02-03 16:35:10,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 5: [2023-02-03 16:35:10,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 10: [2023-02-03 16:35:10,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_19-model_00-model_states.pt. 5: [2023-02-03 16:35:10,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 5: [2023-02-03 16:35:10,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 5: [2023-02-03 16:35:10,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 5: [2023-02-03 16:35:10,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 5: [2023-02-03 16:35:10,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 5: [2023-02-03 16:35:10,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 10: [2023-02-03 16:35:10,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 5: [2023-02-03 16:35:10,223] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 10: [2023-02-03 16:35:10,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 10: [2023-02-03 16:35:10,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 13: [2023-02-03 16:35:10,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 10: [2023-02-03 16:35:10,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 10: [2023-02-03 16:35:10,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 5: [2023-02-03 16:35:10,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 13: [2023-02-03 16:35:10,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 13: [2023-02-03 16:35:10,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 13: [2023-02-03 16:35:10,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 13: [2023-02-03 16:35:10,232] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 13: [2023-02-03 16:35:10,232] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 13: [2023-02-03 16:35:10,232] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 13: [2023-02-03 16:35:10,232] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 13: [2023-02-03 16:35:10,232] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 5: [2023-02-03 16:35:10,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 5: [2023-02-03 16:35:10,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 13: [2023-02-03 16:35:10,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 13: [2023-02-03 16:35:10,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 1: [2023-02-03 16:35:10,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 1: [2023-02-03 16:35:10,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 13: [2023-02-03 16:35:10,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 5: [2023-02-03 16:35:10,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 1: [2023-02-03 16:35:10,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 1: [2023-02-03 16:35:10,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 13: [2023-02-03 16:35:10,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 1: [2023-02-03 16:35:10,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 1: [2023-02-03 16:35:10,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 1: [2023-02-03 16:35:10,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 1: [2023-02-03 16:35:10,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 13: [2023-02-03 16:35:10,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 13: [2023-02-03 16:35:10,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 13: [2023-02-03 16:35:10,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 5: [2023-02-03 16:35:10,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 5: [2023-02-03 16:35:10,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 5: [2023-02-03 16:35:10,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 5: [2023-02-03 16:35:10,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 1: [2023-02-03 16:35:10,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 1: [2023-02-03 16:35:10,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 1: [2023-02-03 16:35:10,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 15: [2023-02-03 16:35:10,255] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 1: [2023-02-03 16:35:10,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 15: [2023-02-03 16:35:10,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 1: [2023-02-03 16:35:10,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 1: [2023-02-03 16:35:10,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 1: [2023-02-03 16:35:10,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 1: [2023-02-03 16:35:10,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 13: [2023-02-03 16:35:10,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 4: [2023-02-03 16:35:10,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 4: [2023-02-03 16:35:10,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 4: [2023-02-03 16:35:10,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 4: [2023-02-03 16:35:10,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 15: [2023-02-03 16:35:10,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 15: [2023-02-03 16:35:10,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 15: [2023-02-03 16:35:10,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 4: [2023-02-03 16:35:10,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 4: [2023-02-03 16:35:10,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 4: [2023-02-03 16:35:10,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 4: [2023-02-03 16:35:10,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 15: [2023-02-03 16:35:10,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 15: [2023-02-03 16:35:10,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 15: [2023-02-03 16:35:10,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 15: [2023-02-03 16:35:10,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 15: [2023-02-03 16:35:10,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 15: [2023-02-03 16:35:10,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 4: [2023-02-03 16:35:10,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 15: [2023-02-03 16:35:10,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 15: [2023-02-03 16:35:10,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 4: [2023-02-03 16:35:10,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 15: [2023-02-03 16:35:10,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 15: [2023-02-03 16:35:10,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 15: [2023-02-03 16:35:10,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 4: [2023-02-03 16:35:10,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 4: [2023-02-03 16:35:10,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 4: [2023-02-03 16:35:10,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 4: [2023-02-03 16:35:10,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 4: [2023-02-03 16:35:10,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 4: [2023-02-03 16:35:10,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 2: [2023-02-03 16:35:10,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 13: [2023-02-03 16:35:10,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 2: [2023-02-03 16:35:10,268] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 13: [2023-02-03 16:35:10,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 2: [2023-02-03 16:35:10,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 9: [2023-02-03 16:35:10,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 2: [2023-02-03 16:35:10,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 2: [2023-02-03 16:35:10,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 9: [2023-02-03 16:35:10,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 2: [2023-02-03 16:35:10,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 2: [2023-02-03 16:35:10,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 2: [2023-02-03 16:35:10,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 2: [2023-02-03 16:35:10,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 9: [2023-02-03 16:35:10,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 9: [2023-02-03 16:35:10,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 9: [2023-02-03 16:35:10,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 9: [2023-02-03 16:35:10,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 9: [2023-02-03 16:35:10,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 9: [2023-02-03 16:35:10,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 9: [2023-02-03 16:35:10,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 9: [2023-02-03 16:35:10,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 13: [2023-02-03 16:35:10,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 2: [2023-02-03 16:35:10,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 2: [2023-02-03 16:35:10,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 2: [2023-02-03 16:35:10,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 2: [2023-02-03 16:35:10,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 9: [2023-02-03 16:35:10,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 9: [2023-02-03 16:35:10,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 2: [2023-02-03 16:35:10,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 2: [2023-02-03 16:35:10,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 2: [2023-02-03 16:35:10,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 13: [2023-02-03 16:35:10,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 9: [2023-02-03 16:35:10,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 9: [2023-02-03 16:35:10,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 9: [2023-02-03 16:35:10,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 9: [2023-02-03 16:35:10,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 15: [2023-02-03 16:35:10,276] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 13: [2023-02-03 16:35:10,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 13: [2023-02-03 16:35:10,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 13: [2023-02-03 16:35:10,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 13: [2023-02-03 16:35:10,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 13: [2023-02-03 16:35:10,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 13: [2023-02-03 16:35:10,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 13: [2023-02-03 16:35:10,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 8: [2023-02-03 16:35:10,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 8: [2023-02-03 16:35:10,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 15: [2023-02-03 16:35:10,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 4: [2023-02-03 16:35:10,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 15: [2023-02-03 16:35:10,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 13: [2023-02-03 16:35:10,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 8: [2023-02-03 16:35:10,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 8: [2023-02-03 16:35:10,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 8: [2023-02-03 16:35:10,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 14: [2023-02-03 16:35:10,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 8: [2023-02-03 16:35:10,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 8: [2023-02-03 16:35:10,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 8: [2023-02-03 16:35:10,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 8: [2023-02-03 16:35:10,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 2: [2023-02-03 16:35:10,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 15: [2023-02-03 16:35:10,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 14: [2023-02-03 16:35:10,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 8: [2023-02-03 16:35:10,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 14: [2023-02-03 16:35:10,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 14: [2023-02-03 16:35:10,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 14: [2023-02-03 16:35:10,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 14: [2023-02-03 16:35:10,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 14: [2023-02-03 16:35:10,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 13: [2023-02-03 16:35:10,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 14: [2023-02-03 16:35:10,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 14: [2023-02-03 16:35:10,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 4: [2023-02-03 16:35:10,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 8: [2023-02-03 16:35:10,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 14: [2023-02-03 16:35:10,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 13: [2023-02-03 16:35:10,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 14: [2023-02-03 16:35:10,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 8: [2023-02-03 16:35:10,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 13: [2023-02-03 16:35:10,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 8: [2023-02-03 16:35:10,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 8: [2023-02-03 16:35:10,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 8: [2023-02-03 16:35:10,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 8: [2023-02-03 16:35:10,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 4: [2023-02-03 16:35:10,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 4: [2023-02-03 16:35:10,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 4: [2023-02-03 16:35:10,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 4: [2023-02-03 16:35:10,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 14: [2023-02-03 16:35:10,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 4: [2023-02-03 16:35:10,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 14: [2023-02-03 16:35:10,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 14: [2023-02-03 16:35:10,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 14: [2023-02-03 16:35:10,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 14: [2023-02-03 16:35:10,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 9: [2023-02-03 16:35:10,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 15: [2023-02-03 16:35:10,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 15: [2023-02-03 16:35:10,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 4: [2023-02-03 16:35:10,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 9: [2023-02-03 16:35:10,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 15: [2023-02-03 16:35:10,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 15: [2023-02-03 16:35:10,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 8: [2023-02-03 16:35:10,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 9: [2023-02-03 16:35:10,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 4: [2023-02-03 16:35:10,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 2: [2023-02-03 16:35:10,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 2: [2023-02-03 16:35:10,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 2: [2023-02-03 16:35:10,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 2: [2023-02-03 16:35:10,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 2: [2023-02-03 16:35:10,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 2: [2023-02-03 16:35:10,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 2: [2023-02-03 16:35:10,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 2: [2023-02-03 16:35:10,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 0: [2023-02-03 16:35:10,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 0: [2023-02-03 16:35:10,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 0: [2023-02-03 16:35:10,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 0: [2023-02-03 16:35:10,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 0: [2023-02-03 16:35:10,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 0: [2023-02-03 16:35:10,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 0: [2023-02-03 16:35:10,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 0: [2023-02-03 16:35:10,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 15: [2023-02-03 16:35:10,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 15: [2023-02-03 16:35:10,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 9: [2023-02-03 16:35:10,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 15: [2023-02-03 16:35:10,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 4: [2023-02-03 16:35:10,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 9: [2023-02-03 16:35:10,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 0: [2023-02-03 16:35:10,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 0: [2023-02-03 16:35:10,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 0: [2023-02-03 16:35:10,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 0: [2023-02-03 16:35:10,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 0: [2023-02-03 16:35:10,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 0: [2023-02-03 16:35:10,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 0: [2023-02-03 16:35:10,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 0: [2023-02-03 16:35:10,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 4: [2023-02-03 16:35:10,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 4: [2023-02-03 16:35:10,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 4: [2023-02-03 16:35:10,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 15: [2023-02-03 16:35:10,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 15: [2023-02-03 16:35:10,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 8: [2023-02-03 16:35:10,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 4: [2023-02-03 16:35:10,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 4: [2023-02-03 16:35:10,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 4: [2023-02-03 16:35:10,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 9: [2023-02-03 16:35:10,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 15: [2023-02-03 16:35:10,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 15: [2023-02-03 16:35:10,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 2: [2023-02-03 16:35:10,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 2: [2023-02-03 16:35:10,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 14: [2023-02-03 16:35:10,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 2: [2023-02-03 16:35:10,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 9: [2023-02-03 16:35:10,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 9: [2023-02-03 16:35:10,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 9: [2023-02-03 16:35:10,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 9: [2023-02-03 16:35:10,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 9: [2023-02-03 16:35:10,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 9: [2023-02-03 16:35:10,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 15: [2023-02-03 16:35:10,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 2: [2023-02-03 16:35:10,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 8: [2023-02-03 16:35:10,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 2: [2023-02-03 16:35:10,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 2: [2023-02-03 16:35:10,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 2: [2023-02-03 16:35:10,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 14: [2023-02-03 16:35:10,341] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 14: [2023-02-03 16:35:10,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 11: [2023-02-03 16:35:10,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 11: [2023-02-03 16:35:10,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 11: [2023-02-03 16:35:10,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 11: [2023-02-03 16:35:10,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 11: [2023-02-03 16:35:10,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 11: [2023-02-03 16:35:10,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 11: [2023-02-03 16:35:10,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 11: [2023-02-03 16:35:10,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 11: [2023-02-03 16:35:10,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 11: [2023-02-03 16:35:10,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 11: [2023-02-03 16:35:10,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 9: [2023-02-03 16:35:10,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 14: [2023-02-03 16:35:10,346] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 11: [2023-02-03 16:35:10,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 11: [2023-02-03 16:35:10,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 8: [2023-02-03 16:35:10,346] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 11: [2023-02-03 16:35:10,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 11: [2023-02-03 16:35:10,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 11: [2023-02-03 16:35:10,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 9: [2023-02-03 16:35:10,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 8: [2023-02-03 16:35:10,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 8: [2023-02-03 16:35:10,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 9: [2023-02-03 16:35:10,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 9: [2023-02-03 16:35:10,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 8: [2023-02-03 16:35:10,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 14: [2023-02-03 16:35:10,351] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 14: [2023-02-03 16:35:10,351] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 8: [2023-02-03 16:35:10,354] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 8: [2023-02-03 16:35:10,354] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 14: [2023-02-03 16:35:10,355] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 14: [2023-02-03 16:35:10,355] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 8: [2023-02-03 16:35:10,355] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 14: [2023-02-03 16:35:10,355] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 3: [2023-02-03 16:35:10,357] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 3: [2023-02-03 16:35:10,357] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 3: [2023-02-03 16:35:10,357] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 3: [2023-02-03 16:35:10,358] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 3: [2023-02-03 16:35:10,358] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 3: [2023-02-03 16:35:10,358] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 3: [2023-02-03 16:35:10,358] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 3: [2023-02-03 16:35:10,358] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 14: [2023-02-03 16:35:10,359] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 3: [2023-02-03 16:35:10,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 3: [2023-02-03 16:35:10,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 3: [2023-02-03 16:35:10,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 3: [2023-02-03 16:35:10,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 3: [2023-02-03 16:35:10,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 0: [2023-02-03 16:35:10,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 6: [2023-02-03 16:35:10,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 3: [2023-02-03 16:35:10,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 3: [2023-02-03 16:35:10,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 3: [2023-02-03 16:35:10,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 6: [2023-02-03 16:35:10,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 6: [2023-02-03 16:35:10,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 14: [2023-02-03 16:35:10,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 6: [2023-02-03 16:35:10,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 6: [2023-02-03 16:35:10,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 6: [2023-02-03 16:35:10,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 6: [2023-02-03 16:35:10,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 8: [2023-02-03 16:35:10,364] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 6: [2023-02-03 16:35:10,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 6: [2023-02-03 16:35:10,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 6: [2023-02-03 16:35:10,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 6: [2023-02-03 16:35:10,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 7: [2023-02-03 16:35:10,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 6: [2023-02-03 16:35:10,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 7: [2023-02-03 16:35:10,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 7: [2023-02-03 16:35:10,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 7: [2023-02-03 16:35:10,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 8: [2023-02-03 16:35:10,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 8: [2023-02-03 16:35:10,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 7: [2023-02-03 16:35:10,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 7: [2023-02-03 16:35:10,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 7: [2023-02-03 16:35:10,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 7: [2023-02-03 16:35:10,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 12: [2023-02-03 16:35:10,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 12: [2023-02-03 16:35:10,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 12: [2023-02-03 16:35:10,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 12: [2023-02-03 16:35:10,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 12: [2023-02-03 16:35:10,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 12: [2023-02-03 16:35:10,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 12: [2023-02-03 16:35:10,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 12: [2023-02-03 16:35:10,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 6: [2023-02-03 16:35:10,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 7: [2023-02-03 16:35:10,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 12: [2023-02-03 16:35:10,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 6: [2023-02-03 16:35:10,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 6: [2023-02-03 16:35:10,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 12: [2023-02-03 16:35:10,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 6: [2023-02-03 16:35:10,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 7: [2023-02-03 16:35:10,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 7: [2023-02-03 16:35:10,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 7: [2023-02-03 16:35:10,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 14: [2023-02-03 16:35:10,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 12: [2023-02-03 16:35:10,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 11: [2023-02-03 16:35:10,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 10: [2023-02-03 16:35:10,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 7: [2023-02-03 16:35:10,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 7: [2023-02-03 16:35:10,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 7: [2023-02-03 16:35:10,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 7: [2023-02-03 16:35:10,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 10: [2023-02-03 16:35:10,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 10: [2023-02-03 16:35:10,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 10: [2023-02-03 16:35:10,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 10: [2023-02-03 16:35:10,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 10: [2023-02-03 16:35:10,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 10: [2023-02-03 16:35:10,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 10: [2023-02-03 16:35:10,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 12: [2023-02-03 16:35:10,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 12: [2023-02-03 16:35:10,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 12: [2023-02-03 16:35:10,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 12: [2023-02-03 16:35:10,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 12: [2023-02-03 16:35:10,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 10: [2023-02-03 16:35:10,373] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 14: [2023-02-03 16:35:10,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,373] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 10: [2023-02-03 16:35:10,373] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 10: [2023-02-03 16:35:10,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 14: [2023-02-03 16:35:10,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 8: [2023-02-03 16:35:10,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 10: [2023-02-03 16:35:10,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 10: [2023-02-03 16:35:10,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 10: [2023-02-03 16:35:10,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt... 8: [2023-02-03 16:35:10,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 14: [2023-02-03 16:35:10,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 14: [2023-02-03 16:35:10,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 8: [2023-02-03 16:35:10,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 0: [2023-02-03 16:35:10,376] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 0: [2023-02-03 16:35:10,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 11: [2023-02-03 16:35:10,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 0: [2023-02-03 16:35:10,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 0: [2023-02-03 16:35:10,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 0: [2023-02-03 16:35:10,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 0: [2023-02-03 16:35:10,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 0: [2023-02-03 16:35:10,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 0: [2023-02-03 16:35:10,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 11: [2023-02-03 16:35:10,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 11: [2023-02-03 16:35:10,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 11: [2023-02-03 16:35:10,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 11: [2023-02-03 16:35:10,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 11: [2023-02-03 16:35:10,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 11: [2023-02-03 16:35:10,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 6: [2023-02-03 16:35:10,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 3: [2023-02-03 16:35:10,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 0: [2023-02-03 16:35:10,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 11: [2023-02-03 16:35:10,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 3: [2023-02-03 16:35:10,394] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 3: [2023-02-03 16:35:10,394] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 3: [2023-02-03 16:35:10,394] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 12: [2023-02-03 16:35:10,397] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 12: [2023-02-03 16:35:10,397] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 11: [2023-02-03 16:35:10,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 7: [2023-02-03 16:35:10,399] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 6: [2023-02-03 16:35:10,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 6: [2023-02-03 16:35:10,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 7: [2023-02-03 16:35:10,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 7: [2023-02-03 16:35:10,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 6: [2023-02-03 16:35:10,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 3: [2023-02-03 16:35:10,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 10: [2023-02-03 16:35:10,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 10: [2023-02-03 16:35:10,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 0: [2023-02-03 16:35:10,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 0: [2023-02-03 16:35:10,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 3: [2023-02-03 16:35:10,406] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 3: [2023-02-03 16:35:10,406] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 3: [2023-02-03 16:35:10,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 0: [2023-02-03 16:35:10,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 11: [2023-02-03 16:35:10,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 0: [2023-02-03 16:35:10,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 0: [2023-02-03 16:35:10,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,408] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 6: [2023-02-03 16:35:10,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 6: [2023-02-03 16:35:10,408] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 11: [2023-02-03 16:35:10,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 11: [2023-02-03 16:35:10,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 3: [2023-02-03 16:35:10,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 3: [2023-02-03 16:35:10,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 3: [2023-02-03 16:35:10,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 7: [2023-02-03 16:35:10,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 11: [2023-02-03 16:35:10,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 11: [2023-02-03 16:35:10,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 11: [2023-02-03 16:35:10,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 12: [2023-02-03 16:35:10,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 12: [2023-02-03 16:35:10,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 3: [2023-02-03 16:35:10,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 12: [2023-02-03 16:35:10,411] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 0: [2023-02-03 16:35:10,411] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 12: [2023-02-03 16:35:10,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 12: [2023-02-03 16:35:10,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 12: [2023-02-03 16:35:10,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 12: [2023-02-03 16:35:10,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 6: [2023-02-03 16:35:10,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 7: [2023-02-03 16:35:10,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 6: [2023-02-03 16:35:10,417] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 7: [2023-02-03 16:35:10,417] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 7: [2023-02-03 16:35:10,417] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 7: [2023-02-03 16:35:10,417] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 7: [2023-02-03 16:35:10,417] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 10: [2023-02-03 16:35:10,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 7: [2023-02-03 16:35:10,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 12: [2023-02-03 16:35:10,418] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 7: [2023-02-03 16:35:10,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 6: [2023-02-03 16:35:10,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 6: [2023-02-03 16:35:10,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 6: [2023-02-03 16:35:10,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 3: [2023-02-03 16:35:10,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 3: [2023-02-03 16:35:10,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 3: [2023-02-03 16:35:10,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 7: [2023-02-03 16:35:10,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 6: [2023-02-03 16:35:10,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 12: [2023-02-03 16:35:10,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 6: [2023-02-03 16:35:10,425] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 3: [2023-02-03 16:35:10,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 10: [2023-02-03 16:35:10,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 10: [2023-02-03 16:35:10,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 10: [2023-02-03 16:35:10,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_20-model_00-model_states.pt. 6: [2023-02-03 16:35:10,431] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 12: [2023-02-03 16:35:10,432] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 12: [2023-02-03 16:35:10,432] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 6: [2023-02-03 16:35:10,433] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 12: [2023-02-03 16:35:10,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 12: [2023-02-03 16:35:10,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 7: [2023-02-03 16:35:10,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 7: [2023-02-03 16:35:10,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 7: [2023-02-03 16:35:10,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 7: [2023-02-03 16:35:10,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 12: [2023-02-03 16:35:10,438] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 6: [2023-02-03 16:35:10,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 1: [2023-02-03 16:35:10,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 1: [2023-02-03 16:35:10,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 1: [2023-02-03 16:35:10,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 1: [2023-02-03 16:35:10,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 1: [2023-02-03 16:35:10,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 1: [2023-02-03 16:35:10,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 1: [2023-02-03 16:35:10,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 1: [2023-02-03 16:35:10,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 1: [2023-02-03 16:35:10,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 1: [2023-02-03 16:35:10,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 1: [2023-02-03 16:35:10,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 1: [2023-02-03 16:35:10,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 1: [2023-02-03 16:35:10,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 1: [2023-02-03 16:35:10,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 1: [2023-02-03 16:35:10,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 1: [2023-02-03 16:35:10,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 1: [2023-02-03 16:35:10,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 1: [2023-02-03 16:35:10,527] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 1: [2023-02-03 16:35:10,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 1: [2023-02-03 16:35:10,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 1: [2023-02-03 16:35:10,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 1: [2023-02-03 16:35:10,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 1: [2023-02-03 16:35:10,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 1: [2023-02-03 16:35:10,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 1: [2023-02-03 16:35:10,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 2: [2023-02-03 16:35:10,544] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 2: [2023-02-03 16:35:10,544] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 2: [2023-02-03 16:35:10,544] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 2: [2023-02-03 16:35:10,544] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 2: [2023-02-03 16:35:10,544] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 2: [2023-02-03 16:35:10,544] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 2: [2023-02-03 16:35:10,544] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 2: [2023-02-03 16:35:10,544] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 1: [2023-02-03 16:35:10,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 2: [2023-02-03 16:35:10,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 2: [2023-02-03 16:35:10,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 2: [2023-02-03 16:35:10,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 2: [2023-02-03 16:35:10,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 2: [2023-02-03 16:35:10,548] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 2: [2023-02-03 16:35:10,548] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 2: [2023-02-03 16:35:10,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 2: [2023-02-03 16:35:10,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 1: [2023-02-03 16:35:10,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 1: [2023-02-03 16:35:10,554] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 4: [2023-02-03 16:35:10,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 1: [2023-02-03 16:35:10,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 4: [2023-02-03 16:35:10,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 4: [2023-02-03 16:35:10,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 4: [2023-02-03 16:35:10,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 4: [2023-02-03 16:35:10,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 1: [2023-02-03 16:35:10,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 4: [2023-02-03 16:35:10,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 1: [2023-02-03 16:35:10,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 1: [2023-02-03 16:35:10,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 4: [2023-02-03 16:35:10,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 4: [2023-02-03 16:35:10,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 4: [2023-02-03 16:35:10,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 4: [2023-02-03 16:35:10,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 4: [2023-02-03 16:35:10,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 4: [2023-02-03 16:35:10,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 15: [2023-02-03 16:35:10,565] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 15: [2023-02-03 16:35:10,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 15: [2023-02-03 16:35:10,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 15: [2023-02-03 16:35:10,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 15: [2023-02-03 16:35:10,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 15: [2023-02-03 16:35:10,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 15: [2023-02-03 16:35:10,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 15: [2023-02-03 16:35:10,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 15: [2023-02-03 16:35:10,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 15: [2023-02-03 16:35:10,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 15: [2023-02-03 16:35:10,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 15: [2023-02-03 16:35:10,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 15: [2023-02-03 16:35:10,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 15: [2023-02-03 16:35:10,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 15: [2023-02-03 16:35:10,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 15: [2023-02-03 16:35:10,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 2: [2023-02-03 16:35:10,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 4: [2023-02-03 16:35:10,579] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 8: [2023-02-03 16:35:10,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 8: [2023-02-03 16:35:10,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 8: [2023-02-03 16:35:10,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 8: [2023-02-03 16:35:10,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 2: [2023-02-03 16:35:10,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 2: [2023-02-03 16:35:10,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 8: [2023-02-03 16:35:10,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 8: [2023-02-03 16:35:10,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 8: [2023-02-03 16:35:10,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 8: [2023-02-03 16:35:10,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 8: [2023-02-03 16:35:10,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 8: [2023-02-03 16:35:10,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 2: [2023-02-03 16:35:10,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 2: [2023-02-03 16:35:10,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 2: [2023-02-03 16:35:10,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 8: [2023-02-03 16:35:10,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 8: [2023-02-03 16:35:10,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 8: [2023-02-03 16:35:10,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 8: [2023-02-03 16:35:10,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 8: [2023-02-03 16:35:10,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 8: [2023-02-03 16:35:10,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 2: [2023-02-03 16:35:10,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 2: [2023-02-03 16:35:10,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 2: [2023-02-03 16:35:10,597] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 4: [2023-02-03 16:35:10,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 15: [2023-02-03 16:35:10,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 4: [2023-02-03 16:35:10,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 4: [2023-02-03 16:35:10,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 4: [2023-02-03 16:35:10,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 4: [2023-02-03 16:35:10,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 4: [2023-02-03 16:35:10,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 2: [2023-02-03 16:35:10,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 5: [2023-02-03 16:35:10,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 5: [2023-02-03 16:35:10,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 5: [2023-02-03 16:35:10,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 5: [2023-02-03 16:35:10,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 5: [2023-02-03 16:35:10,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 5: [2023-02-03 16:35:10,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 5: [2023-02-03 16:35:10,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 2: [2023-02-03 16:35:10,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 5: [2023-02-03 16:35:10,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 15: [2023-02-03 16:35:10,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 15: [2023-02-03 16:35:10,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 15: [2023-02-03 16:35:10,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 15: [2023-02-03 16:35:10,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 14: [2023-02-03 16:35:10,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 5: [2023-02-03 16:35:10,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 2: [2023-02-03 16:35:10,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 5: [2023-02-03 16:35:10,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 5: [2023-02-03 16:35:10,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 5: [2023-02-03 16:35:10,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 5: [2023-02-03 16:35:10,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 5: [2023-02-03 16:35:10,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 5: [2023-02-03 16:35:10,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 5: [2023-02-03 16:35:10,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 15: [2023-02-03 16:35:10,610] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 15: [2023-02-03 16:35:10,610] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 14: [2023-02-03 16:35:10,610] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 2: [2023-02-03 16:35:10,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 14: [2023-02-03 16:35:10,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 14: [2023-02-03 16:35:10,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 14: [2023-02-03 16:35:10,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 14: [2023-02-03 16:35:10,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 14: [2023-02-03 16:35:10,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 14: [2023-02-03 16:35:10,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 14: [2023-02-03 16:35:10,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 14: [2023-02-03 16:35:10,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 4: [2023-02-03 16:35:10,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 2: [2023-02-03 16:35:10,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 15: [2023-02-03 16:35:10,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 15: [2023-02-03 16:35:10,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 14: [2023-02-03 16:35:10,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 2: [2023-02-03 16:35:10,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 2: [2023-02-03 16:35:10,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 9: [2023-02-03 16:35:10,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 14: [2023-02-03 16:35:10,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 4: [2023-02-03 16:35:10,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 14: [2023-02-03 16:35:10,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 14: [2023-02-03 16:35:10,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 14: [2023-02-03 16:35:10,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 14: [2023-02-03 16:35:10,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 8: [2023-02-03 16:35:10,618] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 8: [2023-02-03 16:35:10,618] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 9: [2023-02-03 16:35:10,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 9: [2023-02-03 16:35:10,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 9: [2023-02-03 16:35:10,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 9: [2023-02-03 16:35:10,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 9: [2023-02-03 16:35:10,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 9: [2023-02-03 16:35:10,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 9: [2023-02-03 16:35:10,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 9: [2023-02-03 16:35:10,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 9: [2023-02-03 16:35:10,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 4: [2023-02-03 16:35:10,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 15: [2023-02-03 16:35:10,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 9: [2023-02-03 16:35:10,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 4: [2023-02-03 16:35:10,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 9: [2023-02-03 16:35:10,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 4: [2023-02-03 16:35:10,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 9: [2023-02-03 16:35:10,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 9: [2023-02-03 16:35:10,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 9: [2023-02-03 16:35:10,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 9: [2023-02-03 16:35:10,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 15: [2023-02-03 16:35:10,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 15: [2023-02-03 16:35:10,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 15: [2023-02-03 16:35:10,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 15: [2023-02-03 16:35:10,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 15: [2023-02-03 16:35:10,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 15: [2023-02-03 16:35:10,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 8: [2023-02-03 16:35:10,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 8: [2023-02-03 16:35:10,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 14: [2023-02-03 16:35:10,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 8: [2023-02-03 16:35:10,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 8: [2023-02-03 16:35:10,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 11: [2023-02-03 16:35:10,639] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 3: [2023-02-03 16:35:10,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 8: [2023-02-03 16:35:10,639] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 8: [2023-02-03 16:35:10,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 8: [2023-02-03 16:35:10,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 8: [2023-02-03 16:35:10,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 11: [2023-02-03 16:35:10,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 11: [2023-02-03 16:35:10,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 11: [2023-02-03 16:35:10,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 11: [2023-02-03 16:35:10,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 11: [2023-02-03 16:35:10,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 11: [2023-02-03 16:35:10,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 3: [2023-02-03 16:35:10,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 11: [2023-02-03 16:35:10,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 11: [2023-02-03 16:35:10,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 11: [2023-02-03 16:35:10,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 0: [2023-02-03 16:35:10,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 0: [2023-02-03 16:35:10,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 0: [2023-02-03 16:35:10,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 0: [2023-02-03 16:35:10,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 0: [2023-02-03 16:35:10,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 0: [2023-02-03 16:35:10,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 0: [2023-02-03 16:35:10,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 9: [2023-02-03 16:35:10,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 0: [2023-02-03 16:35:10,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 11: [2023-02-03 16:35:10,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 11: [2023-02-03 16:35:10,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 14: [2023-02-03 16:35:10,643] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 11: [2023-02-03 16:35:10,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 11: [2023-02-03 16:35:10,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 11: [2023-02-03 16:35:10,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 11: [2023-02-03 16:35:10,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 6: [2023-02-03 16:35:10,644] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 6: [2023-02-03 16:35:10,645] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 6: [2023-02-03 16:35:10,645] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 6: [2023-02-03 16:35:10,645] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 3: [2023-02-03 16:35:10,645] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 3: [2023-02-03 16:35:10,645] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 6: [2023-02-03 16:35:10,645] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 6: [2023-02-03 16:35:10,645] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 6: [2023-02-03 16:35:10,645] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 6: [2023-02-03 16:35:10,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 3: [2023-02-03 16:35:10,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 3: [2023-02-03 16:35:10,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 13: [2023-02-03 16:35:10,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 3: [2023-02-03 16:35:10,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 3: [2023-02-03 16:35:10,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 3: [2023-02-03 16:35:10,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 6: [2023-02-03 16:35:10,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 7: [2023-02-03 16:35:10,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 13: [2023-02-03 16:35:10,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 7: [2023-02-03 16:35:10,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 13: [2023-02-03 16:35:10,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 7: [2023-02-03 16:35:10,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 7: [2023-02-03 16:35:10,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 13: [2023-02-03 16:35:10,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 6: [2023-02-03 16:35:10,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 13: [2023-02-03 16:35:10,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 13: [2023-02-03 16:35:10,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 7: [2023-02-03 16:35:10,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 7: [2023-02-03 16:35:10,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 7: [2023-02-03 16:35:10,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 13: [2023-02-03 16:35:10,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 3: [2023-02-03 16:35:10,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 3: [2023-02-03 16:35:10,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 7: [2023-02-03 16:35:10,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 6: [2023-02-03 16:35:10,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 13: [2023-02-03 16:35:10,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 5: [2023-02-03 16:35:10,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 5: [2023-02-03 16:35:10,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 5: [2023-02-03 16:35:10,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 13: [2023-02-03 16:35:10,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 5: [2023-02-03 16:35:10,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 5: [2023-02-03 16:35:10,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 5: [2023-02-03 16:35:10,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 5: [2023-02-03 16:35:10,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 0: [2023-02-03 16:35:10,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 13: [2023-02-03 16:35:10,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 3: [2023-02-03 16:35:10,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 5: [2023-02-03 16:35:10,649] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 0: [2023-02-03 16:35:10,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 0: [2023-02-03 16:35:10,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 13: [2023-02-03 16:35:10,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 7: [2023-02-03 16:35:10,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 7: [2023-02-03 16:35:10,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 0: [2023-02-03 16:35:10,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 6: [2023-02-03 16:35:10,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 0: [2023-02-03 16:35:10,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 0: [2023-02-03 16:35:10,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 0: [2023-02-03 16:35:10,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 0: [2023-02-03 16:35:10,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 6: [2023-02-03 16:35:10,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 7: [2023-02-03 16:35:10,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 6: [2023-02-03 16:35:10,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 6: [2023-02-03 16:35:10,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 13: [2023-02-03 16:35:10,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 3: [2023-02-03 16:35:10,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 3: [2023-02-03 16:35:10,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 7: [2023-02-03 16:35:10,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 3: [2023-02-03 16:35:10,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 3: [2023-02-03 16:35:10,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 6: [2023-02-03 16:35:10,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 13: [2023-02-03 16:35:10,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 14: [2023-02-03 16:35:10,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 13: [2023-02-03 16:35:10,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 7: [2023-02-03 16:35:10,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 7: [2023-02-03 16:35:10,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 7: [2023-02-03 16:35:10,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 7: [2023-02-03 16:35:10,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 9: [2023-02-03 16:35:10,652] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 13: [2023-02-03 16:35:10,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 8: [2023-02-03 16:35:10,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 13: [2023-02-03 16:35:10,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 9: [2023-02-03 16:35:10,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 12: [2023-02-03 16:35:10,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 12: [2023-02-03 16:35:10,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 12: [2023-02-03 16:35:10,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 14: [2023-02-03 16:35:10,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 12: [2023-02-03 16:35:10,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 12: [2023-02-03 16:35:10,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 12: [2023-02-03 16:35:10,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 12: [2023-02-03 16:35:10,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 12: [2023-02-03 16:35:10,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 8: [2023-02-03 16:35:10,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 12: [2023-02-03 16:35:10,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 14: [2023-02-03 16:35:10,657] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 12: [2023-02-03 16:35:10,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 12: [2023-02-03 16:35:10,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 8: [2023-02-03 16:35:10,659] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 3: [2023-02-03 16:35:10,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 12: [2023-02-03 16:35:10,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 12: [2023-02-03 16:35:10,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 12: [2023-02-03 16:35:10,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 12: [2023-02-03 16:35:10,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 12: [2023-02-03 16:35:10,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 9: [2023-02-03 16:35:10,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 8: [2023-02-03 16:35:10,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 8: [2023-02-03 16:35:10,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 8: [2023-02-03 16:35:10,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 5: [2023-02-03 16:35:10,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 5: [2023-02-03 16:35:10,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 5: [2023-02-03 16:35:10,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 14: [2023-02-03 16:35:10,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 14: [2023-02-03 16:35:10,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 5: [2023-02-03 16:35:10,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 9: [2023-02-03 16:35:10,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 11: [2023-02-03 16:35:10,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 5: [2023-02-03 16:35:10,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 9: [2023-02-03 16:35:10,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 5: [2023-02-03 16:35:10,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 5: [2023-02-03 16:35:10,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 14: [2023-02-03 16:35:10,670] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 14: [2023-02-03 16:35:10,670] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 5: [2023-02-03 16:35:10,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 14: [2023-02-03 16:35:10,670] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 6: [2023-02-03 16:35:10,673] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 14: [2023-02-03 16:35:10,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 3: [2023-02-03 16:35:10,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 11: [2023-02-03 16:35:10,677] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 11: [2023-02-03 16:35:10,677] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 9: [2023-02-03 16:35:10,678] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 9: [2023-02-03 16:35:10,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 9: [2023-02-03 16:35:10,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 9: [2023-02-03 16:35:10,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 13: [2023-02-03 16:35:10,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 9: [2023-02-03 16:35:10,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 13: [2023-02-03 16:35:10,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 6: [2023-02-03 16:35:10,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 3: [2023-02-03 16:35:10,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 7: [2023-02-03 16:35:10,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 7: [2023-02-03 16:35:10,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 7: [2023-02-03 16:35:10,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 11: [2023-02-03 16:35:10,684] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 14: [2023-02-03 16:35:10,684] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 12: [2023-02-03 16:35:10,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 14: [2023-02-03 16:35:10,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 11: [2023-02-03 16:35:10,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 6: [2023-02-03 16:35:10,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 11: [2023-02-03 16:35:10,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 11: [2023-02-03 16:35:10,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 11: [2023-02-03 16:35:10,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 11: [2023-02-03 16:35:10,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 3: [2023-02-03 16:35:10,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 3: [2023-02-03 16:35:10,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 13: [2023-02-03 16:35:10,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 14: [2023-02-03 16:35:10,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 14: [2023-02-03 16:35:10,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 6: [2023-02-03 16:35:10,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 7: [2023-02-03 16:35:10,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 14: [2023-02-03 16:35:10,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 9: [2023-02-03 16:35:10,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 11: [2023-02-03 16:35:10,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 6: [2023-02-03 16:35:10,692] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 11: [2023-02-03 16:35:10,692] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 3: [2023-02-03 16:35:10,692] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 6: [2023-02-03 16:35:10,692] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 6: [2023-02-03 16:35:10,692] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 6: [2023-02-03 16:35:10,692] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 13: [2023-02-03 16:35:10,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 13: [2023-02-03 16:35:10,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 12: [2023-02-03 16:35:10,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 12: [2023-02-03 16:35:10,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 9: [2023-02-03 16:35:10,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 7: [2023-02-03 16:35:10,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 7: [2023-02-03 16:35:10,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 7: [2023-02-03 16:35:10,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 7: [2023-02-03 16:35:10,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 13: [2023-02-03 16:35:10,696] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 13: [2023-02-03 16:35:10,696] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 13: [2023-02-03 16:35:10,696] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 7: [2023-02-03 16:35:10,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 13: [2023-02-03 16:35:10,697] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 7: [2023-02-03 16:35:10,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 7: [2023-02-03 16:35:10,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 3: [2023-02-03 16:35:10,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 12: [2023-02-03 16:35:10,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 0: [2023-02-03 16:35:10,699] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 6: [2023-02-03 16:35:10,699] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 12: [2023-02-03 16:35:10,699] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 9: [2023-02-03 16:35:10,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 9: [2023-02-03 16:35:10,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 3: [2023-02-03 16:35:10,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 9: [2023-02-03 16:35:10,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 3: [2023-02-03 16:35:10,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 13: [2023-02-03 16:35:10,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 3: [2023-02-03 16:35:10,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 3: [2023-02-03 16:35:10,702] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 13: [2023-02-03 16:35:10,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 0: [2023-02-03 16:35:10,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 12: [2023-02-03 16:35:10,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 12: [2023-02-03 16:35:10,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 12: [2023-02-03 16:35:10,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 12: [2023-02-03 16:35:10,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 6: [2023-02-03 16:35:10,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 6: [2023-02-03 16:35:10,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 11: [2023-02-03 16:35:10,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 11: [2023-02-03 16:35:10,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 3: [2023-02-03 16:35:10,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 7: [2023-02-03 16:35:10,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 12: [2023-02-03 16:35:10,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 11: [2023-02-03 16:35:10,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 11: [2023-02-03 16:35:10,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 3: [2023-02-03 16:35:10,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 12: [2023-02-03 16:35:10,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 11: [2023-02-03 16:35:10,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 6: [2023-02-03 16:35:10,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 0: [2023-02-03 16:35:10,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 0: [2023-02-03 16:35:10,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 0: [2023-02-03 16:35:10,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 0: [2023-02-03 16:35:10,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 0: [2023-02-03 16:35:10,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 0: [2023-02-03 16:35:10,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 6: [2023-02-03 16:35:10,712] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 13: [2023-02-03 16:35:10,713] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 6: [2023-02-03 16:35:10,713] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 6: [2023-02-03 16:35:10,713] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 0: [2023-02-03 16:35:10,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 13: [2023-02-03 16:35:10,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 3: [2023-02-03 16:35:10,715] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 13: [2023-02-03 16:35:10,715] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 6: [2023-02-03 16:35:10,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 13: [2023-02-03 16:35:10,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 7: [2023-02-03 16:35:10,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 7: [2023-02-03 16:35:10,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 7: [2023-02-03 16:35:10,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 7: [2023-02-03 16:35:10,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 3: [2023-02-03 16:35:10,718] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 13: [2023-02-03 16:35:10,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 0: [2023-02-03 16:35:10,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 3: [2023-02-03 16:35:10,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 12: [2023-02-03 16:35:10,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 12: [2023-02-03 16:35:10,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 12: [2023-02-03 16:35:10,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 12: [2023-02-03 16:35:10,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 12: [2023-02-03 16:35:10,726] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 0: [2023-02-03 16:35:10,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 0: [2023-02-03 16:35:10,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 10: [2023-02-03 16:35:10,732] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 0: [2023-02-03 16:35:10,733] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 0: [2023-02-03 16:35:10,733] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 0: [2023-02-03 16:35:10,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 0: [2023-02-03 16:35:10,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 10: [2023-02-03 16:35:10,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,735] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 10: [2023-02-03 16:35:10,735] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 10: [2023-02-03 16:35:10,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 10: [2023-02-03 16:35:10,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 10: [2023-02-03 16:35:10,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 10: [2023-02-03 16:35:10,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 10: [2023-02-03 16:35:10,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 10: [2023-02-03 16:35:10,737] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,738] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,739] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt... 10: [2023-02-03 16:35:10,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 10: [2023-02-03 16:35:10,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 10: [2023-02-03 16:35:10,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 10: [2023-02-03 16:35:10,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 10: [2023-02-03 16:35:10,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 10: [2023-02-03 16:35:10,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 10: [2023-02-03 16:35:10,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 10: [2023-02-03 16:35:10,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 10: [2023-02-03 16:35:10,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 10: [2023-02-03 16:35:10,795] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_21-model_00-model_states.pt. 10: [2023-02-03 16:35:10,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 10: [2023-02-03 16:35:10,800] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 10: [2023-02-03 16:35:10,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 10: [2023-02-03 16:35:10,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 10: [2023-02-03 16:35:10,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 1: [2023-02-03 16:35:10,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 1: [2023-02-03 16:35:10,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 1: [2023-02-03 16:35:10,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 1: [2023-02-03 16:35:10,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 1: [2023-02-03 16:35:10,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 1: [2023-02-03 16:35:10,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 1: [2023-02-03 16:35:10,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 1: [2023-02-03 16:35:10,811] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 1: [2023-02-03 16:35:10,812] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 10: [2023-02-03 16:35:10,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 1: [2023-02-03 16:35:10,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 1: [2023-02-03 16:35:10,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 1: [2023-02-03 16:35:10,814] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 1: [2023-02-03 16:35:10,815] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 1: [2023-02-03 16:35:10,815] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 1: [2023-02-03 16:35:10,815] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 1: [2023-02-03 16:35:10,815] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 1: [2023-02-03 16:35:10,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 1: [2023-02-03 16:35:10,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 1: [2023-02-03 16:35:10,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 1: [2023-02-03 16:35:10,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 1: [2023-02-03 16:35:10,852] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 1: [2023-02-03 16:35:10,852] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 1: [2023-02-03 16:35:10,852] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 1: [2023-02-03 16:35:10,852] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 8: [2023-02-03 16:35:10,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 8: [2023-02-03 16:35:10,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 1: [2023-02-03 16:35:10,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 1: [2023-02-03 16:35:10,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 1: [2023-02-03 16:35:10,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 1: [2023-02-03 16:35:10,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 1: [2023-02-03 16:35:10,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 8: [2023-02-03 16:35:10,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 8: [2023-02-03 16:35:10,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 8: [2023-02-03 16:35:10,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 8: [2023-02-03 16:35:10,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 8: [2023-02-03 16:35:10,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 8: [2023-02-03 16:35:10,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 8: [2023-02-03 16:35:10,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 8: [2023-02-03 16:35:10,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 1: [2023-02-03 16:35:10,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 1: [2023-02-03 16:35:10,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 1: [2023-02-03 16:35:10,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 8: [2023-02-03 16:35:10,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 8: [2023-02-03 16:35:10,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 4: [2023-02-03 16:35:10,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 4: [2023-02-03 16:35:10,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 4: [2023-02-03 16:35:10,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 8: [2023-02-03 16:35:10,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 8: [2023-02-03 16:35:10,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 8: [2023-02-03 16:35:10,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 8: [2023-02-03 16:35:10,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 4: [2023-02-03 16:35:10,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 4: [2023-02-03 16:35:10,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 4: [2023-02-03 16:35:10,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 14: [2023-02-03 16:35:10,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 4: [2023-02-03 16:35:10,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 14: [2023-02-03 16:35:10,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 14: [2023-02-03 16:35:10,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 14: [2023-02-03 16:35:10,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 14: [2023-02-03 16:35:10,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 14: [2023-02-03 16:35:10,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 14: [2023-02-03 16:35:10,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 14: [2023-02-03 16:35:10,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 14: [2023-02-03 16:35:10,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 14: [2023-02-03 16:35:10,884] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 8: [2023-02-03 16:35:10,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 14: [2023-02-03 16:35:10,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 14: [2023-02-03 16:35:10,889] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 14: [2023-02-03 16:35:10,889] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 14: [2023-02-03 16:35:10,889] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 14: [2023-02-03 16:35:10,889] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 14: [2023-02-03 16:35:10,889] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 5: [2023-02-03 16:35:10,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 5: [2023-02-03 16:35:10,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 5: [2023-02-03 16:35:10,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 5: [2023-02-03 16:35:10,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 5: [2023-02-03 16:35:10,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 5: [2023-02-03 16:35:10,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 5: [2023-02-03 16:35:10,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 5: [2023-02-03 16:35:10,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 8: [2023-02-03 16:35:10,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 5: [2023-02-03 16:35:10,901] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 5: [2023-02-03 16:35:10,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 5: [2023-02-03 16:35:10,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 5: [2023-02-03 16:35:10,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 5: [2023-02-03 16:35:10,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 5: [2023-02-03 16:35:10,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 5: [2023-02-03 16:35:10,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 9: [2023-02-03 16:35:10,903] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 5: [2023-02-03 16:35:10,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 9: [2023-02-03 16:35:10,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 9: [2023-02-03 16:35:10,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 9: [2023-02-03 16:35:10,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 9: [2023-02-03 16:35:10,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 9: [2023-02-03 16:35:10,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 9: [2023-02-03 16:35:10,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 9: [2023-02-03 16:35:10,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 9: [2023-02-03 16:35:10,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 9: [2023-02-03 16:35:10,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 2: [2023-02-03 16:35:10,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 2: [2023-02-03 16:35:10,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 2: [2023-02-03 16:35:10,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 2: [2023-02-03 16:35:10,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 2: [2023-02-03 16:35:10,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 2: [2023-02-03 16:35:10,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 2: [2023-02-03 16:35:10,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 2: [2023-02-03 16:35:10,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 9: [2023-02-03 16:35:10,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 8: [2023-02-03 16:35:10,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 2: [2023-02-03 16:35:10,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 9: [2023-02-03 16:35:10,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 2: [2023-02-03 16:35:10,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,910] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 2: [2023-02-03 16:35:10,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 2: [2023-02-03 16:35:10,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 2: [2023-02-03 16:35:10,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 9: [2023-02-03 16:35:10,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 14: [2023-02-03 16:35:10,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 9: [2023-02-03 16:35:10,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 9: [2023-02-03 16:35:10,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 9: [2023-02-03 16:35:10,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 2: [2023-02-03 16:35:10,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 2: [2023-02-03 16:35:10,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 2: [2023-02-03 16:35:10,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 14: [2023-02-03 16:35:10,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 8: [2023-02-03 16:35:10,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 4: [2023-02-03 16:35:10,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 15: [2023-02-03 16:35:10,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 15: [2023-02-03 16:35:10,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 4: [2023-02-03 16:35:10,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 4: [2023-02-03 16:35:10,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 8: [2023-02-03 16:35:10,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 8: [2023-02-03 16:35:10,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 4: [2023-02-03 16:35:10,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 14: [2023-02-03 16:35:10,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 8: [2023-02-03 16:35:10,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:10,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 15: [2023-02-03 16:35:10,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 15: [2023-02-03 16:35:10,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 15: [2023-02-03 16:35:10,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 4: [2023-02-03 16:35:10,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:10,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 15: [2023-02-03 16:35:10,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 15: [2023-02-03 16:35:10,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 15: [2023-02-03 16:35:10,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 15: [2023-02-03 16:35:10,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 15: [2023-02-03 16:35:10,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 15: [2023-02-03 16:35:10,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 15: [2023-02-03 16:35:10,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 15: [2023-02-03 16:35:10,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 15: [2023-02-03 16:35:10,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 5: [2023-02-03 16:35:10,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 6: [2023-02-03 16:35:10,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 6: [2023-02-03 16:35:10,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 6: [2023-02-03 16:35:10,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 6: [2023-02-03 16:35:10,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 6: [2023-02-03 16:35:10,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 6: [2023-02-03 16:35:10,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 6: [2023-02-03 16:35:10,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 14: [2023-02-03 16:35:10,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 4: [2023-02-03 16:35:10,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 6: [2023-02-03 16:35:10,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 6: [2023-02-03 16:35:10,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 8: [2023-02-03 16:35:10,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 8: [2023-02-03 16:35:10,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 14: [2023-02-03 16:35:10,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 14: [2023-02-03 16:35:10,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 6: [2023-02-03 16:35:10,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 6: [2023-02-03 16:35:10,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 6: [2023-02-03 16:35:10,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 2: [2023-02-03 16:35:10,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 6: [2023-02-03 16:35:10,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 14: [2023-02-03 16:35:10,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 14: [2023-02-03 16:35:10,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 14: [2023-02-03 16:35:10,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 4: [2023-02-03 16:35:10,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 6: [2023-02-03 16:35:10,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 6: [2023-02-03 16:35:10,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 6: [2023-02-03 16:35:10,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 8: [2023-02-03 16:35:10,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 12: [2023-02-03 16:35:10,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 12: [2023-02-03 16:35:10,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 12: [2023-02-03 16:35:10,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 12: [2023-02-03 16:35:10,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 12: [2023-02-03 16:35:10,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 12: [2023-02-03 16:35:10,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 12: [2023-02-03 16:35:10,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 12: [2023-02-03 16:35:10,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 4: [2023-02-03 16:35:10,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 5: [2023-02-03 16:35:10,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 12: [2023-02-03 16:35:10,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 8: [2023-02-03 16:35:10,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 3: [2023-02-03 16:35:10,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 3: [2023-02-03 16:35:10,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 3: [2023-02-03 16:35:10,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 3: [2023-02-03 16:35:10,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 3: [2023-02-03 16:35:10,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 3: [2023-02-03 16:35:10,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 3: [2023-02-03 16:35:10,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 3: [2023-02-03 16:35:10,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 12: [2023-02-03 16:35:10,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 12: [2023-02-03 16:35:10,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 9: [2023-02-03 16:35:10,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 4: [2023-02-03 16:35:10,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 4: [2023-02-03 16:35:10,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 8: [2023-02-03 16:35:10,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 8: [2023-02-03 16:35:10,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 5: [2023-02-03 16:35:10,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 5: [2023-02-03 16:35:10,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 5: [2023-02-03 16:35:10,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 3: [2023-02-03 16:35:10,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 5: [2023-02-03 16:35:10,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 5: [2023-02-03 16:35:10,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 3: [2023-02-03 16:35:10,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 3: [2023-02-03 16:35:10,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 3: [2023-02-03 16:35:10,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 3: [2023-02-03 16:35:10,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 12: [2023-02-03 16:35:10,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 12: [2023-02-03 16:35:10,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 12: [2023-02-03 16:35:10,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 12: [2023-02-03 16:35:10,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 12: [2023-02-03 16:35:10,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 9: [2023-02-03 16:35:10,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 3: [2023-02-03 16:35:10,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 3: [2023-02-03 16:35:10,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 3: [2023-02-03 16:35:10,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 4: [2023-02-03 16:35:10,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 4: [2023-02-03 16:35:10,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:10,948] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 5: [2023-02-03 16:35:10,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 2: [2023-02-03 16:35:10,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 9: [2023-02-03 16:35:10,948] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 2: [2023-02-03 16:35:10,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 2: [2023-02-03 16:35:10,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 11: [2023-02-03 16:35:10,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 11: [2023-02-03 16:35:10,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 11: [2023-02-03 16:35:10,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 11: [2023-02-03 16:35:10,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 11: [2023-02-03 16:35:10,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 11: [2023-02-03 16:35:10,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 11: [2023-02-03 16:35:10,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 11: [2023-02-03 16:35:10,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 4: [2023-02-03 16:35:10,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 2: [2023-02-03 16:35:10,952] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 14: [2023-02-03 16:35:10,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 14: [2023-02-03 16:35:10,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 11: [2023-02-03 16:35:10,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 2: [2023-02-03 16:35:10,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 2: [2023-02-03 16:35:10,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 2: [2023-02-03 16:35:10,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 9: [2023-02-03 16:35:10,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 11: [2023-02-03 16:35:10,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 11: [2023-02-03 16:35:10,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 14: [2023-02-03 16:35:10,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 2: [2023-02-03 16:35:10,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 11: [2023-02-03 16:35:10,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 11: [2023-02-03 16:35:10,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 11: [2023-02-03 16:35:10,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 11: [2023-02-03 16:35:10,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 11: [2023-02-03 16:35:10,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 8: [2023-02-03 16:35:10,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 14: [2023-02-03 16:35:10,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 14: [2023-02-03 16:35:10,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 8: [2023-02-03 16:35:10,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:10,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 7: [2023-02-03 16:35:10,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 7: [2023-02-03 16:35:10,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 7: [2023-02-03 16:35:10,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 5: [2023-02-03 16:35:10,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 9: [2023-02-03 16:35:10,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 7: [2023-02-03 16:35:10,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 7: [2023-02-03 16:35:10,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 7: [2023-02-03 16:35:10,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 7: [2023-02-03 16:35:10,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 14: [2023-02-03 16:35:10,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:10,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 7: [2023-02-03 16:35:10,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 7: [2023-02-03 16:35:10,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 5: [2023-02-03 16:35:10,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 6: [2023-02-03 16:35:10,958] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 7: [2023-02-03 16:35:10,959] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 7: [2023-02-03 16:35:10,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 7: [2023-02-03 16:35:10,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 7: [2023-02-03 16:35:10,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 7: [2023-02-03 16:35:10,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 8: [2023-02-03 16:35:10,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 9: [2023-02-03 16:35:10,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 9: [2023-02-03 16:35:10,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:10,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 15: [2023-02-03 16:35:10,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 5: [2023-02-03 16:35:10,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 9: [2023-02-03 16:35:10,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 9: [2023-02-03 16:35:10,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 9: [2023-02-03 16:35:10,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 12: [2023-02-03 16:35:10,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 9: [2023-02-03 16:35:10,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 5: [2023-02-03 16:35:10,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 5: [2023-02-03 16:35:10,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 5: [2023-02-03 16:35:10,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 5: [2023-02-03 16:35:10,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:10,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 15: [2023-02-03 16:35:10,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 15: [2023-02-03 16:35:10,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 15: [2023-02-03 16:35:10,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 2: [2023-02-03 16:35:10,970] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 6: [2023-02-03 16:35:10,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 6: [2023-02-03 16:35:10,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 2: [2023-02-03 16:35:10,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:10,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 2: [2023-02-03 16:35:10,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 14: [2023-02-03 16:35:10,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:10,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 5: [2023-02-03 16:35:10,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 2: [2023-02-03 16:35:10,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 2: [2023-02-03 16:35:10,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 6: [2023-02-03 16:35:10,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 9: [2023-02-03 16:35:10,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 2: [2023-02-03 16:35:10,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 2: [2023-02-03 16:35:10,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 6: [2023-02-03 16:35:10,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 3: [2023-02-03 16:35:10,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 9: [2023-02-03 16:35:10,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:10,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 3: [2023-02-03 16:35:10,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 3: [2023-02-03 16:35:10,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 3: [2023-02-03 16:35:10,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 12: [2023-02-03 16:35:10,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 12: [2023-02-03 16:35:10,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 6: [2023-02-03 16:35:10,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 6: [2023-02-03 16:35:10,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 6: [2023-02-03 16:35:10,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 12: [2023-02-03 16:35:10,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 9: [2023-02-03 16:35:10,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 9: [2023-02-03 16:35:10,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 9: [2023-02-03 16:35:10,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:10,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 15: [2023-02-03 16:35:10,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:10,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 12: [2023-02-03 16:35:10,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 12: [2023-02-03 16:35:10,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 12: [2023-02-03 16:35:10,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 12: [2023-02-03 16:35:10,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 15: [2023-02-03 16:35:10,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:10,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 11: [2023-02-03 16:35:10,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 11: [2023-02-03 16:35:10,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 11: [2023-02-03 16:35:10,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 13: [2023-02-03 16:35:10,987] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 3: [2023-02-03 16:35:10,987] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 6: [2023-02-03 16:35:10,988] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 6: [2023-02-03 16:35:10,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 12: [2023-02-03 16:35:10,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 6: [2023-02-03 16:35:10,988] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:10,989] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 13: [2023-02-03 16:35:10,989] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 3: [2023-02-03 16:35:10,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 15: [2023-02-03 16:35:10,990] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 13: [2023-02-03 16:35:10,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 13: [2023-02-03 16:35:10,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 13: [2023-02-03 16:35:10,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 13: [2023-02-03 16:35:10,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 13: [2023-02-03 16:35:10,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 13: [2023-02-03 16:35:10,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 13: [2023-02-03 16:35:10,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 3: [2023-02-03 16:35:10,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 3: [2023-02-03 16:35:10,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 3: [2023-02-03 16:35:10,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 3: [2023-02-03 16:35:10,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 13: [2023-02-03 16:35:10,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 12: [2023-02-03 16:35:10,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 3: [2023-02-03 16:35:10,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 3: [2023-02-03 16:35:10,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 13: [2023-02-03 16:35:10,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 7: [2023-02-03 16:35:10,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 7: [2023-02-03 16:35:10,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 13: [2023-02-03 16:35:10,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 6: [2023-02-03 16:35:10,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 13: [2023-02-03 16:35:10,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 13: [2023-02-03 16:35:10,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 13: [2023-02-03 16:35:10,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 13: [2023-02-03 16:35:10,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 6: [2023-02-03 16:35:10,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 6: [2023-02-03 16:35:10,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 11: [2023-02-03 16:35:10,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 11: [2023-02-03 16:35:10,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 11: [2023-02-03 16:35:10,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 12: [2023-02-03 16:35:10,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 11: [2023-02-03 16:35:10,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 11: [2023-02-03 16:35:10,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 11: [2023-02-03 16:35:10,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 6: [2023-02-03 16:35:10,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:10,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:11,000] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 11: [2023-02-03 16:35:11,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 11: [2023-02-03 16:35:11,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:11,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 7: [2023-02-03 16:35:11,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 7: [2023-02-03 16:35:11,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 7: [2023-02-03 16:35:11,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 6: [2023-02-03 16:35:11,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 3: [2023-02-03 16:35:11,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 12: [2023-02-03 16:35:11,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 3: [2023-02-03 16:35:11,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 12: [2023-02-03 16:35:11,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:11,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:11,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 12: [2023-02-03 16:35:11,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 3: [2023-02-03 16:35:11,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 12: [2023-02-03 16:35:11,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 12: [2023-02-03 16:35:11,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 3: [2023-02-03 16:35:11,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 11: [2023-02-03 16:35:11,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 11: [2023-02-03 16:35:11,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 11: [2023-02-03 16:35:11,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 11: [2023-02-03 16:35:11,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 11: [2023-02-03 16:35:11,018] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 13: [2023-02-03 16:35:11,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 7: [2023-02-03 16:35:11,021] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:11,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:11,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:11,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:11,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 13: [2023-02-03 16:35:11,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 13: [2023-02-03 16:35:11,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 13: [2023-02-03 16:35:11,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 13: [2023-02-03 16:35:11,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 0: [2023-02-03 16:35:11,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 0: [2023-02-03 16:35:11,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 0: [2023-02-03 16:35:11,035] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 0: [2023-02-03 16:35:11,035] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 0: [2023-02-03 16:35:11,035] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 0: [2023-02-03 16:35:11,035] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 0: [2023-02-03 16:35:11,035] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 0: [2023-02-03 16:35:11,035] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 0: [2023-02-03 16:35:11,037] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 0: [2023-02-03 16:35:11,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 13: [2023-02-03 16:35:11,040] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 13: [2023-02-03 16:35:11,040] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 0: [2023-02-03 16:35:11,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 0: [2023-02-03 16:35:11,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 0: [2023-02-03 16:35:11,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 0: [2023-02-03 16:35:11,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 0: [2023-02-03 16:35:11,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 0: [2023-02-03 16:35:11,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 13: [2023-02-03 16:35:11,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 13: [2023-02-03 16:35:11,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 13: [2023-02-03 16:35:11,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 10: [2023-02-03 16:35:11,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 10: [2023-02-03 16:35:11,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 10: [2023-02-03 16:35:11,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 10: [2023-02-03 16:35:11,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 10: [2023-02-03 16:35:11,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 10: [2023-02-03 16:35:11,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 10: [2023-02-03 16:35:11,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 10: [2023-02-03 16:35:11,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 10: [2023-02-03 16:35:11,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 10: [2023-02-03 16:35:11,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 13: [2023-02-03 16:35:11,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 10: [2023-02-03 16:35:11,048] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 10: [2023-02-03 16:35:11,048] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 10: [2023-02-03 16:35:11,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 10: [2023-02-03 16:35:11,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 10: [2023-02-03 16:35:11,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 10: [2023-02-03 16:35:11,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt... 13: [2023-02-03 16:35:11,051] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 13: [2023-02-03 16:35:11,059] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 13: [2023-02-03 16:35:11,059] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 13: [2023-02-03 16:35:11,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 13: [2023-02-03 16:35:11,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 10: [2023-02-03 16:35:11,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 10: [2023-02-03 16:35:11,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 0: [2023-02-03 16:35:11,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 10: [2023-02-03 16:35:11,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 4: [2023-02-03 16:35:11,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 4: [2023-02-03 16:35:11,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 0: [2023-02-03 16:35:11,088] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 10: [2023-02-03 16:35:11,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 10: [2023-02-03 16:35:11,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 4: [2023-02-03 16:35:11,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 4: [2023-02-03 16:35:11,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 4: [2023-02-03 16:35:11,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 4: [2023-02-03 16:35:11,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 4: [2023-02-03 16:35:11,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 4: [2023-02-03 16:35:11,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 4: [2023-02-03 16:35:11,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 4: [2023-02-03 16:35:11,092] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 4: [2023-02-03 16:35:11,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 10: [2023-02-03 16:35:11,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 4: [2023-02-03 16:35:11,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 4: [2023-02-03 16:35:11,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 4: [2023-02-03 16:35:11,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 4: [2023-02-03 16:35:11,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 4: [2023-02-03 16:35:11,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 0: [2023-02-03 16:35:11,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 10: [2023-02-03 16:35:11,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 10: [2023-02-03 16:35:11,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 10: [2023-02-03 16:35:11,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 10: [2023-02-03 16:35:11,103] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 0: [2023-02-03 16:35:11,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 0: [2023-02-03 16:35:11,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 0: [2023-02-03 16:35:11,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 0: [2023-02-03 16:35:11,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 0: [2023-02-03 16:35:11,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 0: [2023-02-03 16:35:11,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 0: [2023-02-03 16:35:11,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 2: [2023-02-03 16:35:11,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 2: [2023-02-03 16:35:11,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 2: [2023-02-03 16:35:11,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 2: [2023-02-03 16:35:11,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 2: [2023-02-03 16:35:11,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 2: [2023-02-03 16:35:11,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 2: [2023-02-03 16:35:11,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 2: [2023-02-03 16:35:11,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 4: [2023-02-03 16:35:11,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 2: [2023-02-03 16:35:11,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 2: [2023-02-03 16:35:11,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 2: [2023-02-03 16:35:11,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 2: [2023-02-03 16:35:11,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 2: [2023-02-03 16:35:11,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 2: [2023-02-03 16:35:11,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 2: [2023-02-03 16:35:11,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 10: [2023-02-03 16:35:11,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 2: [2023-02-03 16:35:11,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 10: [2023-02-03 16:35:11,111] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_22-model_00-model_states.pt. 10: [2023-02-03 16:35:11,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 10: [2023-02-03 16:35:11,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 10: [2023-02-03 16:35:11,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 4: [2023-02-03 16:35:11,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 4: [2023-02-03 16:35:11,126] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 0: [2023-02-03 16:35:11,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 0: [2023-02-03 16:35:11,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 0: [2023-02-03 16:35:11,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 0: [2023-02-03 16:35:11,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 0: [2023-02-03 16:35:11,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 10: [2023-02-03 16:35:11,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 0: [2023-02-03 16:35:11,131] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 4: [2023-02-03 16:35:11,135] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 4: [2023-02-03 16:35:11,135] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 4: [2023-02-03 16:35:11,135] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 4: [2023-02-03 16:35:11,136] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 4: [2023-02-03 16:35:11,136] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 4: [2023-02-03 16:35:11,136] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 8: [2023-02-03 16:35:11,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 8: [2023-02-03 16:35:11,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 8: [2023-02-03 16:35:11,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 8: [2023-02-03 16:35:11,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 8: [2023-02-03 16:35:11,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 8: [2023-02-03 16:35:11,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 8: [2023-02-03 16:35:11,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 8: [2023-02-03 16:35:11,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 4: [2023-02-03 16:35:11,144] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 8: [2023-02-03 16:35:11,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 8: [2023-02-03 16:35:11,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 2: [2023-02-03 16:35:11,148] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 2: [2023-02-03 16:35:11,148] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 2: [2023-02-03 16:35:11,148] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 2: [2023-02-03 16:35:11,148] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 8: [2023-02-03 16:35:11,148] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 8: [2023-02-03 16:35:11,148] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 8: [2023-02-03 16:35:11,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 8: [2023-02-03 16:35:11,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 8: [2023-02-03 16:35:11,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 8: [2023-02-03 16:35:11,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 2: [2023-02-03 16:35:11,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 2: [2023-02-03 16:35:11,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 4: [2023-02-03 16:35:11,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 4: [2023-02-03 16:35:11,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 2: [2023-02-03 16:35:11,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 2: [2023-02-03 16:35:11,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 4: [2023-02-03 16:35:11,155] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 4: [2023-02-03 16:35:11,155] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 4: [2023-02-03 16:35:11,155] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 14: [2023-02-03 16:35:11,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 4: [2023-02-03 16:35:11,158] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 14: [2023-02-03 16:35:11,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 14: [2023-02-03 16:35:11,159] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 14: [2023-02-03 16:35:11,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 14: [2023-02-03 16:35:11,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 14: [2023-02-03 16:35:11,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 14: [2023-02-03 16:35:11,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 14: [2023-02-03 16:35:11,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 14: [2023-02-03 16:35:11,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 2: [2023-02-03 16:35:11,160] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 14: [2023-02-03 16:35:11,161] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 5: [2023-02-03 16:35:11,162] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 5: [2023-02-03 16:35:11,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 5: [2023-02-03 16:35:11,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 5: [2023-02-03 16:35:11,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 14: [2023-02-03 16:35:11,163] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 5: [2023-02-03 16:35:11,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 5: [2023-02-03 16:35:11,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 5: [2023-02-03 16:35:11,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 5: [2023-02-03 16:35:11,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 14: [2023-02-03 16:35:11,163] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 2: [2023-02-03 16:35:11,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 2: [2023-02-03 16:35:11,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 2: [2023-02-03 16:35:11,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 14: [2023-02-03 16:35:11,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 14: [2023-02-03 16:35:11,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 14: [2023-02-03 16:35:11,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 14: [2023-02-03 16:35:11,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 5: [2023-02-03 16:35:11,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 5: [2023-02-03 16:35:11,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 5: [2023-02-03 16:35:11,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 5: [2023-02-03 16:35:11,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 5: [2023-02-03 16:35:11,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 5: [2023-02-03 16:35:11,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 5: [2023-02-03 16:35:11,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 5: [2023-02-03 16:35:11,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 2: [2023-02-03 16:35:11,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 2: [2023-02-03 16:35:11,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 2: [2023-02-03 16:35:11,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 8: [2023-02-03 16:35:11,176] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 1: [2023-02-03 16:35:11,176] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 1: [2023-02-03 16:35:11,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 2: [2023-02-03 16:35:11,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 1: [2023-02-03 16:35:11,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 1: [2023-02-03 16:35:11,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 1: [2023-02-03 16:35:11,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 1: [2023-02-03 16:35:11,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 1: [2023-02-03 16:35:11,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 1: [2023-02-03 16:35:11,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 1: [2023-02-03 16:35:11,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 8: [2023-02-03 16:35:11,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 1: [2023-02-03 16:35:11,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 1: [2023-02-03 16:35:11,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 1: [2023-02-03 16:35:11,182] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 1: [2023-02-03 16:35:11,182] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 1: [2023-02-03 16:35:11,182] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 1: [2023-02-03 16:35:11,182] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 1: [2023-02-03 16:35:11,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:11,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 15: [2023-02-03 16:35:11,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 15: [2023-02-03 16:35:11,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 15: [2023-02-03 16:35:11,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 15: [2023-02-03 16:35:11,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 15: [2023-02-03 16:35:11,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 15: [2023-02-03 16:35:11,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 15: [2023-02-03 16:35:11,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 15: [2023-02-03 16:35:11,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:11,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:11,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:11,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:11,187] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:11,187] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:11,187] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:11,188] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 14: [2023-02-03 16:35:11,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 8: [2023-02-03 16:35:11,191] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 8: [2023-02-03 16:35:11,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 8: [2023-02-03 16:35:11,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 8: [2023-02-03 16:35:11,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 14: [2023-02-03 16:35:11,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 8: [2023-02-03 16:35:11,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 8: [2023-02-03 16:35:11,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 8: [2023-02-03 16:35:11,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 8: [2023-02-03 16:35:11,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 6: [2023-02-03 16:35:11,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 6: [2023-02-03 16:35:11,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 6: [2023-02-03 16:35:11,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 6: [2023-02-03 16:35:11,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 6: [2023-02-03 16:35:11,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 6: [2023-02-03 16:35:11,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 6: [2023-02-03 16:35:11,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 6: [2023-02-03 16:35:11,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 14: [2023-02-03 16:35:11,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 6: [2023-02-03 16:35:11,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 6: [2023-02-03 16:35:11,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 6: [2023-02-03 16:35:11,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 6: [2023-02-03 16:35:11,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 5: [2023-02-03 16:35:11,203] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 5: [2023-02-03 16:35:11,203] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 5: [2023-02-03 16:35:11,203] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 6: [2023-02-03 16:35:11,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 6: [2023-02-03 16:35:11,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 6: [2023-02-03 16:35:11,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 6: [2023-02-03 16:35:11,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 5: [2023-02-03 16:35:11,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 5: [2023-02-03 16:35:11,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 5: [2023-02-03 16:35:11,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 5: [2023-02-03 16:35:11,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 5: [2023-02-03 16:35:11,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 14: [2023-02-03 16:35:11,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 14: [2023-02-03 16:35:11,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 1: [2023-02-03 16:35:11,209] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 14: [2023-02-03 16:35:11,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 8: [2023-02-03 16:35:11,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 8: [2023-02-03 16:35:11,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 14: [2023-02-03 16:35:11,214] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 14: [2023-02-03 16:35:11,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 8: [2023-02-03 16:35:11,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 1: [2023-02-03 16:35:11,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 1: [2023-02-03 16:35:11,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 8: [2023-02-03 16:35:11,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 8: [2023-02-03 16:35:11,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 8: [2023-02-03 16:35:11,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 1: [2023-02-03 16:35:11,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 15: [2023-02-03 16:35:11,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 1: [2023-02-03 16:35:11,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 5: [2023-02-03 16:35:11,223] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 15: [2023-02-03 16:35:11,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 15: [2023-02-03 16:35:11,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 15: [2023-02-03 16:35:11,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 15: [2023-02-03 16:35:11,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 1: [2023-02-03 16:35:11,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 1: [2023-02-03 16:35:11,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 12: [2023-02-03 16:35:11,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 12: [2023-02-03 16:35:11,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 5: [2023-02-03 16:35:11,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 5: [2023-02-03 16:35:11,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 5: [2023-02-03 16:35:11,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 14: [2023-02-03 16:35:11,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 12: [2023-02-03 16:35:11,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 12: [2023-02-03 16:35:11,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 12: [2023-02-03 16:35:11,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 12: [2023-02-03 16:35:11,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 12: [2023-02-03 16:35:11,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 12: [2023-02-03 16:35:11,225] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 1: [2023-02-03 16:35:11,225] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 14: [2023-02-03 16:35:11,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 5: [2023-02-03 16:35:11,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 5: [2023-02-03 16:35:11,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 5: [2023-02-03 16:35:11,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 15: [2023-02-03 16:35:11,226] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 15: [2023-02-03 16:35:11,226] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 12: [2023-02-03 16:35:11,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 14: [2023-02-03 16:35:11,226] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 14: [2023-02-03 16:35:11,226] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 12: [2023-02-03 16:35:11,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 12: [2023-02-03 16:35:11,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 1: [2023-02-03 16:35:11,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 5: [2023-02-03 16:35:11,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 12: [2023-02-03 16:35:11,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 12: [2023-02-03 16:35:11,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 12: [2023-02-03 16:35:11,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 12: [2023-02-03 16:35:11,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 12: [2023-02-03 16:35:11,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 14: [2023-02-03 16:35:11,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 15: [2023-02-03 16:35:11,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 14: [2023-02-03 16:35:11,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 1: [2023-02-03 16:35:11,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 1: [2023-02-03 16:35:11,235] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 1: [2023-02-03 16:35:11,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 6: [2023-02-03 16:35:11,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 15: [2023-02-03 16:35:11,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 6: [2023-02-03 16:35:11,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 1: [2023-02-03 16:35:11,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 6: [2023-02-03 16:35:11,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 3: [2023-02-03 16:35:11,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 3: [2023-02-03 16:35:11,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 3: [2023-02-03 16:35:11,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 3: [2023-02-03 16:35:11,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 3: [2023-02-03 16:35:11,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 3: [2023-02-03 16:35:11,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 3: [2023-02-03 16:35:11,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 3: [2023-02-03 16:35:11,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 6: [2023-02-03 16:35:11,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 15: [2023-02-03 16:35:11,241] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 15: [2023-02-03 16:35:11,241] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,241] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:11,241] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 3: [2023-02-03 16:35:11,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 3: [2023-02-03 16:35:11,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 14: [2023-02-03 16:35:11,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 9: [2023-02-03 16:35:11,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 9: [2023-02-03 16:35:11,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 11: [2023-02-03 16:35:11,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 11: [2023-02-03 16:35:11,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 11: [2023-02-03 16:35:11,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 11: [2023-02-03 16:35:11,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 11: [2023-02-03 16:35:11,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 11: [2023-02-03 16:35:11,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 11: [2023-02-03 16:35:11,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 11: [2023-02-03 16:35:11,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 14: [2023-02-03 16:35:11,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 1: [2023-02-03 16:35:11,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 1: [2023-02-03 16:35:11,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 1: [2023-02-03 16:35:11,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:11,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 3: [2023-02-03 16:35:11,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 3: [2023-02-03 16:35:11,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 11: [2023-02-03 16:35:11,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:11,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 15: [2023-02-03 16:35:11,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 11: [2023-02-03 16:35:11,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 11: [2023-02-03 16:35:11,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 11: [2023-02-03 16:35:11,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 6: [2023-02-03 16:35:11,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 11: [2023-02-03 16:35:11,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 11: [2023-02-03 16:35:11,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 11: [2023-02-03 16:35:11,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 11: [2023-02-03 16:35:11,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 15: [2023-02-03 16:35:11,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 9: [2023-02-03 16:35:11,250] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 6: [2023-02-03 16:35:11,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 6: [2023-02-03 16:35:11,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 6: [2023-02-03 16:35:11,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 9: [2023-02-03 16:35:11,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 9: [2023-02-03 16:35:11,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 9: [2023-02-03 16:35:11,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 9: [2023-02-03 16:35:11,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 9: [2023-02-03 16:35:11,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 9: [2023-02-03 16:35:11,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 9: [2023-02-03 16:35:11,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 9: [2023-02-03 16:35:11,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 12: [2023-02-03 16:35:11,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 6: [2023-02-03 16:35:11,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 6: [2023-02-03 16:35:11,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 6: [2023-02-03 16:35:11,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 9: [2023-02-03 16:35:11,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 9: [2023-02-03 16:35:11,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 9: [2023-02-03 16:35:11,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 9: [2023-02-03 16:35:11,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 9: [2023-02-03 16:35:11,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 6: [2023-02-03 16:35:11,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 12: [2023-02-03 16:35:11,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 12: [2023-02-03 16:35:11,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 9: [2023-02-03 16:35:11,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 12: [2023-02-03 16:35:11,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 6: [2023-02-03 16:35:11,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 6: [2023-02-03 16:35:11,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 12: [2023-02-03 16:35:11,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 3: [2023-02-03 16:35:11,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 6: [2023-02-03 16:35:11,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 6: [2023-02-03 16:35:11,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 12: [2023-02-03 16:35:11,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 12: [2023-02-03 16:35:11,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 12: [2023-02-03 16:35:11,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 12: [2023-02-03 16:35:11,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 11: [2023-02-03 16:35:11,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 3: [2023-02-03 16:35:11,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 12: [2023-02-03 16:35:11,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 7: [2023-02-03 16:35:11,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 7: [2023-02-03 16:35:11,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 7: [2023-02-03 16:35:11,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 7: [2023-02-03 16:35:11,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 7: [2023-02-03 16:35:11,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 7: [2023-02-03 16:35:11,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 7: [2023-02-03 16:35:11,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 0: [2023-02-03 16:35:11,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 0: [2023-02-03 16:35:11,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 7: [2023-02-03 16:35:11,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 0: [2023-02-03 16:35:11,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 0: [2023-02-03 16:35:11,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 0: [2023-02-03 16:35:11,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 0: [2023-02-03 16:35:11,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 0: [2023-02-03 16:35:11,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 0: [2023-02-03 16:35:11,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 3: [2023-02-03 16:35:11,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 3: [2023-02-03 16:35:11,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 9: [2023-02-03 16:35:11,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 7: [2023-02-03 16:35:11,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:11,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:11,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 0: [2023-02-03 16:35:11,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:11,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 12: [2023-02-03 16:35:11,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 11: [2023-02-03 16:35:11,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 7: [2023-02-03 16:35:11,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:11,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:11,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:11,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 0: [2023-02-03 16:35:11,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 0: [2023-02-03 16:35:11,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 0: [2023-02-03 16:35:11,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 0: [2023-02-03 16:35:11,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 0: [2023-02-03 16:35:11,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 3: [2023-02-03 16:35:11,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 0: [2023-02-03 16:35:11,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 0: [2023-02-03 16:35:11,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 9: [2023-02-03 16:35:11,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 11: [2023-02-03 16:35:11,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 11: [2023-02-03 16:35:11,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 3: [2023-02-03 16:35:11,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 12: [2023-02-03 16:35:11,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 11: [2023-02-03 16:35:11,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 11: [2023-02-03 16:35:11,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 9: [2023-02-03 16:35:11,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 11: [2023-02-03 16:35:11,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 11: [2023-02-03 16:35:11,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 3: [2023-02-03 16:35:11,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 3: [2023-02-03 16:35:11,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 3: [2023-02-03 16:35:11,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 11: [2023-02-03 16:35:11,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 3: [2023-02-03 16:35:11,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 12: [2023-02-03 16:35:11,292] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 12: [2023-02-03 16:35:11,292] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 12: [2023-02-03 16:35:11,292] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 12: [2023-02-03 16:35:11,292] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 11: [2023-02-03 16:35:11,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 9: [2023-02-03 16:35:11,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 11: [2023-02-03 16:35:11,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 9: [2023-02-03 16:35:11,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 9: [2023-02-03 16:35:11,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 9: [2023-02-03 16:35:11,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 9: [2023-02-03 16:35:11,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 11: [2023-02-03 16:35:11,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 11: [2023-02-03 16:35:11,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 9: [2023-02-03 16:35:11,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 11: [2023-02-03 16:35:11,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 11: [2023-02-03 16:35:11,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 11: [2023-02-03 16:35:11,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 9: [2023-02-03 16:35:11,312] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 7: [2023-02-03 16:35:11,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 7: [2023-02-03 16:35:11,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 7: [2023-02-03 16:35:11,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 10: [2023-02-03 16:35:11,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 10: [2023-02-03 16:35:11,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 7: [2023-02-03 16:35:11,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 10: [2023-02-03 16:35:11,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 10: [2023-02-03 16:35:11,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 10: [2023-02-03 16:35:11,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 10: [2023-02-03 16:35:11,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 10: [2023-02-03 16:35:11,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 10: [2023-02-03 16:35:11,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 10: [2023-02-03 16:35:11,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 13: [2023-02-03 16:35:11,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 10: [2023-02-03 16:35:11,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 13: [2023-02-03 16:35:11,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 10: [2023-02-03 16:35:11,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 13: [2023-02-03 16:35:11,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 13: [2023-02-03 16:35:11,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 13: [2023-02-03 16:35:11,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 13: [2023-02-03 16:35:11,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 9: [2023-02-03 16:35:11,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 13: [2023-02-03 16:35:11,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 13: [2023-02-03 16:35:11,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 13: [2023-02-03 16:35:11,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 10: [2023-02-03 16:35:11,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:11,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 7: [2023-02-03 16:35:11,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 7: [2023-02-03 16:35:11,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 7: [2023-02-03 16:35:11,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 10: [2023-02-03 16:35:11,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 10: [2023-02-03 16:35:11,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 10: [2023-02-03 16:35:11,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 13: [2023-02-03 16:35:11,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 10: [2023-02-03 16:35:11,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 13: [2023-02-03 16:35:11,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 9: [2023-02-03 16:35:11,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 9: [2023-02-03 16:35:11,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 0: [2023-02-03 16:35:11,327] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 9: [2023-02-03 16:35:11,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 9: [2023-02-03 16:35:11,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 7: [2023-02-03 16:35:11,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 13: [2023-02-03 16:35:11,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 13: [2023-02-03 16:35:11,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 13: [2023-02-03 16:35:11,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:11,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 13: [2023-02-03 16:35:11,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 13: [2023-02-03 16:35:11,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt... 7: [2023-02-03 16:35:11,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 0: [2023-02-03 16:35:11,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 7: [2023-02-03 16:35:11,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 7: [2023-02-03 16:35:11,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 7: [2023-02-03 16:35:11,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 0: [2023-02-03 16:35:11,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 7: [2023-02-03 16:35:11,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 7: [2023-02-03 16:35:11,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 0: [2023-02-03 16:35:11,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 0: [2023-02-03 16:35:11,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 0: [2023-02-03 16:35:11,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 0: [2023-02-03 16:35:11,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 0: [2023-02-03 16:35:11,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 0: [2023-02-03 16:35:11,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 10: [2023-02-03 16:35:11,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 13: [2023-02-03 16:35:11,355] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 13: [2023-02-03 16:35:11,356] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 0: [2023-02-03 16:35:11,358] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 13: [2023-02-03 16:35:11,359] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 10: [2023-02-03 16:35:11,359] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 10: [2023-02-03 16:35:11,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 0: [2023-02-03 16:35:11,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 10: [2023-02-03 16:35:11,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 10: [2023-02-03 16:35:11,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 10: [2023-02-03 16:35:11,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 13: [2023-02-03 16:35:11,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 0: [2023-02-03 16:35:11,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 0: [2023-02-03 16:35:11,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 13: [2023-02-03 16:35:11,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 13: [2023-02-03 16:35:11,371] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 0: [2023-02-03 16:35:11,373] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 0: [2023-02-03 16:35:11,373] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 13: [2023-02-03 16:35:11,373] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 10: [2023-02-03 16:35:11,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 13: [2023-02-03 16:35:11,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 13: [2023-02-03 16:35:11,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 13: [2023-02-03 16:35:11,376] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 0: [2023-02-03 16:35:11,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 13: [2023-02-03 16:35:11,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 10: [2023-02-03 16:35:11,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 10: [2023-02-03 16:35:11,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 10: [2023-02-03 16:35:11,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 10: [2023-02-03 16:35:11,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_23-model_00-model_states.pt. 13: [2023-02-03 16:35:11,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 10: [2023-02-03 16:35:11,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 10: [2023-02-03 16:35:11,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 13: [2023-02-03 16:35:11,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 13: [2023-02-03 16:35:11,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 13: [2023-02-03 16:35:11,394] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 10: [2023-02-03 16:35:11,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 13: [2023-02-03 16:35:11,396] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 10: [2023-02-03 16:35:11,397] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 10: [2023-02-03 16:35:11,399] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 4: [2023-02-03 16:35:11,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 4: [2023-02-03 16:35:11,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 4: [2023-02-03 16:35:11,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 4: [2023-02-03 16:35:11,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 4: [2023-02-03 16:35:11,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 4: [2023-02-03 16:35:11,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 4: [2023-02-03 16:35:11,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 4: [2023-02-03 16:35:11,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 4: [2023-02-03 16:35:11,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 4: [2023-02-03 16:35:11,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 4: [2023-02-03 16:35:11,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 4: [2023-02-03 16:35:11,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 4: [2023-02-03 16:35:11,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 4: [2023-02-03 16:35:11,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 4: [2023-02-03 16:35:11,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 4: [2023-02-03 16:35:11,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 14: [2023-02-03 16:35:11,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 14: [2023-02-03 16:35:11,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 14: [2023-02-03 16:35:11,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 14: [2023-02-03 16:35:11,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 14: [2023-02-03 16:35:11,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 14: [2023-02-03 16:35:11,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 14: [2023-02-03 16:35:11,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 14: [2023-02-03 16:35:11,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 14: [2023-02-03 16:35:11,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 14: [2023-02-03 16:35:11,469] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 2: [2023-02-03 16:35:11,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 2: [2023-02-03 16:35:11,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 2: [2023-02-03 16:35:11,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 2: [2023-02-03 16:35:11,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 2: [2023-02-03 16:35:11,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 2: [2023-02-03 16:35:11,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 2: [2023-02-03 16:35:11,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 2: [2023-02-03 16:35:11,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 2: [2023-02-03 16:35:11,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 14: [2023-02-03 16:35:11,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 14: [2023-02-03 16:35:11,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 14: [2023-02-03 16:35:11,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 14: [2023-02-03 16:35:11,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 14: [2023-02-03 16:35:11,475] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 14: [2023-02-03 16:35:11,475] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 2: [2023-02-03 16:35:11,476] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 2: [2023-02-03 16:35:11,476] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 2: [2023-02-03 16:35:11,476] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 2: [2023-02-03 16:35:11,477] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 2: [2023-02-03 16:35:11,478] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 2: [2023-02-03 16:35:11,478] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 2: [2023-02-03 16:35:11,478] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 4: [2023-02-03 16:35:11,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 5: [2023-02-03 16:35:11,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 5: [2023-02-03 16:35:11,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 5: [2023-02-03 16:35:11,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 5: [2023-02-03 16:35:11,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 5: [2023-02-03 16:35:11,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 5: [2023-02-03 16:35:11,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 5: [2023-02-03 16:35:11,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 5: [2023-02-03 16:35:11,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 5: [2023-02-03 16:35:11,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 5: [2023-02-03 16:35:11,485] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 5: [2023-02-03 16:35:11,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 5: [2023-02-03 16:35:11,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 5: [2023-02-03 16:35:11,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 5: [2023-02-03 16:35:11,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 5: [2023-02-03 16:35:11,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 5: [2023-02-03 16:35:11,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 4: [2023-02-03 16:35:11,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 4: [2023-02-03 16:35:11,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 4: [2023-02-03 16:35:11,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 4: [2023-02-03 16:35:11,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 4: [2023-02-03 16:35:11,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 4: [2023-02-03 16:35:11,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 4: [2023-02-03 16:35:11,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 4: [2023-02-03 16:35:11,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 2: [2023-02-03 16:35:11,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 14: [2023-02-03 16:35:11,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 14: [2023-02-03 16:35:11,502] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 5: [2023-02-03 16:35:11,504] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 4: [2023-02-03 16:35:11,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 8: [2023-02-03 16:35:11,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 8: [2023-02-03 16:35:11,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 8: [2023-02-03 16:35:11,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 8: [2023-02-03 16:35:11,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 4: [2023-02-03 16:35:11,507] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 4: [2023-02-03 16:35:11,507] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 4: [2023-02-03 16:35:11,507] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 2: [2023-02-03 16:35:11,508] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 4: [2023-02-03 16:35:11,507] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 4: [2023-02-03 16:35:11,508] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 4: [2023-02-03 16:35:11,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 1: [2023-02-03 16:35:11,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 1: [2023-02-03 16:35:11,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 1: [2023-02-03 16:35:11,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 1: [2023-02-03 16:35:11,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 1: [2023-02-03 16:35:11,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 1: [2023-02-03 16:35:11,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 1: [2023-02-03 16:35:11,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 1: [2023-02-03 16:35:11,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 9: [2023-02-03 16:35:11,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 9: [2023-02-03 16:35:11,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 9: [2023-02-03 16:35:11,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 9: [2023-02-03 16:35:11,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 9: [2023-02-03 16:35:11,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 9: [2023-02-03 16:35:11,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 9: [2023-02-03 16:35:11,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 9: [2023-02-03 16:35:11,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 9: [2023-02-03 16:35:11,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 9: [2023-02-03 16:35:11,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 1: [2023-02-03 16:35:11,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 1: [2023-02-03 16:35:11,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 1: [2023-02-03 16:35:11,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 1: [2023-02-03 16:35:11,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 9: [2023-02-03 16:35:11,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 1: [2023-02-03 16:35:11,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 1: [2023-02-03 16:35:11,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 1: [2023-02-03 16:35:11,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 1: [2023-02-03 16:35:11,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 9: [2023-02-03 16:35:11,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 9: [2023-02-03 16:35:11,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 9: [2023-02-03 16:35:11,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 8: [2023-02-03 16:35:11,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 9: [2023-02-03 16:35:11,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 9: [2023-02-03 16:35:11,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 2: [2023-02-03 16:35:11,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 2: [2023-02-03 16:35:11,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 15: [2023-02-03 16:35:11,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 14: [2023-02-03 16:35:11,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 15: [2023-02-03 16:35:11,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 15: [2023-02-03 16:35:11,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 15: [2023-02-03 16:35:11,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 15: [2023-02-03 16:35:11,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 15: [2023-02-03 16:35:11,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 15: [2023-02-03 16:35:11,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 15: [2023-02-03 16:35:11,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 2: [2023-02-03 16:35:11,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 2: [2023-02-03 16:35:11,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 2: [2023-02-03 16:35:11,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 2: [2023-02-03 16:35:11,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 15: [2023-02-03 16:35:11,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 15: [2023-02-03 16:35:11,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 15: [2023-02-03 16:35:11,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 15: [2023-02-03 16:35:11,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 15: [2023-02-03 16:35:11,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 15: [2023-02-03 16:35:11,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 15: [2023-02-03 16:35:11,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 15: [2023-02-03 16:35:11,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 2: [2023-02-03 16:35:11,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 14: [2023-02-03 16:35:11,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 6: [2023-02-03 16:35:11,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 5: [2023-02-03 16:35:11,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 14: [2023-02-03 16:35:11,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 14: [2023-02-03 16:35:11,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 14: [2023-02-03 16:35:11,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 14: [2023-02-03 16:35:11,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 14: [2023-02-03 16:35:11,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 5: [2023-02-03 16:35:11,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 5: [2023-02-03 16:35:11,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 14: [2023-02-03 16:35:11,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 6: [2023-02-03 16:35:11,527] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 6: [2023-02-03 16:35:11,527] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 6: [2023-02-03 16:35:11,527] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 6: [2023-02-03 16:35:11,527] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 6: [2023-02-03 16:35:11,527] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 6: [2023-02-03 16:35:11,527] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 6: [2023-02-03 16:35:11,527] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 5: [2023-02-03 16:35:11,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 5: [2023-02-03 16:35:11,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 5: [2023-02-03 16:35:11,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 6: [2023-02-03 16:35:11,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 3: [2023-02-03 16:35:11,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 3: [2023-02-03 16:35:11,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 6: [2023-02-03 16:35:11,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 3: [2023-02-03 16:35:11,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 3: [2023-02-03 16:35:11,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 3: [2023-02-03 16:35:11,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 3: [2023-02-03 16:35:11,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 3: [2023-02-03 16:35:11,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 5: [2023-02-03 16:35:11,531] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 5: [2023-02-03 16:35:11,531] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 6: [2023-02-03 16:35:11,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 6: [2023-02-03 16:35:11,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 6: [2023-02-03 16:35:11,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 6: [2023-02-03 16:35:11,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 2: [2023-02-03 16:35:11,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 2: [2023-02-03 16:35:11,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 2: [2023-02-03 16:35:11,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 2: [2023-02-03 16:35:11,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 2: [2023-02-03 16:35:11,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 2: [2023-02-03 16:35:11,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 8: [2023-02-03 16:35:11,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 2: [2023-02-03 16:35:11,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 5: [2023-02-03 16:35:11,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 5: [2023-02-03 16:35:11,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 14: [2023-02-03 16:35:11,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 14: [2023-02-03 16:35:11,543] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 14: [2023-02-03 16:35:11,543] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 14: [2023-02-03 16:35:11,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 14: [2023-02-03 16:35:11,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 14: [2023-02-03 16:35:11,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 5: [2023-02-03 16:35:11,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 9: [2023-02-03 16:35:11,546] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 5: [2023-02-03 16:35:11,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,547] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 15: [2023-02-03 16:35:11,547] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 5: [2023-02-03 16:35:11,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 5: [2023-02-03 16:35:11,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 5: [2023-02-03 16:35:11,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 8: [2023-02-03 16:35:11,547] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 9: [2023-02-03 16:35:11,548] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 1: [2023-02-03 16:35:11,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 1: [2023-02-03 16:35:11,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 1: [2023-02-03 16:35:11,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 1: [2023-02-03 16:35:11,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 1: [2023-02-03 16:35:11,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 1: [2023-02-03 16:35:11,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 1: [2023-02-03 16:35:11,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 1: [2023-02-03 16:35:11,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 9: [2023-02-03 16:35:11,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 9: [2023-02-03 16:35:11,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 15: [2023-02-03 16:35:11,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 15: [2023-02-03 16:35:11,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 12: [2023-02-03 16:35:11,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 12: [2023-02-03 16:35:11,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 12: [2023-02-03 16:35:11,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 12: [2023-02-03 16:35:11,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 12: [2023-02-03 16:35:11,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 12: [2023-02-03 16:35:11,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 12: [2023-02-03 16:35:11,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 12: [2023-02-03 16:35:11,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 12: [2023-02-03 16:35:11,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 12: [2023-02-03 16:35:11,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 12: [2023-02-03 16:35:11,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 6: [2023-02-03 16:35:11,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 12: [2023-02-03 16:35:11,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 12: [2023-02-03 16:35:11,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 12: [2023-02-03 16:35:11,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 12: [2023-02-03 16:35:11,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 0: [2023-02-03 16:35:11,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 0: [2023-02-03 16:35:11,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 12: [2023-02-03 16:35:11,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 0: [2023-02-03 16:35:11,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 0: [2023-02-03 16:35:11,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 0: [2023-02-03 16:35:11,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 0: [2023-02-03 16:35:11,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 0: [2023-02-03 16:35:11,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 0: [2023-02-03 16:35:11,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 9: [2023-02-03 16:35:11,560] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 0: [2023-02-03 16:35:11,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 9: [2023-02-03 16:35:11,562] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 0: [2023-02-03 16:35:11,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 9: [2023-02-03 16:35:11,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 15: [2023-02-03 16:35:11,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 15: [2023-02-03 16:35:11,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 15: [2023-02-03 16:35:11,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 6: [2023-02-03 16:35:11,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 0: [2023-02-03 16:35:11,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 15: [2023-02-03 16:35:11,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 0: [2023-02-03 16:35:11,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 15: [2023-02-03 16:35:11,565] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 15: [2023-02-03 16:35:11,565] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 0: [2023-02-03 16:35:11,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 0: [2023-02-03 16:35:11,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 0: [2023-02-03 16:35:11,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 0: [2023-02-03 16:35:11,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,565] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 3: [2023-02-03 16:35:11,565] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 15: [2023-02-03 16:35:11,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 3: [2023-02-03 16:35:11,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 3: [2023-02-03 16:35:11,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 1: [2023-02-03 16:35:11,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 9: [2023-02-03 16:35:11,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 8: [2023-02-03 16:35:11,568] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 15: [2023-02-03 16:35:11,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 9: [2023-02-03 16:35:11,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 1: [2023-02-03 16:35:11,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 1: [2023-02-03 16:35:11,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 1: [2023-02-03 16:35:11,571] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 1: [2023-02-03 16:35:11,571] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 1: [2023-02-03 16:35:11,572] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 1: [2023-02-03 16:35:11,572] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 8: [2023-02-03 16:35:11,572] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 1: [2023-02-03 16:35:11,573] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 11: [2023-02-03 16:35:11,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 11: [2023-02-03 16:35:11,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 11: [2023-02-03 16:35:11,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 11: [2023-02-03 16:35:11,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 11: [2023-02-03 16:35:11,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 11: [2023-02-03 16:35:11,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 11: [2023-02-03 16:35:11,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 9: [2023-02-03 16:35:11,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 11: [2023-02-03 16:35:11,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 3: [2023-02-03 16:35:11,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 6: [2023-02-03 16:35:11,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 6: [2023-02-03 16:35:11,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 6: [2023-02-03 16:35:11,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 11: [2023-02-03 16:35:11,576] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 11: [2023-02-03 16:35:11,576] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 11: [2023-02-03 16:35:11,576] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 11: [2023-02-03 16:35:11,577] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 11: [2023-02-03 16:35:11,577] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 9: [2023-02-03 16:35:11,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 11: [2023-02-03 16:35:11,577] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 11: [2023-02-03 16:35:11,577] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 11: [2023-02-03 16:35:11,577] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 8: [2023-02-03 16:35:11,577] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 15: [2023-02-03 16:35:11,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 15: [2023-02-03 16:35:11,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 15: [2023-02-03 16:35:11,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 3: [2023-02-03 16:35:11,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 9: [2023-02-03 16:35:11,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 15: [2023-02-03 16:35:11,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 15: [2023-02-03 16:35:11,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 8: [2023-02-03 16:35:11,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 3: [2023-02-03 16:35:11,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 3: [2023-02-03 16:35:11,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 3: [2023-02-03 16:35:11,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 3: [2023-02-03 16:35:11,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,583] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 7: [2023-02-03 16:35:11,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 7: [2023-02-03 16:35:11,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 7: [2023-02-03 16:35:11,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 9: [2023-02-03 16:35:11,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 7: [2023-02-03 16:35:11,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 7: [2023-02-03 16:35:11,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 7: [2023-02-03 16:35:11,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 7: [2023-02-03 16:35:11,585] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 7: [2023-02-03 16:35:11,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 7: [2023-02-03 16:35:11,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 8: [2023-02-03 16:35:11,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 7: [2023-02-03 16:35:11,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 7: [2023-02-03 16:35:11,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 7: [2023-02-03 16:35:11,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 7: [2023-02-03 16:35:11,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 3: [2023-02-03 16:35:11,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 9: [2023-02-03 16:35:11,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 12: [2023-02-03 16:35:11,591] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 12: [2023-02-03 16:35:11,591] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 12: [2023-02-03 16:35:11,591] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 6: [2023-02-03 16:35:11,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 3: [2023-02-03 16:35:11,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,593] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 9: [2023-02-03 16:35:11,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 12: [2023-02-03 16:35:11,596] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 9: [2023-02-03 16:35:11,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 12: [2023-02-03 16:35:11,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 12: [2023-02-03 16:35:11,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 12: [2023-02-03 16:35:11,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 12: [2023-02-03 16:35:11,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 3: [2023-02-03 16:35:11,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 3: [2023-02-03 16:35:11,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 12: [2023-02-03 16:35:11,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 12: [2023-02-03 16:35:11,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 12: [2023-02-03 16:35:11,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 0: [2023-02-03 16:35:11,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 11: [2023-02-03 16:35:11,609] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 11: [2023-02-03 16:35:11,609] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 6: [2023-02-03 16:35:11,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 10: [2023-02-03 16:35:11,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 10: [2023-02-03 16:35:11,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 10: [2023-02-03 16:35:11,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 10: [2023-02-03 16:35:11,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 10: [2023-02-03 16:35:11,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 10: [2023-02-03 16:35:11,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 10: [2023-02-03 16:35:11,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 10: [2023-02-03 16:35:11,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 10: [2023-02-03 16:35:11,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 12: [2023-02-03 16:35:11,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 10: [2023-02-03 16:35:11,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 10: [2023-02-03 16:35:11,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 10: [2023-02-03 16:35:11,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 10: [2023-02-03 16:35:11,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 10: [2023-02-03 16:35:11,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 10: [2023-02-03 16:35:11,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 10: [2023-02-03 16:35:11,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 12: [2023-02-03 16:35:11,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 12: [2023-02-03 16:35:11,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 11: [2023-02-03 16:35:11,617] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 12: [2023-02-03 16:35:11,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 12: [2023-02-03 16:35:11,618] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 0: [2023-02-03 16:35:11,618] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 11: [2023-02-03 16:35:11,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 11: [2023-02-03 16:35:11,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 11: [2023-02-03 16:35:11,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 7: [2023-02-03 16:35:11,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 7: [2023-02-03 16:35:11,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 11: [2023-02-03 16:35:11,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 11: [2023-02-03 16:35:11,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 11: [2023-02-03 16:35:11,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 11: [2023-02-03 16:35:11,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 0: [2023-02-03 16:35:11,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 0: [2023-02-03 16:35:11,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 0: [2023-02-03 16:35:11,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 0: [2023-02-03 16:35:11,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 0: [2023-02-03 16:35:11,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 7: [2023-02-03 16:35:11,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 7: [2023-02-03 16:35:11,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 0: [2023-02-03 16:35:11,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 0: [2023-02-03 16:35:11,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 11: [2023-02-03 16:35:11,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 7: [2023-02-03 16:35:11,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 7: [2023-02-03 16:35:11,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 7: [2023-02-03 16:35:11,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 0: [2023-02-03 16:35:11,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 11: [2023-02-03 16:35:11,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 11: [2023-02-03 16:35:11,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 11: [2023-02-03 16:35:11,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 11: [2023-02-03 16:35:11,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 11: [2023-02-03 16:35:11,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 10: [2023-02-03 16:35:11,644] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 10: [2023-02-03 16:35:11,644] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 0: [2023-02-03 16:35:11,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 0: [2023-02-03 16:35:11,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 0: [2023-02-03 16:35:11,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 0: [2023-02-03 16:35:11,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 0: [2023-02-03 16:35:11,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 10: [2023-02-03 16:35:11,652] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 0: [2023-02-03 16:35:11,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 10: [2023-02-03 16:35:11,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 10: [2023-02-03 16:35:11,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 10: [2023-02-03 16:35:11,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 10: [2023-02-03 16:35:11,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 10: [2023-02-03 16:35:11,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 10: [2023-02-03 16:35:11,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 13: [2023-02-03 16:35:11,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 13: [2023-02-03 16:35:11,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 13: [2023-02-03 16:35:11,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 13: [2023-02-03 16:35:11,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 13: [2023-02-03 16:35:11,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 13: [2023-02-03 16:35:11,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 13: [2023-02-03 16:35:11,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 13: [2023-02-03 16:35:11,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 13: [2023-02-03 16:35:11,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 13: [2023-02-03 16:35:11,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 10: [2023-02-03 16:35:11,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 13: [2023-02-03 16:35:11,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 13: [2023-02-03 16:35:11,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 13: [2023-02-03 16:35:11,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 13: [2023-02-03 16:35:11,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 13: [2023-02-03 16:35:11,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 13: [2023-02-03 16:35:11,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt... 10: [2023-02-03 16:35:11,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 10: [2023-02-03 16:35:11,683] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 10: [2023-02-03 16:35:11,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 10: [2023-02-03 16:35:11,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 10: [2023-02-03 16:35:11,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 10: [2023-02-03 16:35:11,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 13: [2023-02-03 16:35:11,697] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 13: [2023-02-03 16:35:11,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 4: [2023-02-03 16:35:11,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 4: [2023-02-03 16:35:11,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 13: [2023-02-03 16:35:11,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 4: [2023-02-03 16:35:11,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 4: [2023-02-03 16:35:11,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 4: [2023-02-03 16:35:11,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 4: [2023-02-03 16:35:11,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 4: [2023-02-03 16:35:11,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 4: [2023-02-03 16:35:11,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 4: [2023-02-03 16:35:11,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 13: [2023-02-03 16:35:11,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 4: [2023-02-03 16:35:11,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 13: [2023-02-03 16:35:11,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 4: [2023-02-03 16:35:11,710] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 4: [2023-02-03 16:35:11,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 4: [2023-02-03 16:35:11,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 4: [2023-02-03 16:35:11,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 4: [2023-02-03 16:35:11,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 4: [2023-02-03 16:35:11,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 13: [2023-02-03 16:35:11,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 13: [2023-02-03 16:35:11,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 13: [2023-02-03 16:35:11,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 13: [2023-02-03 16:35:11,719] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 13: [2023-02-03 16:35:11,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 13: [2023-02-03 16:35:11,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 13: [2023-02-03 16:35:11,732] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_24-model_00-model_states.pt. 4: [2023-02-03 16:35:11,732] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 13: [2023-02-03 16:35:11,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 13: [2023-02-03 16:35:11,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 13: [2023-02-03 16:35:11,737] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 4: [2023-02-03 16:35:11,741] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 4: [2023-02-03 16:35:11,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 13: [2023-02-03 16:35:11,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 5: [2023-02-03 16:35:11,746] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 5: [2023-02-03 16:35:11,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 5: [2023-02-03 16:35:11,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 5: [2023-02-03 16:35:11,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 5: [2023-02-03 16:35:11,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 5: [2023-02-03 16:35:11,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 5: [2023-02-03 16:35:11,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 5: [2023-02-03 16:35:11,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 4: [2023-02-03 16:35:11,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 5: [2023-02-03 16:35:11,749] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 5: [2023-02-03 16:35:11,750] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 5: [2023-02-03 16:35:11,750] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 5: [2023-02-03 16:35:11,750] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 5: [2023-02-03 16:35:11,750] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 4: [2023-02-03 16:35:11,750] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:11,751] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 5: [2023-02-03 16:35:11,751] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 5: [2023-02-03 16:35:11,751] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 4: [2023-02-03 16:35:11,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 4: [2023-02-03 16:35:11,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 4: [2023-02-03 16:35:11,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 4: [2023-02-03 16:35:11,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 4: [2023-02-03 16:35:11,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 4: [2023-02-03 16:35:11,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 4: [2023-02-03 16:35:11,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 4: [2023-02-03 16:35:11,773] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 4: [2023-02-03 16:35:11,773] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 4: [2023-02-03 16:35:11,773] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 4: [2023-02-03 16:35:11,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:11,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 5: [2023-02-03 16:35:11,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 5: [2023-02-03 16:35:11,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 5: [2023-02-03 16:35:11,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 5: [2023-02-03 16:35:11,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 5: [2023-02-03 16:35:11,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 5: [2023-02-03 16:35:11,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 5: [2023-02-03 16:35:11,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 5: [2023-02-03 16:35:11,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 2: [2023-02-03 16:35:11,803] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 2: [2023-02-03 16:35:11,803] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 2: [2023-02-03 16:35:11,803] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 2: [2023-02-03 16:35:11,803] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 2: [2023-02-03 16:35:11,803] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 2: [2023-02-03 16:35:11,803] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 2: [2023-02-03 16:35:11,803] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 2: [2023-02-03 16:35:11,803] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 2: [2023-02-03 16:35:11,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 2: [2023-02-03 16:35:11,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 2: [2023-02-03 16:35:11,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 2: [2023-02-03 16:35:11,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 2: [2023-02-03 16:35:11,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 2: [2023-02-03 16:35:11,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 2: [2023-02-03 16:35:11,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 2: [2023-02-03 16:35:11,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 5: [2023-02-03 16:35:11,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:11,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:11,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:11,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:11,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:11,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:11,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 8: [2023-02-03 16:35:11,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 8: [2023-02-03 16:35:11,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 2: [2023-02-03 16:35:11,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,836] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 6: [2023-02-03 16:35:11,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 6: [2023-02-03 16:35:11,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 6: [2023-02-03 16:35:11,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 6: [2023-02-03 16:35:11,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 6: [2023-02-03 16:35:11,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 6: [2023-02-03 16:35:11,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 6: [2023-02-03 16:35:11,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 8: [2023-02-03 16:35:11,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 8: [2023-02-03 16:35:11,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 8: [2023-02-03 16:35:11,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 8: [2023-02-03 16:35:11,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 8: [2023-02-03 16:35:11,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 2: [2023-02-03 16:35:11,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 2: [2023-02-03 16:35:11,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 2: [2023-02-03 16:35:11,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 2: [2023-02-03 16:35:11,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 6: [2023-02-03 16:35:11,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 2: [2023-02-03 16:35:11,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 2: [2023-02-03 16:35:11,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 2: [2023-02-03 16:35:11,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 2: [2023-02-03 16:35:11,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 2: [2023-02-03 16:35:11,856] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 14: [2023-02-03 16:35:11,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 14: [2023-02-03 16:35:11,860] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 2: [2023-02-03 16:35:11,860] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 2: [2023-02-03 16:35:11,861] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 14: [2023-02-03 16:35:11,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 2: [2023-02-03 16:35:11,861] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 14: [2023-02-03 16:35:11,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 14: [2023-02-03 16:35:11,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 14: [2023-02-03 16:35:11,861] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 14: [2023-02-03 16:35:11,861] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 14: [2023-02-03 16:35:11,861] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 14: [2023-02-03 16:35:11,861] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 14: [2023-02-03 16:35:11,861] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 1: [2023-02-03 16:35:11,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 1: [2023-02-03 16:35:11,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 1: [2023-02-03 16:35:11,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 1: [2023-02-03 16:35:11,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 3: [2023-02-03 16:35:11,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 3: [2023-02-03 16:35:11,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 3: [2023-02-03 16:35:11,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 1: [2023-02-03 16:35:11,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 1: [2023-02-03 16:35:11,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 1: [2023-02-03 16:35:11,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 1: [2023-02-03 16:35:11,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 14: [2023-02-03 16:35:11,864] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 14: [2023-02-03 16:35:11,864] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 14: [2023-02-03 16:35:11,864] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 15: [2023-02-03 16:35:11,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 3: [2023-02-03 16:35:11,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 3: [2023-02-03 16:35:11,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 3: [2023-02-03 16:35:11,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 3: [2023-02-03 16:35:11,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 1: [2023-02-03 16:35:11,865] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 3: [2023-02-03 16:35:11,865] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 3: [2023-02-03 16:35:11,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 1: [2023-02-03 16:35:11,865] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 1: [2023-02-03 16:35:11,865] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 8: [2023-02-03 16:35:11,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 15: [2023-02-03 16:35:11,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 14: [2023-02-03 16:35:11,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 3: [2023-02-03 16:35:11,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 3: [2023-02-03 16:35:11,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 15: [2023-02-03 16:35:11,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 15: [2023-02-03 16:35:11,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 1: [2023-02-03 16:35:11,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 1: [2023-02-03 16:35:11,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 15: [2023-02-03 16:35:11,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 15: [2023-02-03 16:35:11,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 15: [2023-02-03 16:35:11,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 3: [2023-02-03 16:35:11,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 14: [2023-02-03 16:35:11,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 14: [2023-02-03 16:35:11,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 15: [2023-02-03 16:35:11,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 15: [2023-02-03 16:35:11,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 1: [2023-02-03 16:35:11,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 2: [2023-02-03 16:35:11,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 1: [2023-02-03 16:35:11,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 1: [2023-02-03 16:35:11,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 15: [2023-02-03 16:35:11,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 3: [2023-02-03 16:35:11,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 2: [2023-02-03 16:35:11,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 2: [2023-02-03 16:35:11,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 3: [2023-02-03 16:35:11,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 3: [2023-02-03 16:35:11,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 3: [2023-02-03 16:35:11,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 15: [2023-02-03 16:35:11,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 15: [2023-02-03 16:35:11,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 15: [2023-02-03 16:35:11,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 15: [2023-02-03 16:35:11,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 15: [2023-02-03 16:35:11,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 15: [2023-02-03 16:35:11,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 9: [2023-02-03 16:35:11,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 9: [2023-02-03 16:35:11,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 9: [2023-02-03 16:35:11,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 9: [2023-02-03 16:35:11,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 9: [2023-02-03 16:35:11,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 9: [2023-02-03 16:35:11,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 9: [2023-02-03 16:35:11,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 9: [2023-02-03 16:35:11,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 9: [2023-02-03 16:35:11,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 9: [2023-02-03 16:35:11,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 12: [2023-02-03 16:35:11,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 12: [2023-02-03 16:35:11,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 12: [2023-02-03 16:35:11,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 12: [2023-02-03 16:35:11,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 12: [2023-02-03 16:35:11,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 12: [2023-02-03 16:35:11,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 12: [2023-02-03 16:35:11,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 12: [2023-02-03 16:35:11,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 12: [2023-02-03 16:35:11,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 6: [2023-02-03 16:35:11,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 6: [2023-02-03 16:35:11,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 7: [2023-02-03 16:35:11,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 12: [2023-02-03 16:35:11,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 12: [2023-02-03 16:35:11,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 9: [2023-02-03 16:35:11,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 7: [2023-02-03 16:35:11,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 7: [2023-02-03 16:35:11,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 7: [2023-02-03 16:35:11,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 7: [2023-02-03 16:35:11,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 7: [2023-02-03 16:35:11,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 7: [2023-02-03 16:35:11,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 12: [2023-02-03 16:35:11,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 9: [2023-02-03 16:35:11,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 8: [2023-02-03 16:35:11,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 9: [2023-02-03 16:35:11,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 12: [2023-02-03 16:35:11,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 12: [2023-02-03 16:35:11,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 12: [2023-02-03 16:35:11,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 12: [2023-02-03 16:35:11,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 9: [2023-02-03 16:35:11,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 9: [2023-02-03 16:35:11,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 9: [2023-02-03 16:35:11,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 11: [2023-02-03 16:35:11,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 11: [2023-02-03 16:35:11,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 11: [2023-02-03 16:35:11,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 11: [2023-02-03 16:35:11,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 11: [2023-02-03 16:35:11,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 11: [2023-02-03 16:35:11,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 11: [2023-02-03 16:35:11,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 11: [2023-02-03 16:35:11,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 6: [2023-02-03 16:35:11,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 6: [2023-02-03 16:35:11,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 11: [2023-02-03 16:35:11,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 14: [2023-02-03 16:35:11,886] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 11: [2023-02-03 16:35:11,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 11: [2023-02-03 16:35:11,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 11: [2023-02-03 16:35:11,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 11: [2023-02-03 16:35:11,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 11: [2023-02-03 16:35:11,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 11: [2023-02-03 16:35:11,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 11: [2023-02-03 16:35:11,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 6: [2023-02-03 16:35:11,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 9: [2023-02-03 16:35:11,890] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 14: [2023-02-03 16:35:11,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 6: [2023-02-03 16:35:11,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 6: [2023-02-03 16:35:11,892] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 3: [2023-02-03 16:35:11,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 1: [2023-02-03 16:35:11,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 15: [2023-02-03 16:35:11,897] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 3: [2023-02-03 16:35:11,897] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 6: [2023-02-03 16:35:11,899] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 6: [2023-02-03 16:35:11,899] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 1: [2023-02-03 16:35:11,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 6: [2023-02-03 16:35:11,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 0: [2023-02-03 16:35:11,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 0: [2023-02-03 16:35:11,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 0: [2023-02-03 16:35:11,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 0: [2023-02-03 16:35:11,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 0: [2023-02-03 16:35:11,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 0: [2023-02-03 16:35:11,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 0: [2023-02-03 16:35:11,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 0: [2023-02-03 16:35:11,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 3: [2023-02-03 16:35:11,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 3: [2023-02-03 16:35:11,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 0: [2023-02-03 16:35:11,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 0: [2023-02-03 16:35:11,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 9: [2023-02-03 16:35:11,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 14: [2023-02-03 16:35:11,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 6: [2023-02-03 16:35:11,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 6: [2023-02-03 16:35:11,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 1: [2023-02-03 16:35:11,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 14: [2023-02-03 16:35:11,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 0: [2023-02-03 16:35:11,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 0: [2023-02-03 16:35:11,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 0: [2023-02-03 16:35:11,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 0: [2023-02-03 16:35:11,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 0: [2023-02-03 16:35:11,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 0: [2023-02-03 16:35:11,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 12: [2023-02-03 16:35:11,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 1: [2023-02-03 16:35:11,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 1: [2023-02-03 16:35:11,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 15: [2023-02-03 16:35:11,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 14: [2023-02-03 16:35:11,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 1: [2023-02-03 16:35:11,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 1: [2023-02-03 16:35:11,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 1: [2023-02-03 16:35:11,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 15: [2023-02-03 16:35:11,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 1: [2023-02-03 16:35:11,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 15: [2023-02-03 16:35:11,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 15: [2023-02-03 16:35:11,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 15: [2023-02-03 16:35:11,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 6: [2023-02-03 16:35:11,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 15: [2023-02-03 16:35:11,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 6: [2023-02-03 16:35:11,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 3: [2023-02-03 16:35:11,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 8: [2023-02-03 16:35:11,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 3: [2023-02-03 16:35:11,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 8: [2023-02-03 16:35:11,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 14: [2023-02-03 16:35:11,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 14: [2023-02-03 16:35:11,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 3: [2023-02-03 16:35:11,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 9: [2023-02-03 16:35:11,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 15: [2023-02-03 16:35:11,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 8: [2023-02-03 16:35:11,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 7: [2023-02-03 16:35:11,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 7: [2023-02-03 16:35:11,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 7: [2023-02-03 16:35:11,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 3: [2023-02-03 16:35:11,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 15: [2023-02-03 16:35:11,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 12: [2023-02-03 16:35:11,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 6: [2023-02-03 16:35:11,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 3: [2023-02-03 16:35:11,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 14: [2023-02-03 16:35:11,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 3: [2023-02-03 16:35:11,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 12: [2023-02-03 16:35:11,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 3: [2023-02-03 16:35:11,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 1: [2023-02-03 16:35:11,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 3: [2023-02-03 16:35:11,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 12: [2023-02-03 16:35:11,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 11: [2023-02-03 16:35:11,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 11: [2023-02-03 16:35:11,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 12: [2023-02-03 16:35:11,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 7: [2023-02-03 16:35:11,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 14: [2023-02-03 16:35:11,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 14: [2023-02-03 16:35:11,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 12: [2023-02-03 16:35:11,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 12: [2023-02-03 16:35:11,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 12: [2023-02-03 16:35:11,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 14: [2023-02-03 16:35:11,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 1: [2023-02-03 16:35:11,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 1: [2023-02-03 16:35:11,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 9: [2023-02-03 16:35:11,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 9: [2023-02-03 16:35:11,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 1: [2023-02-03 16:35:11,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 15: [2023-02-03 16:35:11,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 11: [2023-02-03 16:35:11,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 10: [2023-02-03 16:35:11,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 15: [2023-02-03 16:35:11,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 7: [2023-02-03 16:35:11,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 7: [2023-02-03 16:35:11,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 15: [2023-02-03 16:35:11,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 7: [2023-02-03 16:35:11,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 15: [2023-02-03 16:35:11,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 7: [2023-02-03 16:35:11,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 10: [2023-02-03 16:35:11,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 10: [2023-02-03 16:35:11,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 15: [2023-02-03 16:35:11,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 15: [2023-02-03 16:35:11,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:11,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 13: [2023-02-03 16:35:11,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 10: [2023-02-03 16:35:11,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 10: [2023-02-03 16:35:11,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 10: [2023-02-03 16:35:11,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 10: [2023-02-03 16:35:11,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 10: [2023-02-03 16:35:11,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 10: [2023-02-03 16:35:11,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 9: [2023-02-03 16:35:11,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 7: [2023-02-03 16:35:11,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 10: [2023-02-03 16:35:11,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 7: [2023-02-03 16:35:11,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 7: [2023-02-03 16:35:11,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:11,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 13: [2023-02-03 16:35:11,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 13: [2023-02-03 16:35:11,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 13: [2023-02-03 16:35:11,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 13: [2023-02-03 16:35:11,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 13: [2023-02-03 16:35:11,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 13: [2023-02-03 16:35:11,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 10: [2023-02-03 16:35:11,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 10: [2023-02-03 16:35:11,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 1: [2023-02-03 16:35:11,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 1: [2023-02-03 16:35:11,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 1: [2023-02-03 16:35:11,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 9: [2023-02-03 16:35:11,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 11: [2023-02-03 16:35:11,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 11: [2023-02-03 16:35:11,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 11: [2023-02-03 16:35:11,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 10: [2023-02-03 16:35:11,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 15: [2023-02-03 16:35:11,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:11,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 11: [2023-02-03 16:35:11,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 14: [2023-02-03 16:35:11,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 12: [2023-02-03 16:35:11,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 10: [2023-02-03 16:35:11,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 10: [2023-02-03 16:35:11,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 10: [2023-02-03 16:35:11,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 12: [2023-02-03 16:35:11,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 3: [2023-02-03 16:35:11,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:11,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 14: [2023-02-03 16:35:11,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:11,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 13: [2023-02-03 16:35:11,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 11: [2023-02-03 16:35:11,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 13: [2023-02-03 16:35:11,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 13: [2023-02-03 16:35:11,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 9: [2023-02-03 16:35:11,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 13: [2023-02-03 16:35:11,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt... 9: [2023-02-03 16:35:11,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 3: [2023-02-03 16:35:11,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 12: [2023-02-03 16:35:11,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 11: [2023-02-03 16:35:11,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 11: [2023-02-03 16:35:11,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 14: [2023-02-03 16:35:11,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 9: [2023-02-03 16:35:11,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 3: [2023-02-03 16:35:11,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 3: [2023-02-03 16:35:11,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 7: [2023-02-03 16:35:11,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 14: [2023-02-03 16:35:11,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 11: [2023-02-03 16:35:11,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 12: [2023-02-03 16:35:11,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 12: [2023-02-03 16:35:11,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 9: [2023-02-03 16:35:11,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 12: [2023-02-03 16:35:11,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 14: [2023-02-03 16:35:11,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 12: [2023-02-03 16:35:11,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 7: [2023-02-03 16:35:11,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 0: [2023-02-03 16:35:11,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 7: [2023-02-03 16:35:11,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 7: [2023-02-03 16:35:11,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 7: [2023-02-03 16:35:11,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 12: [2023-02-03 16:35:11,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 0: [2023-02-03 16:35:11,948] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 11: [2023-02-03 16:35:11,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 9: [2023-02-03 16:35:11,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 9: [2023-02-03 16:35:11,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 11: [2023-02-03 16:35:11,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 11: [2023-02-03 16:35:11,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 11: [2023-02-03 16:35:11,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 11: [2023-02-03 16:35:11,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 9: [2023-02-03 16:35:11,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:11,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 9: [2023-02-03 16:35:11,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 9: [2023-02-03 16:35:11,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:11,958] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 10: [2023-02-03 16:35:11,959] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 10: [2023-02-03 16:35:11,959] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 0: [2023-02-03 16:35:11,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:11,964] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 0: [2023-02-03 16:35:11,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:11,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 0: [2023-02-03 16:35:11,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 0: [2023-02-03 16:35:11,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 0: [2023-02-03 16:35:11,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 0: [2023-02-03 16:35:11,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 0: [2023-02-03 16:35:11,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 0: [2023-02-03 16:35:11,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 10: [2023-02-03 16:35:11,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 10: [2023-02-03 16:35:11,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 13: [2023-02-03 16:35:11,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:11,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 10: [2023-02-03 16:35:11,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 10: [2023-02-03 16:35:11,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:11,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:11,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 13: [2023-02-03 16:35:11,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 10: [2023-02-03 16:35:11,980] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 13: [2023-02-03 16:35:11,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 13: [2023-02-03 16:35:11,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:11,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 10: [2023-02-03 16:35:11,988] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 10: [2023-02-03 16:35:11,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 10: [2023-02-03 16:35:11,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 10: [2023-02-03 16:35:11,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_25-model_00-model_states.pt. 10: [2023-02-03 16:35:11,989] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 0: [2023-02-03 16:35:11,989] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 0: [2023-02-03 16:35:11,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 0: [2023-02-03 16:35:11,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 0: [2023-02-03 16:35:11,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 0: [2023-02-03 16:35:11,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 0: [2023-02-03 16:35:11,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:11,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:11,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 10: [2023-02-03 16:35:11,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:11,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:11,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 10: [2023-02-03 16:35:12,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 10: [2023-02-03 16:35:12,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 10: [2023-02-03 16:35:12,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 4: [2023-02-03 16:35:12,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 4: [2023-02-03 16:35:12,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 4: [2023-02-03 16:35:12,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 4: [2023-02-03 16:35:12,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 4: [2023-02-03 16:35:12,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 4: [2023-02-03 16:35:12,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 4: [2023-02-03 16:35:12,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 4: [2023-02-03 16:35:12,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 4: [2023-02-03 16:35:12,048] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 4: [2023-02-03 16:35:12,048] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 4: [2023-02-03 16:35:12,048] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 4: [2023-02-03 16:35:12,048] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 4: [2023-02-03 16:35:12,048] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 4: [2023-02-03 16:35:12,048] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 4: [2023-02-03 16:35:12,048] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 4: [2023-02-03 16:35:12,048] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 4: [2023-02-03 16:35:12,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 4: [2023-02-03 16:35:12,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 4: [2023-02-03 16:35:12,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 4: [2023-02-03 16:35:12,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 4: [2023-02-03 16:35:12,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 4: [2023-02-03 16:35:12,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 4: [2023-02-03 16:35:12,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 2: [2023-02-03 16:35:12,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 4: [2023-02-03 16:35:12,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 2: [2023-02-03 16:35:12,097] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 2: [2023-02-03 16:35:12,098] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 2: [2023-02-03 16:35:12,098] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 2: [2023-02-03 16:35:12,098] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 2: [2023-02-03 16:35:12,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 2: [2023-02-03 16:35:12,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 2: [2023-02-03 16:35:12,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 2: [2023-02-03 16:35:12,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 4: [2023-02-03 16:35:12,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 2: [2023-02-03 16:35:12,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 2: [2023-02-03 16:35:12,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 2: [2023-02-03 16:35:12,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 8: [2023-02-03 16:35:12,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 2: [2023-02-03 16:35:12,104] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 4: [2023-02-03 16:35:12,103] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 2: [2023-02-03 16:35:12,104] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 2: [2023-02-03 16:35:12,104] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 2: [2023-02-03 16:35:12,104] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 4: [2023-02-03 16:35:12,104] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 8: [2023-02-03 16:35:12,104] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 4: [2023-02-03 16:35:12,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 8: [2023-02-03 16:35:12,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 8: [2023-02-03 16:35:12,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 8: [2023-02-03 16:35:12,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 8: [2023-02-03 16:35:12,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 8: [2023-02-03 16:35:12,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 8: [2023-02-03 16:35:12,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 8: [2023-02-03 16:35:12,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 4: [2023-02-03 16:35:12,107] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 4: [2023-02-03 16:35:12,107] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 4: [2023-02-03 16:35:12,107] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 8: [2023-02-03 16:35:12,109] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 8: [2023-02-03 16:35:12,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 8: [2023-02-03 16:35:12,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 4: [2023-02-03 16:35:12,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 8: [2023-02-03 16:35:12,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 8: [2023-02-03 16:35:12,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 8: [2023-02-03 16:35:12,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 8: [2023-02-03 16:35:12,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 9: [2023-02-03 16:35:12,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 9: [2023-02-03 16:35:12,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 9: [2023-02-03 16:35:12,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 9: [2023-02-03 16:35:12,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 9: [2023-02-03 16:35:12,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 9: [2023-02-03 16:35:12,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 9: [2023-02-03 16:35:12,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 9: [2023-02-03 16:35:12,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 9: [2023-02-03 16:35:12,117] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:12,117] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 5: [2023-02-03 16:35:12,117] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 5: [2023-02-03 16:35:12,117] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 5: [2023-02-03 16:35:12,117] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 5: [2023-02-03 16:35:12,117] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 5: [2023-02-03 16:35:12,117] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 5: [2023-02-03 16:35:12,117] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 9: [2023-02-03 16:35:12,117] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:12,117] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 9: [2023-02-03 16:35:12,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:12,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 1: [2023-02-03 16:35:12,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 1: [2023-02-03 16:35:12,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 1: [2023-02-03 16:35:12,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 1: [2023-02-03 16:35:12,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 2: [2023-02-03 16:35:12,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:12,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:12,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:12,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:12,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:12,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:12,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:12,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 1: [2023-02-03 16:35:12,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 8: [2023-02-03 16:35:12,121] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 1: [2023-02-03 16:35:12,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 9: [2023-02-03 16:35:12,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 9: [2023-02-03 16:35:12,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 9: [2023-02-03 16:35:12,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 9: [2023-02-03 16:35:12,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 9: [2023-02-03 16:35:12,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 15: [2023-02-03 16:35:12,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 15: [2023-02-03 16:35:12,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 15: [2023-02-03 16:35:12,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 15: [2023-02-03 16:35:12,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 15: [2023-02-03 16:35:12,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 15: [2023-02-03 16:35:12,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 15: [2023-02-03 16:35:12,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 15: [2023-02-03 16:35:12,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 15: [2023-02-03 16:35:12,124] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 15: [2023-02-03 16:35:12,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 15: [2023-02-03 16:35:12,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 15: [2023-02-03 16:35:12,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 15: [2023-02-03 16:35:12,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 15: [2023-02-03 16:35:12,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 15: [2023-02-03 16:35:12,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 15: [2023-02-03 16:35:12,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 2: [2023-02-03 16:35:12,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 6: [2023-02-03 16:35:12,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 6: [2023-02-03 16:35:12,134] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 6: [2023-02-03 16:35:12,135] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 6: [2023-02-03 16:35:12,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 8: [2023-02-03 16:35:12,136] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,139] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 6: [2023-02-03 16:35:12,139] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 6: [2023-02-03 16:35:12,140] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 6: [2023-02-03 16:35:12,140] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 6: [2023-02-03 16:35:12,140] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 6: [2023-02-03 16:35:12,140] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 6: [2023-02-03 16:35:12,140] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 2: [2023-02-03 16:35:12,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 2: [2023-02-03 16:35:12,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 6: [2023-02-03 16:35:12,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 6: [2023-02-03 16:35:12,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 2: [2023-02-03 16:35:12,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 2: [2023-02-03 16:35:12,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 2: [2023-02-03 16:35:12,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 2: [2023-02-03 16:35:12,144] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 2: [2023-02-03 16:35:12,144] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 6: [2023-02-03 16:35:12,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 6: [2023-02-03 16:35:12,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 6: [2023-02-03 16:35:12,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 6: [2023-02-03 16:35:12,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 8: [2023-02-03 16:35:12,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 5: [2023-02-03 16:35:12,148] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 14: [2023-02-03 16:35:12,149] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 14: [2023-02-03 16:35:12,149] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 14: [2023-02-03 16:35:12,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 14: [2023-02-03 16:35:12,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 14: [2023-02-03 16:35:12,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 14: [2023-02-03 16:35:12,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 14: [2023-02-03 16:35:12,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 14: [2023-02-03 16:35:12,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 14: [2023-02-03 16:35:12,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 14: [2023-02-03 16:35:12,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 15: [2023-02-03 16:35:12,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 9: [2023-02-03 16:35:12,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 6: [2023-02-03 16:35:12,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 6: [2023-02-03 16:35:12,154] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 9: [2023-02-03 16:35:12,154] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 8: [2023-02-03 16:35:12,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 9: [2023-02-03 16:35:12,156] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,156] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 1: [2023-02-03 16:35:12,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 14: [2023-02-03 16:35:12,156] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 14: [2023-02-03 16:35:12,156] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 14: [2023-02-03 16:35:12,156] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 14: [2023-02-03 16:35:12,156] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 14: [2023-02-03 16:35:12,156] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 8: [2023-02-03 16:35:12,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 8: [2023-02-03 16:35:12,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 15: [2023-02-03 16:35:12,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 15: [2023-02-03 16:35:12,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 15: [2023-02-03 16:35:12,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 5: [2023-02-03 16:35:12,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 5: [2023-02-03 16:35:12,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 5: [2023-02-03 16:35:12,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 5: [2023-02-03 16:35:12,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 5: [2023-02-03 16:35:12,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 5: [2023-02-03 16:35:12,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 5: [2023-02-03 16:35:12,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 8: [2023-02-03 16:35:12,161] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 2: [2023-02-03 16:35:12,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 2: [2023-02-03 16:35:12,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 2: [2023-02-03 16:35:12,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 2: [2023-02-03 16:35:12,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 2: [2023-02-03 16:35:12,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 2: [2023-02-03 16:35:12,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 2: [2023-02-03 16:35:12,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 5: [2023-02-03 16:35:12,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 8: [2023-02-03 16:35:12,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 8: [2023-02-03 16:35:12,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 3: [2023-02-03 16:35:12,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 3: [2023-02-03 16:35:12,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 15: [2023-02-03 16:35:12,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 3: [2023-02-03 16:35:12,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 3: [2023-02-03 16:35:12,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 3: [2023-02-03 16:35:12,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 3: [2023-02-03 16:35:12,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 8: [2023-02-03 16:35:12,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 3: [2023-02-03 16:35:12,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 11: [2023-02-03 16:35:12,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 11: [2023-02-03 16:35:12,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 11: [2023-02-03 16:35:12,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 15: [2023-02-03 16:35:12,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 15: [2023-02-03 16:35:12,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 11: [2023-02-03 16:35:12,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 11: [2023-02-03 16:35:12,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 11: [2023-02-03 16:35:12,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 11: [2023-02-03 16:35:12,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 15: [2023-02-03 16:35:12,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 11: [2023-02-03 16:35:12,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 3: [2023-02-03 16:35:12,169] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 12: [2023-02-03 16:35:12,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 12: [2023-02-03 16:35:12,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 12: [2023-02-03 16:35:12,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 12: [2023-02-03 16:35:12,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 12: [2023-02-03 16:35:12,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 12: [2023-02-03 16:35:12,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 12: [2023-02-03 16:35:12,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 3: [2023-02-03 16:35:12,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 3: [2023-02-03 16:35:12,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 3: [2023-02-03 16:35:12,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 12: [2023-02-03 16:35:12,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,171] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 3: [2023-02-03 16:35:12,172] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 3: [2023-02-03 16:35:12,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 11: [2023-02-03 16:35:12,171] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 11: [2023-02-03 16:35:12,171] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 3: [2023-02-03 16:35:12,172] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 3: [2023-02-03 16:35:12,172] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 1: [2023-02-03 16:35:12,172] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 15: [2023-02-03 16:35:12,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 12: [2023-02-03 16:35:12,172] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 12: [2023-02-03 16:35:12,172] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 11: [2023-02-03 16:35:12,172] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 11: [2023-02-03 16:35:12,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 11: [2023-02-03 16:35:12,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 11: [2023-02-03 16:35:12,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 11: [2023-02-03 16:35:12,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 12: [2023-02-03 16:35:12,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 9: [2023-02-03 16:35:12,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 8: [2023-02-03 16:35:12,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,174] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 9: [2023-02-03 16:35:12,174] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,174] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 9: [2023-02-03 16:35:12,174] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 12: [2023-02-03 16:35:12,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 3: [2023-02-03 16:35:12,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 1: [2023-02-03 16:35:12,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 12: [2023-02-03 16:35:12,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 12: [2023-02-03 16:35:12,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 12: [2023-02-03 16:35:12,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 12: [2023-02-03 16:35:12,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 1: [2023-02-03 16:35:12,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 5: [2023-02-03 16:35:12,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 15: [2023-02-03 16:35:12,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,176] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 1: [2023-02-03 16:35:12,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 6: [2023-02-03 16:35:12,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 15: [2023-02-03 16:35:12,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 6: [2023-02-03 16:35:12,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 5: [2023-02-03 16:35:12,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 5: [2023-02-03 16:35:12,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,177] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 7: [2023-02-03 16:35:12,177] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 7: [2023-02-03 16:35:12,177] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 15: [2023-02-03 16:35:12,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 1: [2023-02-03 16:35:12,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 8: [2023-02-03 16:35:12,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,177] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 7: [2023-02-03 16:35:12,177] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 7: [2023-02-03 16:35:12,177] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 7: [2023-02-03 16:35:12,177] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 5: [2023-02-03 16:35:12,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:12,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:12,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 5: [2023-02-03 16:35:12,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 8: [2023-02-03 16:35:12,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 7: [2023-02-03 16:35:12,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 7: [2023-02-03 16:35:12,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 7: [2023-02-03 16:35:12,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 7: [2023-02-03 16:35:12,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 14: [2023-02-03 16:35:12,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 6: [2023-02-03 16:35:12,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 8: [2023-02-03 16:35:12,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 15: [2023-02-03 16:35:12,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 6: [2023-02-03 16:35:12,186] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 8: [2023-02-03 16:35:12,187] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 6: [2023-02-03 16:35:12,187] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 6: [2023-02-03 16:35:12,189] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 6: [2023-02-03 16:35:12,189] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 15: [2023-02-03 16:35:12,187] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 15: [2023-02-03 16:35:12,187] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 8: [2023-02-03 16:35:12,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 15: [2023-02-03 16:35:12,191] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,191] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,192] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 6: [2023-02-03 16:35:12,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 6: [2023-02-03 16:35:12,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 14: [2023-02-03 16:35:12,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 13: [2023-02-03 16:35:12,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 3: [2023-02-03 16:35:12,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 14: [2023-02-03 16:35:12,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 6: [2023-02-03 16:35:12,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 14: [2023-02-03 16:35:12,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 14: [2023-02-03 16:35:12,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 6: [2023-02-03 16:35:12,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 11: [2023-02-03 16:35:12,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 10: [2023-02-03 16:35:12,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 10: [2023-02-03 16:35:12,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 10: [2023-02-03 16:35:12,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 7: [2023-02-03 16:35:12,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 6: [2023-02-03 16:35:12,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 10: [2023-02-03 16:35:12,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 10: [2023-02-03 16:35:12,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 10: [2023-02-03 16:35:12,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 10: [2023-02-03 16:35:12,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 12: [2023-02-03 16:35:12,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 12: [2023-02-03 16:35:12,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 3: [2023-02-03 16:35:12,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 3: [2023-02-03 16:35:12,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 3: [2023-02-03 16:35:12,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 6: [2023-02-03 16:35:12,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 10: [2023-02-03 16:35:12,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 13: [2023-02-03 16:35:12,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 13: [2023-02-03 16:35:12,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 12: [2023-02-03 16:35:12,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 13: [2023-02-03 16:35:12,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 10: [2023-02-03 16:35:12,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:12,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 13: [2023-02-03 16:35:12,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 13: [2023-02-03 16:35:12,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 13: [2023-02-03 16:35:12,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 10: [2023-02-03 16:35:12,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 10: [2023-02-03 16:35:12,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 0: [2023-02-03 16:35:12,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 0: [2023-02-03 16:35:12,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 13: [2023-02-03 16:35:12,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 0: [2023-02-03 16:35:12,208] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 0: [2023-02-03 16:35:12,208] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 0: [2023-02-03 16:35:12,208] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 0: [2023-02-03 16:35:12,208] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 0: [2023-02-03 16:35:12,208] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 11: [2023-02-03 16:35:12,208] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 0: [2023-02-03 16:35:12,208] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 13: [2023-02-03 16:35:12,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:12,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:12,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 10: [2023-02-03 16:35:12,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 10: [2023-02-03 16:35:12,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 10: [2023-02-03 16:35:12,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 10: [2023-02-03 16:35:12,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 10: [2023-02-03 16:35:12,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 0: [2023-02-03 16:35:12,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 12: [2023-02-03 16:35:12,211] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 13: [2023-02-03 16:35:12,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:12,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:12,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 13: [2023-02-03 16:35:12,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 0: [2023-02-03 16:35:12,214] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 3: [2023-02-03 16:35:12,214] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 0: [2023-02-03 16:35:12,214] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 0: [2023-02-03 16:35:12,214] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 6: [2023-02-03 16:35:12,214] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 0: [2023-02-03 16:35:12,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 0: [2023-02-03 16:35:12,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 0: [2023-02-03 16:35:12,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 0: [2023-02-03 16:35:12,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt... 12: [2023-02-03 16:35:12,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 12: [2023-02-03 16:35:12,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 12: [2023-02-03 16:35:12,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 11: [2023-02-03 16:35:12,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 11: [2023-02-03 16:35:12,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 7: [2023-02-03 16:35:12,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 7: [2023-02-03 16:35:12,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 11: [2023-02-03 16:35:12,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 11: [2023-02-03 16:35:12,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 3: [2023-02-03 16:35:12,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 3: [2023-02-03 16:35:12,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 3: [2023-02-03 16:35:12,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 3: [2023-02-03 16:35:12,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 12: [2023-02-03 16:35:12,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 11: [2023-02-03 16:35:12,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 3: [2023-02-03 16:35:12,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 3: [2023-02-03 16:35:12,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 12: [2023-02-03 16:35:12,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 12: [2023-02-03 16:35:12,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 12: [2023-02-03 16:35:12,225] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 3: [2023-02-03 16:35:12,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,226] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 7: [2023-02-03 16:35:12,226] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 7: [2023-02-03 16:35:12,226] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 7: [2023-02-03 16:35:12,226] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 12: [2023-02-03 16:35:12,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 12: [2023-02-03 16:35:12,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 3: [2023-02-03 16:35:12,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 12: [2023-02-03 16:35:12,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 12: [2023-02-03 16:35:12,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 3: [2023-02-03 16:35:12,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 3: [2023-02-03 16:35:12,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 3: [2023-02-03 16:35:12,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 10: [2023-02-03 16:35:12,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 10: [2023-02-03 16:35:12,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 12: [2023-02-03 16:35:12,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 13: [2023-02-03 16:35:12,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 13: [2023-02-03 16:35:12,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 7: [2023-02-03 16:35:12,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 13: [2023-02-03 16:35:12,247] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 10: [2023-02-03 16:35:12,247] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 7: [2023-02-03 16:35:12,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 13: [2023-02-03 16:35:12,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 7: [2023-02-03 16:35:12,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 10: [2023-02-03 16:35:12,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 10: [2023-02-03 16:35:12,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 13: [2023-02-03 16:35:12,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 13: [2023-02-03 16:35:12,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 13: [2023-02-03 16:35:12,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 10: [2023-02-03 16:35:12,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 10: [2023-02-03 16:35:12,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 10: [2023-02-03 16:35:12,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 10: [2023-02-03 16:35:12,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 13: [2023-02-03 16:35:12,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 13: [2023-02-03 16:35:12,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 13: [2023-02-03 16:35:12,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 10: [2023-02-03 16:35:12,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 0: [2023-02-03 16:35:12,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 0: [2023-02-03 16:35:12,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 0: [2023-02-03 16:35:12,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 13: [2023-02-03 16:35:12,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 10: [2023-02-03 16:35:12,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 13: [2023-02-03 16:35:12,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 13: [2023-02-03 16:35:12,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 0: [2023-02-03 16:35:12,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 0: [2023-02-03 16:35:12,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 10: [2023-02-03 16:35:12,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 0: [2023-02-03 16:35:12,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 0: [2023-02-03 16:35:12,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 10: [2023-02-03 16:35:12,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 13: [2023-02-03 16:35:12,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 10: [2023-02-03 16:35:12,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 10: [2023-02-03 16:35:12,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 13: [2023-02-03 16:35:12,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 0: [2023-02-03 16:35:12,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_26-model_00-model_states.pt. 0: [2023-02-03 16:35:12,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 10: [2023-02-03 16:35:12,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 0: [2023-02-03 16:35:12,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 0: [2023-02-03 16:35:12,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 13: [2023-02-03 16:35:12,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 0: [2023-02-03 16:35:12,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 0: [2023-02-03 16:35:12,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 0: [2023-02-03 16:35:12,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 0: [2023-02-03 16:35:12,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 0: [2023-02-03 16:35:12,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 2: [2023-02-03 16:35:12,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 2: [2023-02-03 16:35:12,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 2: [2023-02-03 16:35:12,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 2: [2023-02-03 16:35:12,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 2: [2023-02-03 16:35:12,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 2: [2023-02-03 16:35:12,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 2: [2023-02-03 16:35:12,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 2: [2023-02-03 16:35:12,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 2: [2023-02-03 16:35:12,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 2: [2023-02-03 16:35:12,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 2: [2023-02-03 16:35:12,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 2: [2023-02-03 16:35:12,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 2: [2023-02-03 16:35:12,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 2: [2023-02-03 16:35:12,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 2: [2023-02-03 16:35:12,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 2: [2023-02-03 16:35:12,383] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,394] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 9: [2023-02-03 16:35:12,394] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,399] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 9: [2023-02-03 16:35:12,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 9: [2023-02-03 16:35:12,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 9: [2023-02-03 16:35:12,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 9: [2023-02-03 16:35:12,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 9: [2023-02-03 16:35:12,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 9: [2023-02-03 16:35:12,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 9: [2023-02-03 16:35:12,402] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,407] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,407] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,407] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,407] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 2: [2023-02-03 16:35:12,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 9: [2023-02-03 16:35:12,416] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 2: [2023-02-03 16:35:12,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 2: [2023-02-03 16:35:12,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 2: [2023-02-03 16:35:12,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 1: [2023-02-03 16:35:12,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 1: [2023-02-03 16:35:12,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 1: [2023-02-03 16:35:12,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 1: [2023-02-03 16:35:12,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 1: [2023-02-03 16:35:12,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 1: [2023-02-03 16:35:12,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 2: [2023-02-03 16:35:12,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 1: [2023-02-03 16:35:12,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 1: [2023-02-03 16:35:12,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 2: [2023-02-03 16:35:12,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 2: [2023-02-03 16:35:12,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 1: [2023-02-03 16:35:12,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 1: [2023-02-03 16:35:12,425] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 1: [2023-02-03 16:35:12,425] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 1: [2023-02-03 16:35:12,425] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 1: [2023-02-03 16:35:12,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 1: [2023-02-03 16:35:12,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 1: [2023-02-03 16:35:12,427] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 1: [2023-02-03 16:35:12,427] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 2: [2023-02-03 16:35:12,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 9: [2023-02-03 16:35:12,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 2: [2023-02-03 16:35:12,432] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 2: [2023-02-03 16:35:12,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 2: [2023-02-03 16:35:12,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 9: [2023-02-03 16:35:12,435] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 2: [2023-02-03 16:35:12,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 2: [2023-02-03 16:35:12,443] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 2: [2023-02-03 16:35:12,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 2: [2023-02-03 16:35:12,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 2: [2023-02-03 16:35:12,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 5: [2023-02-03 16:35:12,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 5: [2023-02-03 16:35:12,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 5: [2023-02-03 16:35:12,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 5: [2023-02-03 16:35:12,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 5: [2023-02-03 16:35:12,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 5: [2023-02-03 16:35:12,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 5: [2023-02-03 16:35:12,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 5: [2023-02-03 16:35:12,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 5: [2023-02-03 16:35:12,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 5: [2023-02-03 16:35:12,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 5: [2023-02-03 16:35:12,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 5: [2023-02-03 16:35:12,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 5: [2023-02-03 16:35:12,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 5: [2023-02-03 16:35:12,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 5: [2023-02-03 16:35:12,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 5: [2023-02-03 16:35:12,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 9: [2023-02-03 16:35:12,452] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 9: [2023-02-03 16:35:12,452] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,452] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,452] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 9: [2023-02-03 16:35:12,454] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,454] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,454] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,454] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,454] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,454] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,454] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,455] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 8: [2023-02-03 16:35:12,455] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 1: [2023-02-03 16:35:12,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,459] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 8: [2023-02-03 16:35:12,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 9: [2023-02-03 16:35:12,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 8: [2023-02-03 16:35:12,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 8: [2023-02-03 16:35:12,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 1: [2023-02-03 16:35:12,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 1: [2023-02-03 16:35:12,463] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 1: [2023-02-03 16:35:12,463] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 1: [2023-02-03 16:35:12,465] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 1: [2023-02-03 16:35:12,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 1: [2023-02-03 16:35:12,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 9: [2023-02-03 16:35:12,466] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 9: [2023-02-03 16:35:12,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 4: [2023-02-03 16:35:12,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 4: [2023-02-03 16:35:12,469] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 4: [2023-02-03 16:35:12,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 4: [2023-02-03 16:35:12,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 4: [2023-02-03 16:35:12,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 4: [2023-02-03 16:35:12,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 4: [2023-02-03 16:35:12,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 4: [2023-02-03 16:35:12,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 4: [2023-02-03 16:35:12,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 4: [2023-02-03 16:35:12,472] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 4: [2023-02-03 16:35:12,473] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 4: [2023-02-03 16:35:12,473] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 4: [2023-02-03 16:35:12,473] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 9: [2023-02-03 16:35:12,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 1: [2023-02-03 16:35:12,475] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 1: [2023-02-03 16:35:12,475] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 1: [2023-02-03 16:35:12,476] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 5: [2023-02-03 16:35:12,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 1: [2023-02-03 16:35:12,478] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 14: [2023-02-03 16:35:12,481] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 9: [2023-02-03 16:35:12,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 9: [2023-02-03 16:35:12,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 1: [2023-02-03 16:35:12,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 9: [2023-02-03 16:35:12,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 1: [2023-02-03 16:35:12,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 6: [2023-02-03 16:35:12,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 6: [2023-02-03 16:35:12,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 6: [2023-02-03 16:35:12,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 14: [2023-02-03 16:35:12,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 6: [2023-02-03 16:35:12,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 6: [2023-02-03 16:35:12,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 1: [2023-02-03 16:35:12,483] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 1: [2023-02-03 16:35:12,483] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 6: [2023-02-03 16:35:12,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 6: [2023-02-03 16:35:12,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 6: [2023-02-03 16:35:12,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 14: [2023-02-03 16:35:12,483] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 14: [2023-02-03 16:35:12,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 14: [2023-02-03 16:35:12,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 14: [2023-02-03 16:35:12,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 14: [2023-02-03 16:35:12,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 14: [2023-02-03 16:35:12,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 6: [2023-02-03 16:35:12,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 6: [2023-02-03 16:35:12,485] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 6: [2023-02-03 16:35:12,485] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 6: [2023-02-03 16:35:12,485] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 6: [2023-02-03 16:35:12,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 8: [2023-02-03 16:35:12,486] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 15: [2023-02-03 16:35:12,486] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 15: [2023-02-03 16:35:12,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 6: [2023-02-03 16:35:12,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 15: [2023-02-03 16:35:12,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 15: [2023-02-03 16:35:12,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 15: [2023-02-03 16:35:12,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 15: [2023-02-03 16:35:12,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 15: [2023-02-03 16:35:12,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 6: [2023-02-03 16:35:12,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 15: [2023-02-03 16:35:12,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 6: [2023-02-03 16:35:12,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 5: [2023-02-03 16:35:12,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 5: [2023-02-03 16:35:12,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 5: [2023-02-03 16:35:12,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 5: [2023-02-03 16:35:12,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 14: [2023-02-03 16:35:12,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 15: [2023-02-03 16:35:12,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 15: [2023-02-03 16:35:12,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 1: [2023-02-03 16:35:12,490] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 15: [2023-02-03 16:35:12,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 15: [2023-02-03 16:35:12,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 15: [2023-02-03 16:35:12,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 8: [2023-02-03 16:35:12,491] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 15: [2023-02-03 16:35:12,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 15: [2023-02-03 16:35:12,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 5: [2023-02-03 16:35:12,491] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 5: [2023-02-03 16:35:12,491] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 5: [2023-02-03 16:35:12,491] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 5: [2023-02-03 16:35:12,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,503] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,504] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,504] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,505] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 5: [2023-02-03 16:35:12,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 5: [2023-02-03 16:35:12,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 5: [2023-02-03 16:35:12,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 4: [2023-02-03 16:35:12,508] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 14: [2023-02-03 16:35:12,508] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 5: [2023-02-03 16:35:12,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 6: [2023-02-03 16:35:12,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 5: [2023-02-03 16:35:12,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 5: [2023-02-03 16:35:12,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 5: [2023-02-03 16:35:12,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 14: [2023-02-03 16:35:12,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 4: [2023-02-03 16:35:12,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 4: [2023-02-03 16:35:12,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 4: [2023-02-03 16:35:12,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 15: [2023-02-03 16:35:12,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 6: [2023-02-03 16:35:12,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 6: [2023-02-03 16:35:12,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 11: [2023-02-03 16:35:12,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 6: [2023-02-03 16:35:12,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 6: [2023-02-03 16:35:12,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 11: [2023-02-03 16:35:12,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 11: [2023-02-03 16:35:12,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 11: [2023-02-03 16:35:12,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 11: [2023-02-03 16:35:12,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 11: [2023-02-03 16:35:12,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 11: [2023-02-03 16:35:12,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 11: [2023-02-03 16:35:12,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 8: [2023-02-03 16:35:12,524] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,524] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 8: [2023-02-03 16:35:12,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 11: [2023-02-03 16:35:12,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 4: [2023-02-03 16:35:12,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 11: [2023-02-03 16:35:12,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 14: [2023-02-03 16:35:12,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 8: [2023-02-03 16:35:12,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 8: [2023-02-03 16:35:12,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 8: [2023-02-03 16:35:12,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 15: [2023-02-03 16:35:12,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 15: [2023-02-03 16:35:12,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 6: [2023-02-03 16:35:12,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 15: [2023-02-03 16:35:12,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 14: [2023-02-03 16:35:12,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 14: [2023-02-03 16:35:12,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 15: [2023-02-03 16:35:12,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 6: [2023-02-03 16:35:12,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 6: [2023-02-03 16:35:12,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 6: [2023-02-03 16:35:12,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 6: [2023-02-03 16:35:12,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 6: [2023-02-03 16:35:12,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 14: [2023-02-03 16:35:12,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 14: [2023-02-03 16:35:12,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 14: [2023-02-03 16:35:12,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 8: [2023-02-03 16:35:12,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 14: [2023-02-03 16:35:12,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 15: [2023-02-03 16:35:12,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 6: [2023-02-03 16:35:12,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 6: [2023-02-03 16:35:12,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 4: [2023-02-03 16:35:12,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 0: [2023-02-03 16:35:12,548] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 0: [2023-02-03 16:35:12,548] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 0: [2023-02-03 16:35:12,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 0: [2023-02-03 16:35:12,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 0: [2023-02-03 16:35:12,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 0: [2023-02-03 16:35:12,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 0: [2023-02-03 16:35:12,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 0: [2023-02-03 16:35:12,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 15: [2023-02-03 16:35:12,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 14: [2023-02-03 16:35:12,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 0: [2023-02-03 16:35:12,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 0: [2023-02-03 16:35:12,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,554] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 6: [2023-02-03 16:35:12,554] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,554] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 0: [2023-02-03 16:35:12,554] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 0: [2023-02-03 16:35:12,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 0: [2023-02-03 16:35:12,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 0: [2023-02-03 16:35:12,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 0: [2023-02-03 16:35:12,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 0: [2023-02-03 16:35:12,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 6: [2023-02-03 16:35:12,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 6: [2023-02-03 16:35:12,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 12: [2023-02-03 16:35:12,561] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 11: [2023-02-03 16:35:12,561] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 14: [2023-02-03 16:35:12,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 14: [2023-02-03 16:35:12,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 12: [2023-02-03 16:35:12,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 12: [2023-02-03 16:35:12,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 12: [2023-02-03 16:35:12,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 14: [2023-02-03 16:35:12,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 12: [2023-02-03 16:35:12,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 12: [2023-02-03 16:35:12,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 12: [2023-02-03 16:35:12,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 12: [2023-02-03 16:35:12,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 12: [2023-02-03 16:35:12,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 12: [2023-02-03 16:35:12,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 12: [2023-02-03 16:35:12,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 12: [2023-02-03 16:35:12,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 12: [2023-02-03 16:35:12,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 12: [2023-02-03 16:35:12,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 14: [2023-02-03 16:35:12,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 12: [2023-02-03 16:35:12,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 12: [2023-02-03 16:35:12,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 11: [2023-02-03 16:35:12,568] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 7: [2023-02-03 16:35:12,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 11: [2023-02-03 16:35:12,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 7: [2023-02-03 16:35:12,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 7: [2023-02-03 16:35:12,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 7: [2023-02-03 16:35:12,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 11: [2023-02-03 16:35:12,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 7: [2023-02-03 16:35:12,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 7: [2023-02-03 16:35:12,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 11: [2023-02-03 16:35:12,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 7: [2023-02-03 16:35:12,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 7: [2023-02-03 16:35:12,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 11: [2023-02-03 16:35:12,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 7: [2023-02-03 16:35:12,573] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,573] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,576] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,576] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 7: [2023-02-03 16:35:12,576] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,576] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 11: [2023-02-03 16:35:12,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 11: [2023-02-03 16:35:12,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 13: [2023-02-03 16:35:12,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 13: [2023-02-03 16:35:12,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 13: [2023-02-03 16:35:12,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 13: [2023-02-03 16:35:12,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 13: [2023-02-03 16:35:12,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 13: [2023-02-03 16:35:12,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 13: [2023-02-03 16:35:12,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 13: [2023-02-03 16:35:12,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 13: [2023-02-03 16:35:12,585] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 13: [2023-02-03 16:35:12,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 13: [2023-02-03 16:35:12,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 13: [2023-02-03 16:35:12,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 13: [2023-02-03 16:35:12,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 11: [2023-02-03 16:35:12,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 11: [2023-02-03 16:35:12,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 13: [2023-02-03 16:35:12,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 13: [2023-02-03 16:35:12,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 13: [2023-02-03 16:35:12,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 11: [2023-02-03 16:35:12,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 10: [2023-02-03 16:35:12,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 3: [2023-02-03 16:35:12,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 3: [2023-02-03 16:35:12,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 3: [2023-02-03 16:35:12,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 10: [2023-02-03 16:35:12,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 3: [2023-02-03 16:35:12,591] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 3: [2023-02-03 16:35:12,591] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 3: [2023-02-03 16:35:12,591] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 3: [2023-02-03 16:35:12,591] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 3: [2023-02-03 16:35:12,591] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 10: [2023-02-03 16:35:12,591] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 10: [2023-02-03 16:35:12,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 10: [2023-02-03 16:35:12,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 10: [2023-02-03 16:35:12,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 10: [2023-02-03 16:35:12,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 10: [2023-02-03 16:35:12,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 3: [2023-02-03 16:35:12,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 3: [2023-02-03 16:35:12,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 10: [2023-02-03 16:35:12,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 10: [2023-02-03 16:35:12,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 3: [2023-02-03 16:35:12,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 3: [2023-02-03 16:35:12,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 10: [2023-02-03 16:35:12,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 10: [2023-02-03 16:35:12,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 10: [2023-02-03 16:35:12,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 3: [2023-02-03 16:35:12,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 3: [2023-02-03 16:35:12,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 10: [2023-02-03 16:35:12,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 3: [2023-02-03 16:35:12,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 3: [2023-02-03 16:35:12,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 10: [2023-02-03 16:35:12,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 10: [2023-02-03 16:35:12,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt... 0: [2023-02-03 16:35:12,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 12: [2023-02-03 16:35:12,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 12: [2023-02-03 16:35:12,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 12: [2023-02-03 16:35:12,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 12: [2023-02-03 16:35:12,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 12: [2023-02-03 16:35:12,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 0: [2023-02-03 16:35:12,605] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 7: [2023-02-03 16:35:12,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 7: [2023-02-03 16:35:12,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 7: [2023-02-03 16:35:12,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 0: [2023-02-03 16:35:12,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 13: [2023-02-03 16:35:12,609] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 12: [2023-02-03 16:35:12,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 12: [2023-02-03 16:35:12,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 7: [2023-02-03 16:35:12,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 0: [2023-02-03 16:35:12,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 12: [2023-02-03 16:35:12,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 12: [2023-02-03 16:35:12,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 12: [2023-02-03 16:35:12,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 13: [2023-02-03 16:35:12,618] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 12: [2023-02-03 16:35:12,618] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 12: [2023-02-03 16:35:12,618] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 7: [2023-02-03 16:35:12,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 12: [2023-02-03 16:35:12,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 7: [2023-02-03 16:35:12,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 7: [2023-02-03 16:35:12,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 7: [2023-02-03 16:35:12,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 7: [2023-02-03 16:35:12,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 7: [2023-02-03 16:35:12,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 0: [2023-02-03 16:35:12,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 0: [2023-02-03 16:35:12,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 0: [2023-02-03 16:35:12,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 0: [2023-02-03 16:35:12,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 0: [2023-02-03 16:35:12,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 3: [2023-02-03 16:35:12,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 3: [2023-02-03 16:35:12,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 7: [2023-02-03 16:35:12,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 13: [2023-02-03 16:35:12,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 13: [2023-02-03 16:35:12,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 13: [2023-02-03 16:35:12,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 13: [2023-02-03 16:35:12,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 10: [2023-02-03 16:35:12,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 10: [2023-02-03 16:35:12,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 0: [2023-02-03 16:35:12,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 0: [2023-02-03 16:35:12,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 3: [2023-02-03 16:35:12,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 7: [2023-02-03 16:35:12,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 13: [2023-02-03 16:35:12,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 12: [2023-02-03 16:35:12,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 3: [2023-02-03 16:35:12,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 3: [2023-02-03 16:35:12,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 3: [2023-02-03 16:35:12,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 10: [2023-02-03 16:35:12,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 10: [2023-02-03 16:35:12,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 10: [2023-02-03 16:35:12,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 13: [2023-02-03 16:35:12,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 13: [2023-02-03 16:35:12,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 12: [2023-02-03 16:35:12,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 12: [2023-02-03 16:35:12,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 13: [2023-02-03 16:35:12,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 13: [2023-02-03 16:35:12,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 3: [2023-02-03 16:35:12,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 10: [2023-02-03 16:35:12,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 3: [2023-02-03 16:35:12,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 10: [2023-02-03 16:35:12,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 13: [2023-02-03 16:35:12,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 13: [2023-02-03 16:35:12,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 0: [2023-02-03 16:35:12,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 7: [2023-02-03 16:35:12,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 7: [2023-02-03 16:35:12,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 7: [2023-02-03 16:35:12,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 7: [2023-02-03 16:35:12,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 3: [2023-02-03 16:35:12,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 3: [2023-02-03 16:35:12,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 0: [2023-02-03 16:35:12,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 3: [2023-02-03 16:35:12,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 0: [2023-02-03 16:35:12,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 0: [2023-02-03 16:35:12,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 10: [2023-02-03 16:35:12,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 0: [2023-02-03 16:35:12,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 3: [2023-02-03 16:35:12,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 3: [2023-02-03 16:35:12,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 13: [2023-02-03 16:35:12,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 13: [2023-02-03 16:35:12,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 13: [2023-02-03 16:35:12,655] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 3: [2023-02-03 16:35:12,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 10: [2023-02-03 16:35:12,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 10: [2023-02-03 16:35:12,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 10: [2023-02-03 16:35:12,657] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 10: [2023-02-03 16:35:12,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 3: [2023-02-03 16:35:12,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 3: [2023-02-03 16:35:12,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 10: [2023-02-03 16:35:12,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_27-model_00-model_states.pt. 10: [2023-02-03 16:35:12,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 10: [2023-02-03 16:35:12,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 10: [2023-02-03 16:35:12,676] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 2: [2023-02-03 16:35:12,688] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 2: [2023-02-03 16:35:12,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 2: [2023-02-03 16:35:12,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 2: [2023-02-03 16:35:12,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 2: [2023-02-03 16:35:12,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 2: [2023-02-03 16:35:12,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 2: [2023-02-03 16:35:12,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 2: [2023-02-03 16:35:12,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 2: [2023-02-03 16:35:12,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 2: [2023-02-03 16:35:12,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 2: [2023-02-03 16:35:12,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 2: [2023-02-03 16:35:12,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 2: [2023-02-03 16:35:12,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 2: [2023-02-03 16:35:12,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 2: [2023-02-03 16:35:12,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 2: [2023-02-03 16:35:12,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 1: [2023-02-03 16:35:12,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 1: [2023-02-03 16:35:12,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 1: [2023-02-03 16:35:12,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 1: [2023-02-03 16:35:12,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 1: [2023-02-03 16:35:12,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 1: [2023-02-03 16:35:12,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 1: [2023-02-03 16:35:12,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 1: [2023-02-03 16:35:12,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 1: [2023-02-03 16:35:12,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 1: [2023-02-03 16:35:12,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 1: [2023-02-03 16:35:12,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 1: [2023-02-03 16:35:12,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 1: [2023-02-03 16:35:12,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 1: [2023-02-03 16:35:12,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 1: [2023-02-03 16:35:12,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 1: [2023-02-03 16:35:12,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 2: [2023-02-03 16:35:12,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 2: [2023-02-03 16:35:12,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 2: [2023-02-03 16:35:12,733] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 2: [2023-02-03 16:35:12,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 2: [2023-02-03 16:35:12,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 2: [2023-02-03 16:35:12,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 2: [2023-02-03 16:35:12,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 2: [2023-02-03 16:35:12,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 1: [2023-02-03 16:35:12,740] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 1: [2023-02-03 16:35:12,740] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 2: [2023-02-03 16:35:12,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 2: [2023-02-03 16:35:12,742] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 9: [2023-02-03 16:35:12,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 9: [2023-02-03 16:35:12,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 9: [2023-02-03 16:35:12,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 9: [2023-02-03 16:35:12,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 9: [2023-02-03 16:35:12,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 9: [2023-02-03 16:35:12,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 9: [2023-02-03 16:35:12,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 1: [2023-02-03 16:35:12,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 1: [2023-02-03 16:35:12,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 9: [2023-02-03 16:35:12,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 1: [2023-02-03 16:35:12,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 1: [2023-02-03 16:35:12,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 1: [2023-02-03 16:35:12,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 9: [2023-02-03 16:35:12,750] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 9: [2023-02-03 16:35:12,750] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 9: [2023-02-03 16:35:12,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 9: [2023-02-03 16:35:12,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 9: [2023-02-03 16:35:12,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 9: [2023-02-03 16:35:12,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 9: [2023-02-03 16:35:12,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 9: [2023-02-03 16:35:12,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 2: [2023-02-03 16:35:12,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 2: [2023-02-03 16:35:12,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 2: [2023-02-03 16:35:12,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:12,756] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 5: [2023-02-03 16:35:12,756] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 5: [2023-02-03 16:35:12,756] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 5: [2023-02-03 16:35:12,756] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 5: [2023-02-03 16:35:12,756] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 5: [2023-02-03 16:35:12,756] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 5: [2023-02-03 16:35:12,756] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 1: [2023-02-03 16:35:12,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:12,757] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 5: [2023-02-03 16:35:12,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 5: [2023-02-03 16:35:12,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 1: [2023-02-03 16:35:12,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:12,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 5: [2023-02-03 16:35:12,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 5: [2023-02-03 16:35:12,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 2: [2023-02-03 16:35:12,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:12,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 5: [2023-02-03 16:35:12,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 5: [2023-02-03 16:35:12,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 2: [2023-02-03 16:35:12,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 1: [2023-02-03 16:35:12,761] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 2: [2023-02-03 16:35:12,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 1: [2023-02-03 16:35:12,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 1: [2023-02-03 16:35:12,766] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 1: [2023-02-03 16:35:12,767] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 1: [2023-02-03 16:35:12,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 1: [2023-02-03 16:35:12,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 1: [2023-02-03 16:35:12,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 9: [2023-02-03 16:35:12,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 6: [2023-02-03 16:35:12,783] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 6: [2023-02-03 16:35:12,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 9: [2023-02-03 16:35:12,786] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 6: [2023-02-03 16:35:12,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 6: [2023-02-03 16:35:12,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 6: [2023-02-03 16:35:12,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 6: [2023-02-03 16:35:12,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 6: [2023-02-03 16:35:12,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 6: [2023-02-03 16:35:12,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 6: [2023-02-03 16:35:12,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 6: [2023-02-03 16:35:12,789] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 6: [2023-02-03 16:35:12,789] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 6: [2023-02-03 16:35:12,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 6: [2023-02-03 16:35:12,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 6: [2023-02-03 16:35:12,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 6: [2023-02-03 16:35:12,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 6: [2023-02-03 16:35:12,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 5: [2023-02-03 16:35:12,793] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 5: [2023-02-03 16:35:12,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 5: [2023-02-03 16:35:12,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 5: [2023-02-03 16:35:12,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 9: [2023-02-03 16:35:12,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:12,798] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 5: [2023-02-03 16:35:12,798] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 5: [2023-02-03 16:35:12,798] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 9: [2023-02-03 16:35:12,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 9: [2023-02-03 16:35:12,798] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 9: [2023-02-03 16:35:12,800] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 9: [2023-02-03 16:35:12,800] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 6: [2023-02-03 16:35:12,801] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 5: [2023-02-03 16:35:12,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 9: [2023-02-03 16:35:12,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 9: [2023-02-03 16:35:12,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 9: [2023-02-03 16:35:12,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 8: [2023-02-03 16:35:12,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 5: [2023-02-03 16:35:12,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 8: [2023-02-03 16:35:12,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 9: [2023-02-03 16:35:12,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:12,817] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:12,817] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:12,817] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:12,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 6: [2023-02-03 16:35:12,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 6: [2023-02-03 16:35:12,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 5: [2023-02-03 16:35:12,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 9: [2023-02-03 16:35:12,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:12,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 8: [2023-02-03 16:35:12,821] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 9: [2023-02-03 16:35:12,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:12,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 8: [2023-02-03 16:35:12,821] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 8: [2023-02-03 16:35:12,821] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 8: [2023-02-03 16:35:12,821] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 8: [2023-02-03 16:35:12,821] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 8: [2023-02-03 16:35:12,821] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 8: [2023-02-03 16:35:12,821] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 8: [2023-02-03 16:35:12,823] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 9: [2023-02-03 16:35:12,824] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 9: [2023-02-03 16:35:12,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 9: [2023-02-03 16:35:12,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 8: [2023-02-03 16:35:12,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 8: [2023-02-03 16:35:12,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 8: [2023-02-03 16:35:12,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 8: [2023-02-03 16:35:12,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 8: [2023-02-03 16:35:12,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 8: [2023-02-03 16:35:12,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 6: [2023-02-03 16:35:12,828] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 15: [2023-02-03 16:35:12,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 15: [2023-02-03 16:35:12,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 15: [2023-02-03 16:35:12,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 15: [2023-02-03 16:35:12,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 15: [2023-02-03 16:35:12,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 15: [2023-02-03 16:35:12,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 15: [2023-02-03 16:35:12,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 15: [2023-02-03 16:35:12,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 15: [2023-02-03 16:35:12,830] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 14: [2023-02-03 16:35:12,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 6: [2023-02-03 16:35:12,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 6: [2023-02-03 16:35:12,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 6: [2023-02-03 16:35:12,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 14: [2023-02-03 16:35:12,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 14: [2023-02-03 16:35:12,836] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 14: [2023-02-03 16:35:12,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 14: [2023-02-03 16:35:12,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 14: [2023-02-03 16:35:12,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 14: [2023-02-03 16:35:12,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 14: [2023-02-03 16:35:12,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 14: [2023-02-03 16:35:12,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 4: [2023-02-03 16:35:12,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 14: [2023-02-03 16:35:12,837] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 4: [2023-02-03 16:35:12,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 4: [2023-02-03 16:35:12,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 4: [2023-02-03 16:35:12,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 4: [2023-02-03 16:35:12,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 4: [2023-02-03 16:35:12,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 4: [2023-02-03 16:35:12,837] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 6: [2023-02-03 16:35:12,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 4: [2023-02-03 16:35:12,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 14: [2023-02-03 16:35:12,839] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 11: [2023-02-03 16:35:12,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 8: [2023-02-03 16:35:12,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 4: [2023-02-03 16:35:12,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 6: [2023-02-03 16:35:12,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 14: [2023-02-03 16:35:12,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 11: [2023-02-03 16:35:12,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 11: [2023-02-03 16:35:12,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 11: [2023-02-03 16:35:12,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 14: [2023-02-03 16:35:12,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 14: [2023-02-03 16:35:12,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 14: [2023-02-03 16:35:12,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 14: [2023-02-03 16:35:12,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 11: [2023-02-03 16:35:12,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 11: [2023-02-03 16:35:12,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 11: [2023-02-03 16:35:12,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 11: [2023-02-03 16:35:12,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 6: [2023-02-03 16:35:12,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 6: [2023-02-03 16:35:12,845] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 6: [2023-02-03 16:35:12,851] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 12: [2023-02-03 16:35:12,852] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 12: [2023-02-03 16:35:12,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 12: [2023-02-03 16:35:12,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 12: [2023-02-03 16:35:12,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 12: [2023-02-03 16:35:12,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 12: [2023-02-03 16:35:12,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 12: [2023-02-03 16:35:12,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 6: [2023-02-03 16:35:12,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 12: [2023-02-03 16:35:12,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 6: [2023-02-03 16:35:12,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 8: [2023-02-03 16:35:12,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 12: [2023-02-03 16:35:12,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 12: [2023-02-03 16:35:12,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 12: [2023-02-03 16:35:12,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 12: [2023-02-03 16:35:12,856] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 8: [2023-02-03 16:35:12,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 6: [2023-02-03 16:35:12,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 12: [2023-02-03 16:35:12,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 12: [2023-02-03 16:35:12,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 12: [2023-02-03 16:35:12,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 12: [2023-02-03 16:35:12,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 6: [2023-02-03 16:35:12,859] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 15: [2023-02-03 16:35:12,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 15: [2023-02-03 16:35:12,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 15: [2023-02-03 16:35:12,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 14: [2023-02-03 16:35:12,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 14: [2023-02-03 16:35:12,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 13: [2023-02-03 16:35:12,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 4: [2023-02-03 16:35:12,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 13: [2023-02-03 16:35:12,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 13: [2023-02-03 16:35:12,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 13: [2023-02-03 16:35:12,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 13: [2023-02-03 16:35:12,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 13: [2023-02-03 16:35:12,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 13: [2023-02-03 16:35:12,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 13: [2023-02-03 16:35:12,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 13: [2023-02-03 16:35:12,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 13: [2023-02-03 16:35:12,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 13: [2023-02-03 16:35:12,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 15: [2023-02-03 16:35:12,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 8: [2023-02-03 16:35:12,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 8: [2023-02-03 16:35:12,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 8: [2023-02-03 16:35:12,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:12,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 0: [2023-02-03 16:35:12,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 0: [2023-02-03 16:35:12,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 0: [2023-02-03 16:35:12,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 0: [2023-02-03 16:35:12,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 0: [2023-02-03 16:35:12,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 0: [2023-02-03 16:35:12,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 0: [2023-02-03 16:35:12,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 0: [2023-02-03 16:35:12,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 13: [2023-02-03 16:35:12,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 13: [2023-02-03 16:35:12,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 13: [2023-02-03 16:35:12,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 8: [2023-02-03 16:35:12,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 8: [2023-02-03 16:35:12,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 8: [2023-02-03 16:35:12,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 8: [2023-02-03 16:35:12,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 13: [2023-02-03 16:35:12,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 14: [2023-02-03 16:35:12,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 0: [2023-02-03 16:35:12,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 0: [2023-02-03 16:35:12,879] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,879] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 15: [2023-02-03 16:35:12,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 0: [2023-02-03 16:35:12,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 0: [2023-02-03 16:35:12,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 15: [2023-02-03 16:35:12,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 15: [2023-02-03 16:35:12,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 0: [2023-02-03 16:35:12,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 0: [2023-02-03 16:35:12,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 0: [2023-02-03 16:35:12,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 0: [2023-02-03 16:35:12,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 15: [2023-02-03 16:35:12,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 14: [2023-02-03 16:35:12,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 4: [2023-02-03 16:35:12,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 7: [2023-02-03 16:35:12,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 7: [2023-02-03 16:35:12,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 7: [2023-02-03 16:35:12,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 7: [2023-02-03 16:35:12,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 7: [2023-02-03 16:35:12,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 7: [2023-02-03 16:35:12,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 7: [2023-02-03 16:35:12,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 14: [2023-02-03 16:35:12,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 4: [2023-02-03 16:35:12,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 7: [2023-02-03 16:35:12,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 4: [2023-02-03 16:35:12,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 4: [2023-02-03 16:35:12,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 11: [2023-02-03 16:35:12,884] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 7: [2023-02-03 16:35:12,884] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 7: [2023-02-03 16:35:12,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 7: [2023-02-03 16:35:12,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 7: [2023-02-03 16:35:12,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 7: [2023-02-03 16:35:12,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 7: [2023-02-03 16:35:12,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 7: [2023-02-03 16:35:12,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 7: [2023-02-03 16:35:12,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 4: [2023-02-03 16:35:12,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 15: [2023-02-03 16:35:12,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 10: [2023-02-03 16:35:12,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 10: [2023-02-03 16:35:12,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 15: [2023-02-03 16:35:12,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 12: [2023-02-03 16:35:12,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 12: [2023-02-03 16:35:12,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 12: [2023-02-03 16:35:12,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 10: [2023-02-03 16:35:12,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 10: [2023-02-03 16:35:12,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 10: [2023-02-03 16:35:12,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 10: [2023-02-03 16:35:12,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 10: [2023-02-03 16:35:12,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 10: [2023-02-03 16:35:12,890] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 4: [2023-02-03 16:35:12,890] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 10: [2023-02-03 16:35:12,890] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 10: [2023-02-03 16:35:12,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 8: [2023-02-03 16:35:12,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 10: [2023-02-03 16:35:12,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 14: [2023-02-03 16:35:12,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 8: [2023-02-03 16:35:12,892] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 14: [2023-02-03 16:35:12,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 10: [2023-02-03 16:35:12,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 14: [2023-02-03 16:35:12,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 14: [2023-02-03 16:35:12,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 14: [2023-02-03 16:35:12,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 10: [2023-02-03 16:35:12,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 10: [2023-02-03 16:35:12,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 10: [2023-02-03 16:35:12,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 10: [2023-02-03 16:35:12,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 12: [2023-02-03 16:35:12,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 14: [2023-02-03 16:35:12,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 8: [2023-02-03 16:35:12,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 15: [2023-02-03 16:35:12,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:12,897] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 4: [2023-02-03 16:35:12,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 8: [2023-02-03 16:35:12,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 8: [2023-02-03 16:35:12,898] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 8: [2023-02-03 16:35:12,898] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 15: [2023-02-03 16:35:12,898] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 12: [2023-02-03 16:35:12,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 12: [2023-02-03 16:35:12,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 12: [2023-02-03 16:35:12,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 4: [2023-02-03 16:35:12,899] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 4: [2023-02-03 16:35:12,899] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 12: [2023-02-03 16:35:12,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,903] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 12: [2023-02-03 16:35:12,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 12: [2023-02-03 16:35:12,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 12: [2023-02-03 16:35:12,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:12,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 4: [2023-02-03 16:35:12,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 11: [2023-02-03 16:35:12,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 11: [2023-02-03 16:35:12,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 4: [2023-02-03 16:35:12,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 4: [2023-02-03 16:35:12,906] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:12,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:12,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 12: [2023-02-03 16:35:12,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 14: [2023-02-03 16:35:12,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 14: [2023-02-03 16:35:12,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 14: [2023-02-03 16:35:12,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 14: [2023-02-03 16:35:12,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 14: [2023-02-03 16:35:12,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 10: [2023-02-03 16:35:12,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 11: [2023-02-03 16:35:12,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:12,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 12: [2023-02-03 16:35:12,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 12: [2023-02-03 16:35:12,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 12: [2023-02-03 16:35:12,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 7: [2023-02-03 16:35:12,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 7: [2023-02-03 16:35:12,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 7: [2023-02-03 16:35:12,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 13: [2023-02-03 16:35:12,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 13: [2023-02-03 16:35:12,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:12,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 13: [2023-02-03 16:35:12,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 12: [2023-02-03 16:35:12,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:12,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 10: [2023-02-03 16:35:12,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 7: [2023-02-03 16:35:12,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 10: [2023-02-03 16:35:12,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 7: [2023-02-03 16:35:12,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 7: [2023-02-03 16:35:12,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 7: [2023-02-03 16:35:12,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 10: [2023-02-03 16:35:12,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 7: [2023-02-03 16:35:12,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 7: [2023-02-03 16:35:12,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 7: [2023-02-03 16:35:12,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:12,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 7: [2023-02-03 16:35:12,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 0: [2023-02-03 16:35:12,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 0: [2023-02-03 16:35:12,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 0: [2023-02-03 16:35:12,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 0: [2023-02-03 16:35:12,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 10: [2023-02-03 16:35:12,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 10: [2023-02-03 16:35:12,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 13: [2023-02-03 16:35:12,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:12,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:12,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 7: [2023-02-03 16:35:12,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:12,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 0: [2023-02-03 16:35:12,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 0: [2023-02-03 16:35:12,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 0: [2023-02-03 16:35:12,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 0: [2023-02-03 16:35:12,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 10: [2023-02-03 16:35:12,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 10: [2023-02-03 16:35:12,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 10: [2023-02-03 16:35:12,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 10: [2023-02-03 16:35:12,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 10: [2023-02-03 16:35:12,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:12,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 7: [2023-02-03 16:35:12,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 0: [2023-02-03 16:35:12,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 7: [2023-02-03 16:35:12,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 7: [2023-02-03 16:35:12,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 7: [2023-02-03 16:35:12,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 0: [2023-02-03 16:35:12,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 0: [2023-02-03 16:35:12,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 0: [2023-02-03 16:35:12,959] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 10: [2023-02-03 16:35:12,959] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 10: [2023-02-03 16:35:12,962] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 0: [2023-02-03 16:35:12,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 10: [2023-02-03 16:35:12,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 10: [2023-02-03 16:35:12,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 0: [2023-02-03 16:35:12,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 3: [2023-02-03 16:35:12,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 3: [2023-02-03 16:35:12,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 3: [2023-02-03 16:35:12,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 3: [2023-02-03 16:35:12,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 3: [2023-02-03 16:35:12,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 3: [2023-02-03 16:35:12,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 3: [2023-02-03 16:35:12,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 10: [2023-02-03 16:35:12,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 3: [2023-02-03 16:35:12,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 0: [2023-02-03 16:35:12,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 0: [2023-02-03 16:35:12,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 3: [2023-02-03 16:35:12,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 3: [2023-02-03 16:35:12,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 3: [2023-02-03 16:35:12,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 3: [2023-02-03 16:35:12,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 3: [2023-02-03 16:35:12,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 3: [2023-02-03 16:35:12,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 3: [2023-02-03 16:35:12,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 3: [2023-02-03 16:35:12,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt... 3: [2023-02-03 16:35:13,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 3: [2023-02-03 16:35:13,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 3: [2023-02-03 16:35:13,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 3: [2023-02-03 16:35:13,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 3: [2023-02-03 16:35:13,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 3: [2023-02-03 16:35:13,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 3: [2023-02-03 16:35:13,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 3: [2023-02-03 16:35:13,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_28-model_00-model_states.pt. 3: [2023-02-03 16:35:13,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 3: [2023-02-03 16:35:13,020] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 3: [2023-02-03 16:35:13,021] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:13,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 5: [2023-02-03 16:35:13,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 5: [2023-02-03 16:35:13,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 5: [2023-02-03 16:35:13,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 5: [2023-02-03 16:35:13,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 5: [2023-02-03 16:35:13,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 5: [2023-02-03 16:35:13,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 5: [2023-02-03 16:35:13,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 5: [2023-02-03 16:35:13,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:13,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:13,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:13,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:13,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:13,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:13,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:13,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 3: [2023-02-03 16:35:13,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 3: [2023-02-03 16:35:13,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 3: [2023-02-03 16:35:13,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 3: [2023-02-03 16:35:13,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 3: [2023-02-03 16:35:13,038] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 1: [2023-02-03 16:35:13,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 1: [2023-02-03 16:35:13,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 1: [2023-02-03 16:35:13,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 1: [2023-02-03 16:35:13,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 1: [2023-02-03 16:35:13,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 1: [2023-02-03 16:35:13,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 1: [2023-02-03 16:35:13,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 1: [2023-02-03 16:35:13,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 1: [2023-02-03 16:35:13,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 1: [2023-02-03 16:35:13,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 1: [2023-02-03 16:35:13,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 1: [2023-02-03 16:35:13,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 1: [2023-02-03 16:35:13,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 1: [2023-02-03 16:35:13,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 1: [2023-02-03 16:35:13,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 1: [2023-02-03 16:35:13,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:13,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 5: [2023-02-03 16:35:13,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 5: [2023-02-03 16:35:13,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 5: [2023-02-03 16:35:13,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 5: [2023-02-03 16:35:13,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 5: [2023-02-03 16:35:13,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 5: [2023-02-03 16:35:13,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 5: [2023-02-03 16:35:13,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 2: [2023-02-03 16:35:13,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 2: [2023-02-03 16:35:13,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 2: [2023-02-03 16:35:13,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 2: [2023-02-03 16:35:13,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 2: [2023-02-03 16:35:13,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 2: [2023-02-03 16:35:13,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 2: [2023-02-03 16:35:13,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 2: [2023-02-03 16:35:13,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 2: [2023-02-03 16:35:13,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 2: [2023-02-03 16:35:13,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 2: [2023-02-03 16:35:13,079] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 2: [2023-02-03 16:35:13,079] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 2: [2023-02-03 16:35:13,079] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 2: [2023-02-03 16:35:13,079] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 2: [2023-02-03 16:35:13,079] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 2: [2023-02-03 16:35:13,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 9: [2023-02-03 16:35:13,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 9: [2023-02-03 16:35:13,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 9: [2023-02-03 16:35:13,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 9: [2023-02-03 16:35:13,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 9: [2023-02-03 16:35:13,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 9: [2023-02-03 16:35:13,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 9: [2023-02-03 16:35:13,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 9: [2023-02-03 16:35:13,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 9: [2023-02-03 16:35:13,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 9: [2023-02-03 16:35:13,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:13,088] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 5: [2023-02-03 16:35:13,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 9: [2023-02-03 16:35:13,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 9: [2023-02-03 16:35:13,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:13,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 9: [2023-02-03 16:35:13,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:13,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 5: [2023-02-03 16:35:13,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 9: [2023-02-03 16:35:13,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 9: [2023-02-03 16:35:13,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 9: [2023-02-03 16:35:13,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 5: [2023-02-03 16:35:13,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 5: [2023-02-03 16:35:13,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 5: [2023-02-03 16:35:13,092] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 1: [2023-02-03 16:35:13,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 1: [2023-02-03 16:35:13,099] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 1: [2023-02-03 16:35:13,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 1: [2023-02-03 16:35:13,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 1: [2023-02-03 16:35:13,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 8: [2023-02-03 16:35:13,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 8: [2023-02-03 16:35:13,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 1: [2023-02-03 16:35:13,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 1: [2023-02-03 16:35:13,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 1: [2023-02-03 16:35:13,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 8: [2023-02-03 16:35:13,103] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 8: [2023-02-03 16:35:13,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 8: [2023-02-03 16:35:13,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 8: [2023-02-03 16:35:13,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 8: [2023-02-03 16:35:13,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 8: [2023-02-03 16:35:13,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 8: [2023-02-03 16:35:13,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 8: [2023-02-03 16:35:13,104] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 9: [2023-02-03 16:35:13,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 2: [2023-02-03 16:35:13,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 8: [2023-02-03 16:35:13,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 8: [2023-02-03 16:35:13,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 8: [2023-02-03 16:35:13,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 8: [2023-02-03 16:35:13,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 8: [2023-02-03 16:35:13,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 8: [2023-02-03 16:35:13,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 1: [2023-02-03 16:35:13,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 2: [2023-02-03 16:35:13,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 2: [2023-02-03 16:35:13,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 2: [2023-02-03 16:35:13,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 9: [2023-02-03 16:35:13,118] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 2: [2023-02-03 16:35:13,119] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 2: [2023-02-03 16:35:13,119] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 9: [2023-02-03 16:35:13,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 1: [2023-02-03 16:35:13,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 2: [2023-02-03 16:35:13,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 1: [2023-02-03 16:35:13,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 1: [2023-02-03 16:35:13,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 1: [2023-02-03 16:35:13,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 1: [2023-02-03 16:35:13,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 6: [2023-02-03 16:35:13,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 1: [2023-02-03 16:35:13,123] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 1: [2023-02-03 16:35:13,123] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 6: [2023-02-03 16:35:13,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 6: [2023-02-03 16:35:13,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 6: [2023-02-03 16:35:13,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 6: [2023-02-03 16:35:13,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 6: [2023-02-03 16:35:13,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 6: [2023-02-03 16:35:13,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 6: [2023-02-03 16:35:13,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 6: [2023-02-03 16:35:13,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 6: [2023-02-03 16:35:13,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 8: [2023-02-03 16:35:13,127] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 4: [2023-02-03 16:35:13,127] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 6: [2023-02-03 16:35:13,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 4: [2023-02-03 16:35:13,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 2: [2023-02-03 16:35:13,128] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 6: [2023-02-03 16:35:13,129] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 2: [2023-02-03 16:35:13,129] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 14: [2023-02-03 16:35:13,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 2: [2023-02-03 16:35:13,131] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 10: [2023-02-03 16:35:13,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 4: [2023-02-03 16:35:13,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 4: [2023-02-03 16:35:13,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 4: [2023-02-03 16:35:13,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 4: [2023-02-03 16:35:13,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 4: [2023-02-03 16:35:13,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 4: [2023-02-03 16:35:13,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 4: [2023-02-03 16:35:13,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 10: [2023-02-03 16:35:13,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 4: [2023-02-03 16:35:13,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 10: [2023-02-03 16:35:13,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 10: [2023-02-03 16:35:13,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 10: [2023-02-03 16:35:13,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 10: [2023-02-03 16:35:13,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 10: [2023-02-03 16:35:13,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 4: [2023-02-03 16:35:13,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 10: [2023-02-03 16:35:13,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 10: [2023-02-03 16:35:13,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 10: [2023-02-03 16:35:13,134] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 14: [2023-02-03 16:35:13,134] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 4: [2023-02-03 16:35:13,134] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 4: [2023-02-03 16:35:13,134] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 4: [2023-02-03 16:35:13,134] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 4: [2023-02-03 16:35:13,134] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 4: [2023-02-03 16:35:13,134] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 14: [2023-02-03 16:35:13,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 14: [2023-02-03 16:35:13,135] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 14: [2023-02-03 16:35:13,135] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 14: [2023-02-03 16:35:13,135] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 14: [2023-02-03 16:35:13,135] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 14: [2023-02-03 16:35:13,135] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 14: [2023-02-03 16:35:13,135] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 14: [2023-02-03 16:35:13,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 2: [2023-02-03 16:35:13,136] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 10: [2023-02-03 16:35:13,136] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 15: [2023-02-03 16:35:13,136] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 2: [2023-02-03 16:35:13,136] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 9: [2023-02-03 16:35:13,136] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 10: [2023-02-03 16:35:13,136] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 15: [2023-02-03 16:35:13,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 15: [2023-02-03 16:35:13,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 15: [2023-02-03 16:35:13,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 11: [2023-02-03 16:35:13,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 11: [2023-02-03 16:35:13,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 9: [2023-02-03 16:35:13,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 15: [2023-02-03 16:35:13,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 15: [2023-02-03 16:35:13,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 15: [2023-02-03 16:35:13,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 15: [2023-02-03 16:35:13,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 10: [2023-02-03 16:35:13,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 11: [2023-02-03 16:35:13,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 11: [2023-02-03 16:35:13,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 11: [2023-02-03 16:35:13,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 11: [2023-02-03 16:35:13,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 11: [2023-02-03 16:35:13,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 10: [2023-02-03 16:35:13,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 10: [2023-02-03 16:35:13,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 10: [2023-02-03 16:35:13,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 11: [2023-02-03 16:35:13,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 9: [2023-02-03 16:35:13,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 9: [2023-02-03 16:35:13,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 9: [2023-02-03 16:35:13,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 11: [2023-02-03 16:35:13,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 11: [2023-02-03 16:35:13,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 8: [2023-02-03 16:35:13,139] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 15: [2023-02-03 16:35:13,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 15: [2023-02-03 16:35:13,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 11: [2023-02-03 16:35:13,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 11: [2023-02-03 16:35:13,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 15: [2023-02-03 16:35:13,140] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 15: [2023-02-03 16:35:13,140] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 2: [2023-02-03 16:35:13,140] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 14: [2023-02-03 16:35:13,140] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 15: [2023-02-03 16:35:13,140] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 15: [2023-02-03 16:35:13,140] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 11: [2023-02-03 16:35:13,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 11: [2023-02-03 16:35:13,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 11: [2023-02-03 16:35:13,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 11: [2023-02-03 16:35:13,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 15: [2023-02-03 16:35:13,140] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 14: [2023-02-03 16:35:13,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 14: [2023-02-03 16:35:13,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 14: [2023-02-03 16:35:13,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 14: [2023-02-03 16:35:13,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 8: [2023-02-03 16:35:13,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 14: [2023-02-03 16:35:13,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 2: [2023-02-03 16:35:13,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 2: [2023-02-03 16:35:13,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 15: [2023-02-03 16:35:13,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 9: [2023-02-03 16:35:13,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 2: [2023-02-03 16:35:13,143] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 13: [2023-02-03 16:35:13,144] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 13: [2023-02-03 16:35:13,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 13: [2023-02-03 16:35:13,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 13: [2023-02-03 16:35:13,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 13: [2023-02-03 16:35:13,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 13: [2023-02-03 16:35:13,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 13: [2023-02-03 16:35:13,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 13: [2023-02-03 16:35:13,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 13: [2023-02-03 16:35:13,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:13,148] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:13,148] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:13,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:13,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 6: [2023-02-03 16:35:13,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 13: [2023-02-03 16:35:13,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:13,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:13,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 9: [2023-02-03 16:35:13,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 8: [2023-02-03 16:35:13,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 4: [2023-02-03 16:35:13,154] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 9: [2023-02-03 16:35:13,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 8: [2023-02-03 16:35:13,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 9: [2023-02-03 16:35:13,157] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 8: [2023-02-03 16:35:13,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 8: [2023-02-03 16:35:13,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 8: [2023-02-03 16:35:13,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 8: [2023-02-03 16:35:13,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 14: [2023-02-03 16:35:13,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 9: [2023-02-03 16:35:13,158] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 9: [2023-02-03 16:35:13,159] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 9: [2023-02-03 16:35:13,161] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 8: [2023-02-03 16:35:13,162] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 12: [2023-02-03 16:35:13,162] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 6: [2023-02-03 16:35:13,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 6: [2023-02-03 16:35:13,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 12: [2023-02-03 16:35:13,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 12: [2023-02-03 16:35:13,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 12: [2023-02-03 16:35:13,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 6: [2023-02-03 16:35:13,164] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 12: [2023-02-03 16:35:13,164] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 12: [2023-02-03 16:35:13,164] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 12: [2023-02-03 16:35:13,164] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 12: [2023-02-03 16:35:13,164] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 10: [2023-02-03 16:35:13,165] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 10: [2023-02-03 16:35:13,165] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 12: [2023-02-03 16:35:13,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 12: [2023-02-03 16:35:13,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 12: [2023-02-03 16:35:13,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 12: [2023-02-03 16:35:13,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 14: [2023-02-03 16:35:13,166] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 11: [2023-02-03 16:35:13,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 6: [2023-02-03 16:35:13,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 12: [2023-02-03 16:35:13,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 12: [2023-02-03 16:35:13,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 12: [2023-02-03 16:35:13,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 12: [2023-02-03 16:35:13,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 9: [2023-02-03 16:35:13,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 4: [2023-02-03 16:35:13,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 10: [2023-02-03 16:35:13,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 15: [2023-02-03 16:35:13,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 4: [2023-02-03 16:35:13,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 14: [2023-02-03 16:35:13,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 4: [2023-02-03 16:35:13,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 11: [2023-02-03 16:35:13,174] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 4: [2023-02-03 16:35:13,174] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 4: [2023-02-03 16:35:13,174] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 4: [2023-02-03 16:35:13,174] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 4: [2023-02-03 16:35:13,174] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 8: [2023-02-03 16:35:13,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 8: [2023-02-03 16:35:13,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 13: [2023-02-03 16:35:13,177] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 11: [2023-02-03 16:35:13,177] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 10: [2023-02-03 16:35:13,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 10: [2023-02-03 16:35:13,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 15: [2023-02-03 16:35:13,178] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 15: [2023-02-03 16:35:13,178] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 15: [2023-02-03 16:35:13,178] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 15: [2023-02-03 16:35:13,178] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 4: [2023-02-03 16:35:13,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 11: [2023-02-03 16:35:13,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 15: [2023-02-03 16:35:13,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 8: [2023-02-03 16:35:13,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 8: [2023-02-03 16:35:13,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 13: [2023-02-03 16:35:13,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 8: [2023-02-03 16:35:13,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 6: [2023-02-03 16:35:13,182] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 6: [2023-02-03 16:35:13,182] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 8: [2023-02-03 16:35:13,182] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 11: [2023-02-03 16:35:13,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 11: [2023-02-03 16:35:13,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 11: [2023-02-03 16:35:13,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 11: [2023-02-03 16:35:13,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 14: [2023-02-03 16:35:13,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 10: [2023-02-03 16:35:13,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 15: [2023-02-03 16:35:13,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 13: [2023-02-03 16:35:13,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 15: [2023-02-03 16:35:13,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 14: [2023-02-03 16:35:13,186] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 14: [2023-02-03 16:35:13,186] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 10: [2023-02-03 16:35:13,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 10: [2023-02-03 16:35:13,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 10: [2023-02-03 16:35:13,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 10: [2023-02-03 16:35:13,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 13: [2023-02-03 16:35:13,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 11: [2023-02-03 16:35:13,188] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 14: [2023-02-03 16:35:13,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 7: [2023-02-03 16:35:13,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 11: [2023-02-03 16:35:13,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 7: [2023-02-03 16:35:13,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 7: [2023-02-03 16:35:13,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 7: [2023-02-03 16:35:13,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 7: [2023-02-03 16:35:13,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 7: [2023-02-03 16:35:13,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 7: [2023-02-03 16:35:13,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 4: [2023-02-03 16:35:13,191] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 7: [2023-02-03 16:35:13,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 14: [2023-02-03 16:35:13,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 14: [2023-02-03 16:35:13,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 7: [2023-02-03 16:35:13,192] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:13,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 13: [2023-02-03 16:35:13,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 13: [2023-02-03 16:35:13,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 14: [2023-02-03 16:35:13,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 7: [2023-02-03 16:35:13,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 15: [2023-02-03 16:35:13,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 4: [2023-02-03 16:35:13,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 15: [2023-02-03 16:35:13,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 7: [2023-02-03 16:35:13,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 7: [2023-02-03 16:35:13,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 7: [2023-02-03 16:35:13,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:13,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 4: [2023-02-03 16:35:13,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 7: [2023-02-03 16:35:13,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 7: [2023-02-03 16:35:13,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 7: [2023-02-03 16:35:13,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 11: [2023-02-03 16:35:13,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 15: [2023-02-03 16:35:13,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 15: [2023-02-03 16:35:13,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 4: [2023-02-03 16:35:13,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 13: [2023-02-03 16:35:13,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 10: [2023-02-03 16:35:13,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 11: [2023-02-03 16:35:13,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 15: [2023-02-03 16:35:13,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 15: [2023-02-03 16:35:13,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 13: [2023-02-03 16:35:13,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 11: [2023-02-03 16:35:13,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 4: [2023-02-03 16:35:13,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 4: [2023-02-03 16:35:13,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 4: [2023-02-03 16:35:13,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 12: [2023-02-03 16:35:13,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 12: [2023-02-03 16:35:13,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 15: [2023-02-03 16:35:13,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 13: [2023-02-03 16:35:13,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 12: [2023-02-03 16:35:13,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 11: [2023-02-03 16:35:13,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 14: [2023-02-03 16:35:13,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 11: [2023-02-03 16:35:13,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 14: [2023-02-03 16:35:13,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 12: [2023-02-03 16:35:13,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 12: [2023-02-03 16:35:13,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 15: [2023-02-03 16:35:13,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 10: [2023-02-03 16:35:13,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 10: [2023-02-03 16:35:13,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 0: [2023-02-03 16:35:13,209] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 0: [2023-02-03 16:35:13,209] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 10: [2023-02-03 16:35:13,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 0: [2023-02-03 16:35:13,209] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 0: [2023-02-03 16:35:13,209] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 0: [2023-02-03 16:35:13,209] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 0: [2023-02-03 16:35:13,209] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 0: [2023-02-03 16:35:13,209] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 0: [2023-02-03 16:35:13,210] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 11: [2023-02-03 16:35:13,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 14: [2023-02-03 16:35:13,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 14: [2023-02-03 16:35:13,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 12: [2023-02-03 16:35:13,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 12: [2023-02-03 16:35:13,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 12: [2023-02-03 16:35:13,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 0: [2023-02-03 16:35:13,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 10: [2023-02-03 16:35:13,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 13: [2023-02-03 16:35:13,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 13: [2023-02-03 16:35:13,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 14: [2023-02-03 16:35:13,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 14: [2023-02-03 16:35:13,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 10: [2023-02-03 16:35:13,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 0: [2023-02-03 16:35:13,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 13: [2023-02-03 16:35:13,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 0: [2023-02-03 16:35:13,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 0: [2023-02-03 16:35:13,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 0: [2023-02-03 16:35:13,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 0: [2023-02-03 16:35:13,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 0: [2023-02-03 16:35:13,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 0: [2023-02-03 16:35:13,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 12: [2023-02-03 16:35:13,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 12: [2023-02-03 16:35:13,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 12: [2023-02-03 16:35:13,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 13: [2023-02-03 16:35:13,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 13: [2023-02-03 16:35:13,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 7: [2023-02-03 16:35:13,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 12: [2023-02-03 16:35:13,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 12: [2023-02-03 16:35:13,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 7: [2023-02-03 16:35:13,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 12: [2023-02-03 16:35:13,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 12: [2023-02-03 16:35:13,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 7: [2023-02-03 16:35:13,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 12: [2023-02-03 16:35:13,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 7: [2023-02-03 16:35:13,232] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 7: [2023-02-03 16:35:13,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 7: [2023-02-03 16:35:13,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 7: [2023-02-03 16:35:13,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 7: [2023-02-03 16:35:13,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 7: [2023-02-03 16:35:13,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 7: [2023-02-03 16:35:13,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 7: [2023-02-03 16:35:13,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 7: [2023-02-03 16:35:13,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 7: [2023-02-03 16:35:13,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 7: [2023-02-03 16:35:13,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 7: [2023-02-03 16:35:13,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 7: [2023-02-03 16:35:13,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 0: [2023-02-03 16:35:13,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 0: [2023-02-03 16:35:13,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 0: [2023-02-03 16:35:13,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 0: [2023-02-03 16:35:13,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 0: [2023-02-03 16:35:13,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 0: [2023-02-03 16:35:13,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 0: [2023-02-03 16:35:13,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 0: [2023-02-03 16:35:13,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 0: [2023-02-03 16:35:13,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 0: [2023-02-03 16:35:13,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 3: [2023-02-03 16:35:13,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 3: [2023-02-03 16:35:13,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 3: [2023-02-03 16:35:13,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 3: [2023-02-03 16:35:13,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 3: [2023-02-03 16:35:13,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 3: [2023-02-03 16:35:13,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 3: [2023-02-03 16:35:13,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 3: [2023-02-03 16:35:13,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 3: [2023-02-03 16:35:13,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 3: [2023-02-03 16:35:13,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 3: [2023-02-03 16:35:13,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 3: [2023-02-03 16:35:13,292] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 3: [2023-02-03 16:35:13,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 3: [2023-02-03 16:35:13,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 3: [2023-02-03 16:35:13,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 3: [2023-02-03 16:35:13,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt... 0: [2023-02-03 16:35:13,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 0: [2023-02-03 16:35:13,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 0: [2023-02-03 16:35:13,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 0: [2023-02-03 16:35:13,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 0: [2023-02-03 16:35:13,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 0: [2023-02-03 16:35:13,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 3: [2023-02-03 16:35:13,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 3: [2023-02-03 16:35:13,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 3: [2023-02-03 16:35:13,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 3: [2023-02-03 16:35:13,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 3: [2023-02-03 16:35:13,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 3: [2023-02-03 16:35:13,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 3: [2023-02-03 16:35:13,337] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 3: [2023-02-03 16:35:13,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 3: [2023-02-03 16:35:13,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 3: [2023-02-03 16:35:13,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_29-model_00-model_states.pt. 3: [2023-02-03 16:35:13,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 3: [2023-02-03 16:35:13,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 3: [2023-02-03 16:35:13,355] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 3: [2023-02-03 16:35:13,355] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 3: [2023-02-03 16:35:13,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 3: [2023-02-03 16:35:13,357] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 1: [2023-02-03 16:35:13,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 1: [2023-02-03 16:35:13,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 1: [2023-02-03 16:35:13,405] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 1: [2023-02-03 16:35:13,405] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 1: [2023-02-03 16:35:13,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 1: [2023-02-03 16:35:13,407] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 1: [2023-02-03 16:35:13,407] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 1: [2023-02-03 16:35:13,407] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 8: [2023-02-03 16:35:13,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 8: [2023-02-03 16:35:13,417] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 8: [2023-02-03 16:35:13,418] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 8: [2023-02-03 16:35:13,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 8: [2023-02-03 16:35:13,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 8: [2023-02-03 16:35:13,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 8: [2023-02-03 16:35:13,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 8: [2023-02-03 16:35:13,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 8: [2023-02-03 16:35:13,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 9: [2023-02-03 16:35:13,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 8: [2023-02-03 16:35:13,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 8: [2023-02-03 16:35:13,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 9: [2023-02-03 16:35:13,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 9: [2023-02-03 16:35:13,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 9: [2023-02-03 16:35:13,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 9: [2023-02-03 16:35:13,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 9: [2023-02-03 16:35:13,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 9: [2023-02-03 16:35:13,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 9: [2023-02-03 16:35:13,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 9: [2023-02-03 16:35:13,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 8: [2023-02-03 16:35:13,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 9: [2023-02-03 16:35:13,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 5: [2023-02-03 16:35:13,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,424] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,424] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,424] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,424] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,424] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 8: [2023-02-03 16:35:13,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 8: [2023-02-03 16:35:13,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 8: [2023-02-03 16:35:13,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 5: [2023-02-03 16:35:13,424] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 8: [2023-02-03 16:35:13,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 5: [2023-02-03 16:35:13,425] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 5: [2023-02-03 16:35:13,427] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 2: [2023-02-03 16:35:13,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,427] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 11: [2023-02-03 16:35:13,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 11: [2023-02-03 16:35:13,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,427] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 5: [2023-02-03 16:35:13,427] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 11: [2023-02-03 16:35:13,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 11: [2023-02-03 16:35:13,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 11: [2023-02-03 16:35:13,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 11: [2023-02-03 16:35:13,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 11: [2023-02-03 16:35:13,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 11: [2023-02-03 16:35:13,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 5: [2023-02-03 16:35:13,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 5: [2023-02-03 16:35:13,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 5: [2023-02-03 16:35:13,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 9: [2023-02-03 16:35:13,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 9: [2023-02-03 16:35:13,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 11: [2023-02-03 16:35:13,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 9: [2023-02-03 16:35:13,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 9: [2023-02-03 16:35:13,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 9: [2023-02-03 16:35:13,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 9: [2023-02-03 16:35:13,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 11: [2023-02-03 16:35:13,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 11: [2023-02-03 16:35:13,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 11: [2023-02-03 16:35:13,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 11: [2023-02-03 16:35:13,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 2: [2023-02-03 16:35:13,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 11: [2023-02-03 16:35:13,431] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 2: [2023-02-03 16:35:13,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 11: [2023-02-03 16:35:13,431] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 2: [2023-02-03 16:35:13,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 11: [2023-02-03 16:35:13,431] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 2: [2023-02-03 16:35:13,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,432] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 2: [2023-02-03 16:35:13,433] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 2: [2023-02-03 16:35:13,433] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 10: [2023-02-03 16:35:13,433] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 10: [2023-02-03 16:35:13,434] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 10: [2023-02-03 16:35:13,434] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 10: [2023-02-03 16:35:13,434] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 10: [2023-02-03 16:35:13,434] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 10: [2023-02-03 16:35:13,435] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 10: [2023-02-03 16:35:13,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 10: [2023-02-03 16:35:13,435] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 10: [2023-02-03 16:35:13,435] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 2: [2023-02-03 16:35:13,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 2: [2023-02-03 16:35:13,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 10: [2023-02-03 16:35:13,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 1: [2023-02-03 16:35:13,436] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,436] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 10: [2023-02-03 16:35:13,437] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 13: [2023-02-03 16:35:13,437] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 10: [2023-02-03 16:35:13,437] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 15: [2023-02-03 16:35:13,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 15: [2023-02-03 16:35:13,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 15: [2023-02-03 16:35:13,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 15: [2023-02-03 16:35:13,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 15: [2023-02-03 16:35:13,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 15: [2023-02-03 16:35:13,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 15: [2023-02-03 16:35:13,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 15: [2023-02-03 16:35:13,439] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 13: [2023-02-03 16:35:13,439] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 10: [2023-02-03 16:35:13,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 15: [2023-02-03 16:35:13,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 15: [2023-02-03 16:35:13,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 13: [2023-02-03 16:35:13,440] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 13: [2023-02-03 16:35:13,440] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 10: [2023-02-03 16:35:13,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 10: [2023-02-03 16:35:13,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 10: [2023-02-03 16:35:13,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 14: [2023-02-03 16:35:13,441] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 13: [2023-02-03 16:35:13,440] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 13: [2023-02-03 16:35:13,441] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 13: [2023-02-03 16:35:13,441] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 13: [2023-02-03 16:35:13,441] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 13: [2023-02-03 16:35:13,441] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 13: [2023-02-03 16:35:13,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 15: [2023-02-03 16:35:13,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 15: [2023-02-03 16:35:13,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 15: [2023-02-03 16:35:13,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 15: [2023-02-03 16:35:13,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 15: [2023-02-03 16:35:13,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 15: [2023-02-03 16:35:13,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 13: [2023-02-03 16:35:13,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 8: [2023-02-03 16:35:13,442] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 14: [2023-02-03 16:35:13,443] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 1: [2023-02-03 16:35:13,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 14: [2023-02-03 16:35:13,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 14: [2023-02-03 16:35:13,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 14: [2023-02-03 16:35:13,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 14: [2023-02-03 16:35:13,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 14: [2023-02-03 16:35:13,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 14: [2023-02-03 16:35:13,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 14: [2023-02-03 16:35:13,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 14: [2023-02-03 16:35:13,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 13: [2023-02-03 16:35:13,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 13: [2023-02-03 16:35:13,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 13: [2023-02-03 16:35:13,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 6: [2023-02-03 16:35:13,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 13: [2023-02-03 16:35:13,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 13: [2023-02-03 16:35:13,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 14: [2023-02-03 16:35:13,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 6: [2023-02-03 16:35:13,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 6: [2023-02-03 16:35:13,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 6: [2023-02-03 16:35:13,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 14: [2023-02-03 16:35:13,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 14: [2023-02-03 16:35:13,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 4: [2023-02-03 16:35:13,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 6: [2023-02-03 16:35:13,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 6: [2023-02-03 16:35:13,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 4: [2023-02-03 16:35:13,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 4: [2023-02-03 16:35:13,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 4: [2023-02-03 16:35:13,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 4: [2023-02-03 16:35:13,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 4: [2023-02-03 16:35:13,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 4: [2023-02-03 16:35:13,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 6: [2023-02-03 16:35:13,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 4: [2023-02-03 16:35:13,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 14: [2023-02-03 16:35:13,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 14: [2023-02-03 16:35:13,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 14: [2023-02-03 16:35:13,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 4: [2023-02-03 16:35:13,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 4: [2023-02-03 16:35:13,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 4: [2023-02-03 16:35:13,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 4: [2023-02-03 16:35:13,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 4: [2023-02-03 16:35:13,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 4: [2023-02-03 16:35:13,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 4: [2023-02-03 16:35:13,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 4: [2023-02-03 16:35:13,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 2: [2023-02-03 16:35:13,452] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 9: [2023-02-03 16:35:13,454] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 9: [2023-02-03 16:35:13,454] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 11: [2023-02-03 16:35:13,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,456] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 8: [2023-02-03 16:35:13,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 8: [2023-02-03 16:35:13,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 7: [2023-02-03 16:35:13,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 7: [2023-02-03 16:35:13,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 7: [2023-02-03 16:35:13,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 7: [2023-02-03 16:35:13,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 7: [2023-02-03 16:35:13,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 7: [2023-02-03 16:35:13,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 7: [2023-02-03 16:35:13,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 7: [2023-02-03 16:35:13,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 12: [2023-02-03 16:35:13,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 10: [2023-02-03 16:35:13,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 7: [2023-02-03 16:35:13,459] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 7: [2023-02-03 16:35:13,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 12: [2023-02-03 16:35:13,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 12: [2023-02-03 16:35:13,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 12: [2023-02-03 16:35:13,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 7: [2023-02-03 16:35:13,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 1: [2023-02-03 16:35:13,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 8: [2023-02-03 16:35:13,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 8: [2023-02-03 16:35:13,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 1: [2023-02-03 16:35:13,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 1: [2023-02-03 16:35:13,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 12: [2023-02-03 16:35:13,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 12: [2023-02-03 16:35:13,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 12: [2023-02-03 16:35:13,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 12: [2023-02-03 16:35:13,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 8: [2023-02-03 16:35:13,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 12: [2023-02-03 16:35:13,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 7: [2023-02-03 16:35:13,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 1: [2023-02-03 16:35:13,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 7: [2023-02-03 16:35:13,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 1: [2023-02-03 16:35:13,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 7: [2023-02-03 16:35:13,463] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 7: [2023-02-03 16:35:13,463] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 7: [2023-02-03 16:35:13,463] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 12: [2023-02-03 16:35:13,463] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 12: [2023-02-03 16:35:13,463] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 8: [2023-02-03 16:35:13,463] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,464] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 0: [2023-02-03 16:35:13,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 0: [2023-02-03 16:35:13,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 0: [2023-02-03 16:35:13,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 0: [2023-02-03 16:35:13,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 0: [2023-02-03 16:35:13,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 0: [2023-02-03 16:35:13,464] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 0: [2023-02-03 16:35:13,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 0: [2023-02-03 16:35:13,465] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 12: [2023-02-03 16:35:13,465] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 1: [2023-02-03 16:35:13,465] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 13: [2023-02-03 16:35:13,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 12: [2023-02-03 16:35:13,465] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 12: [2023-02-03 16:35:13,465] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 12: [2023-02-03 16:35:13,465] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 12: [2023-02-03 16:35:13,465] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 5: [2023-02-03 16:35:13,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 9: [2023-02-03 16:35:13,466] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,466] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 1: [2023-02-03 16:35:13,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 1: [2023-02-03 16:35:13,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 1: [2023-02-03 16:35:13,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 1: [2023-02-03 16:35:13,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 1: [2023-02-03 16:35:13,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 1: [2023-02-03 16:35:13,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 11: [2023-02-03 16:35:13,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 1: [2023-02-03 16:35:13,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 1: [2023-02-03 16:35:13,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 1: [2023-02-03 16:35:13,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 1: [2023-02-03 16:35:13,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 1: [2023-02-03 16:35:13,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 1: [2023-02-03 16:35:13,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 1: [2023-02-03 16:35:13,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 1: [2023-02-03 16:35:13,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 1: [2023-02-03 16:35:13,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 1: [2023-02-03 16:35:13,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 9: [2023-02-03 16:35:13,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 5: [2023-02-03 16:35:13,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 6: [2023-02-03 16:35:13,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,469] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 11: [2023-02-03 16:35:13,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 9: [2023-02-03 16:35:13,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 9: [2023-02-03 16:35:13,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 5: [2023-02-03 16:35:13,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 9: [2023-02-03 16:35:13,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 10: [2023-02-03 16:35:13,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 0: [2023-02-03 16:35:13,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 2: [2023-02-03 16:35:13,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 0: [2023-02-03 16:35:13,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 0: [2023-02-03 16:35:13,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 2: [2023-02-03 16:35:13,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 8: [2023-02-03 16:35:13,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 14: [2023-02-03 16:35:13,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 9: [2023-02-03 16:35:13,472] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 8: [2023-02-03 16:35:13,472] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 9: [2023-02-03 16:35:13,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 9: [2023-02-03 16:35:13,473] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 8: [2023-02-03 16:35:13,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 8: [2023-02-03 16:35:13,473] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 9: [2023-02-03 16:35:13,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 8: [2023-02-03 16:35:13,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 13: [2023-02-03 16:35:13,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 13: [2023-02-03 16:35:13,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 10: [2023-02-03 16:35:13,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,475] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,475] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 11: [2023-02-03 16:35:13,475] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 9: [2023-02-03 16:35:13,476] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,476] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,477] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 14: [2023-02-03 16:35:13,476] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 10: [2023-02-03 16:35:13,477] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 10: [2023-02-03 16:35:13,477] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 8: [2023-02-03 16:35:13,477] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 8: [2023-02-03 16:35:13,477] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 8: [2023-02-03 16:35:13,477] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 5: [2023-02-03 16:35:13,478] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 10: [2023-02-03 16:35:13,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 8: [2023-02-03 16:35:13,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 6: [2023-02-03 16:35:13,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 15: [2023-02-03 16:35:13,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 15: [2023-02-03 16:35:13,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 15: [2023-02-03 16:35:13,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,478] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,479] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 11: [2023-02-03 16:35:13,479] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,479] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 11: [2023-02-03 16:35:13,479] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 1: [2023-02-03 16:35:13,479] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 1: [2023-02-03 16:35:13,479] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,479] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 1: [2023-02-03 16:35:13,479] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 11: [2023-02-03 16:35:13,479] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 9: [2023-02-03 16:35:13,479] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 9: [2023-02-03 16:35:13,479] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 9: [2023-02-03 16:35:13,479] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 13: [2023-02-03 16:35:13,479] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 10: [2023-02-03 16:35:13,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 15: [2023-02-03 16:35:13,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 15: [2023-02-03 16:35:13,481] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 15: [2023-02-03 16:35:13,481] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 15: [2023-02-03 16:35:13,481] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 15: [2023-02-03 16:35:13,481] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 11: [2023-02-03 16:35:13,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 5: [2023-02-03 16:35:13,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 11: [2023-02-03 16:35:13,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 11: [2023-02-03 16:35:13,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 10: [2023-02-03 16:35:13,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 11: [2023-02-03 16:35:13,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 11: [2023-02-03 16:35:13,483] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 14: [2023-02-03 16:35:13,483] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 10: [2023-02-03 16:35:13,483] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 10: [2023-02-03 16:35:13,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 10: [2023-02-03 16:35:13,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 8: [2023-02-03 16:35:13,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 2: [2023-02-03 16:35:13,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 10: [2023-02-03 16:35:13,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 9: [2023-02-03 16:35:13,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 8: [2023-02-03 16:35:13,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 8: [2023-02-03 16:35:13,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 2: [2023-02-03 16:35:13,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 9: [2023-02-03 16:35:13,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 13: [2023-02-03 16:35:13,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 2: [2023-02-03 16:35:13,485] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 9: [2023-02-03 16:35:13,485] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 8: [2023-02-03 16:35:13,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 13: [2023-02-03 16:35:13,485] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 2: [2023-02-03 16:35:13,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 9: [2023-02-03 16:35:13,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 6: [2023-02-03 16:35:13,485] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 13: [2023-02-03 16:35:13,486] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 13: [2023-02-03 16:35:13,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 5: [2023-02-03 16:35:13,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 13: [2023-02-03 16:35:13,486] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 14: [2023-02-03 16:35:13,486] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 13: [2023-02-03 16:35:13,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 14: [2023-02-03 16:35:13,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 5: [2023-02-03 16:35:13,486] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 5: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 13: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 5: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 5: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 14: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 5: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 5: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 5: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 5: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 5: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 3: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 3: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 5: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 5: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 10: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 7: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 5: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 5: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 5: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 5: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 5: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 3: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 3: [2023-02-03 16:35:13,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 11: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 3: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 3: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 3: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 3: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 5: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 13: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 5: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 6: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 5: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 6: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 6: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 13: [2023-02-03 16:35:13,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 13: [2023-02-03 16:35:13,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 13: [2023-02-03 16:35:13,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 6: [2023-02-03 16:35:13,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 4: [2023-02-03 16:35:13,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 3: [2023-02-03 16:35:13,490] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 4: [2023-02-03 16:35:13,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 4: [2023-02-03 16:35:13,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 10: [2023-02-03 16:35:13,490] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 10: [2023-02-03 16:35:13,490] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 10: [2023-02-03 16:35:13,490] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 3: [2023-02-03 16:35:13,490] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 3: [2023-02-03 16:35:13,490] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 3: [2023-02-03 16:35:13,490] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 4: [2023-02-03 16:35:13,490] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 2: [2023-02-03 16:35:13,492] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 2: [2023-02-03 16:35:13,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 3: [2023-02-03 16:35:13,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 9: [2023-02-03 16:35:13,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 13: [2023-02-03 16:35:13,492] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,492] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 13: [2023-02-03 16:35:13,492] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 13: [2023-02-03 16:35:13,492] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 13: [2023-02-03 16:35:13,492] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 3: [2023-02-03 16:35:13,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 3: [2023-02-03 16:35:13,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 13: [2023-02-03 16:35:13,492] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 3: [2023-02-03 16:35:13,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt... 6: [2023-02-03 16:35:13,492] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 9: [2023-02-03 16:35:13,492] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 2: [2023-02-03 16:35:13,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 9: [2023-02-03 16:35:13,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 2: [2023-02-03 16:35:13,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 8: [2023-02-03 16:35:13,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 9: [2023-02-03 16:35:13,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 14: [2023-02-03 16:35:13,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 2: [2023-02-03 16:35:13,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 8: [2023-02-03 16:35:13,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 8: [2023-02-03 16:35:13,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 11: [2023-02-03 16:35:13,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 14: [2023-02-03 16:35:13,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 14: [2023-02-03 16:35:13,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 8: [2023-02-03 16:35:13,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 11: [2023-02-03 16:35:13,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 14: [2023-02-03 16:35:13,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 14: [2023-02-03 16:35:13,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,494] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,494] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 14: [2023-02-03 16:35:13,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 14: [2023-02-03 16:35:13,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 11: [2023-02-03 16:35:13,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 2: [2023-02-03 16:35:13,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 2: [2023-02-03 16:35:13,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 6: [2023-02-03 16:35:13,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 6: [2023-02-03 16:35:13,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 12: [2023-02-03 16:35:13,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 12: [2023-02-03 16:35:13,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 6: [2023-02-03 16:35:13,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 15: [2023-02-03 16:35:13,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 6: [2023-02-03 16:35:13,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 6: [2023-02-03 16:35:13,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 9: [2023-02-03 16:35:13,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 15: [2023-02-03 16:35:13,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 15: [2023-02-03 16:35:13,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 6: [2023-02-03 16:35:13,496] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 6: [2023-02-03 16:35:13,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 12: [2023-02-03 16:35:13,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 14: [2023-02-03 16:35:13,496] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,496] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 2: [2023-02-03 16:35:13,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 5: [2023-02-03 16:35:13,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 6: [2023-02-03 16:35:13,496] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 2: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 2: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 2: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 5: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 10: [2023-02-03 16:35:13,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 5: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 2: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 8: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 2: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 2: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 2: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 4: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 11: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 11: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 7: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 7: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 2: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 11: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 10: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 2: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 10: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 8: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 8: [2023-02-03 16:35:13,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 10: [2023-02-03 16:35:13,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 8: [2023-02-03 16:35:13,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 15: [2023-02-03 16:35:13,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 8: [2023-02-03 16:35:13,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 8: [2023-02-03 16:35:13,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 4: [2023-02-03 16:35:13,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 8: [2023-02-03 16:35:13,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 8: [2023-02-03 16:35:13,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 8: [2023-02-03 16:35:13,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 8: [2023-02-03 16:35:13,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 8: [2023-02-03 16:35:13,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 15: [2023-02-03 16:35:13,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 8: [2023-02-03 16:35:13,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 15: [2023-02-03 16:35:13,499] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 8: [2023-02-03 16:35:13,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 15: [2023-02-03 16:35:13,499] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 8: [2023-02-03 16:35:13,499] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 8: [2023-02-03 16:35:13,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 14: [2023-02-03 16:35:13,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 9: [2023-02-03 16:35:13,499] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 9: [2023-02-03 16:35:13,499] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 8: [2023-02-03 16:35:13,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 9: [2023-02-03 16:35:13,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 9: [2023-02-03 16:35:13,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 9: [2023-02-03 16:35:13,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 9: [2023-02-03 16:35:13,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 9: [2023-02-03 16:35:13,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 9: [2023-02-03 16:35:13,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 15: [2023-02-03 16:35:13,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 7: [2023-02-03 16:35:13,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 10: [2023-02-03 16:35:13,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 10: [2023-02-03 16:35:13,502] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 10: [2023-02-03 16:35:13,502] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 7: [2023-02-03 16:35:13,502] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 12: [2023-02-03 16:35:13,502] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 10: [2023-02-03 16:35:13,503] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 9: [2023-02-03 16:35:13,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 9: [2023-02-03 16:35:13,503] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 9: [2023-02-03 16:35:13,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 9: [2023-02-03 16:35:13,504] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 14: [2023-02-03 16:35:13,504] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 6: [2023-02-03 16:35:13,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 10: [2023-02-03 16:35:13,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 7: [2023-02-03 16:35:13,505] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 7: [2023-02-03 16:35:13,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 6: [2023-02-03 16:35:13,505] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 6: [2023-02-03 16:35:13,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 4: [2023-02-03 16:35:13,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 10: [2023-02-03 16:35:13,505] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 10: [2023-02-03 16:35:13,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 6: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 7: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 7: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 10: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 7: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 15: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 15: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 15: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 15: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 15: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 15: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 15: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 15: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 15: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 15: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 15: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 15: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 15: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 15: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 15: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 15: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 10: [2023-02-03 16:35:13,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 10: [2023-02-03 16:35:13,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 15: [2023-02-03 16:35:13,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 10: [2023-02-03 16:35:13,507] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 15: [2023-02-03 16:35:13,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 15: [2023-02-03 16:35:13,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 15: [2023-02-03 16:35:13,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 15: [2023-02-03 16:35:13,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 15: [2023-02-03 16:35:13,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 15: [2023-02-03 16:35:13,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 10: [2023-02-03 16:35:13,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 15: [2023-02-03 16:35:13,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,508] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 10: [2023-02-03 16:35:13,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 10: [2023-02-03 16:35:13,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 12: [2023-02-03 16:35:13,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 6: [2023-02-03 16:35:13,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 10: [2023-02-03 16:35:13,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 10: [2023-02-03 16:35:13,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 10: [2023-02-03 16:35:13,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 10: [2023-02-03 16:35:13,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 6: [2023-02-03 16:35:13,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 10: [2023-02-03 16:35:13,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 6: [2023-02-03 16:35:13,510] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 10: [2023-02-03 16:35:13,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 12: [2023-02-03 16:35:13,510] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 6: [2023-02-03 16:35:13,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 12: [2023-02-03 16:35:13,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 12: [2023-02-03 16:35:13,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 4: [2023-02-03 16:35:13,510] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 12: [2023-02-03 16:35:13,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 12: [2023-02-03 16:35:13,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 12: [2023-02-03 16:35:13,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 4: [2023-02-03 16:35:13,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 7: [2023-02-03 16:35:13,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 7: [2023-02-03 16:35:13,512] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 7: [2023-02-03 16:35:13,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 7: [2023-02-03 16:35:13,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 9: [2023-02-03 16:35:13,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 7: [2023-02-03 16:35:13,512] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 6: [2023-02-03 16:35:13,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 9: [2023-02-03 16:35:13,512] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 9: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 13: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 7: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 7: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 6: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 13: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 6: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 6: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 9: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 7: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 4: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 13: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 7: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 13: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 6: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 6: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 6: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 6: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 13: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 13: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 13: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 7: [2023-02-03 16:35:13,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 8: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_66_mp_rank_00_optim_states.pt... 6: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 1: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_11_mp_rank_00_optim_states.pt... 1: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_15_mp_rank_00_optim_states.pt... 8: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_65_mp_rank_00_optim_states.pt... 8: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_70_mp_rank_00_optim_states.pt... 8: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_64_mp_rank_00_optim_states.pt... 8: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_68_mp_rank_00_optim_states.pt... 6: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 1: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_12_mp_rank_00_optim_states.pt... 8: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_67_mp_rank_00_optim_states.pt... 1: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_14_mp_rank_00_optim_states.pt... 1: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_13_mp_rank_00_optim_states.pt... 8: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_71_mp_rank_00_optim_states.pt... 1: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_9_mp_rank_00_optim_states.pt... 8: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_69_mp_rank_00_optim_states.pt... 6: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 6: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 1: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_10_mp_rank_00_optim_states.pt... 1: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_8_mp_rank_00_optim_states.pt... 13: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 13: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 0: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 12: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 6: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 6: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 12: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 6: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 12: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 12: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 12: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 13: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 13: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 13: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 12: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 6: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 13: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 14: [2023-02-03 16:35:13,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 13: [2023-02-03 16:35:13,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 12: [2023-02-03 16:35:13,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 13: [2023-02-03 16:35:13,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 13: [2023-02-03 16:35:13,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 14: [2023-02-03 16:35:13,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 12: [2023-02-03 16:35:13,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 12: [2023-02-03 16:35:13,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 13: [2023-02-03 16:35:13,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 13: [2023-02-03 16:35:13,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 13: [2023-02-03 16:35:13,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 14: [2023-02-03 16:35:13,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 14: [2023-02-03 16:35:13,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 14: [2023-02-03 16:35:13,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 14: [2023-02-03 16:35:13,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 13: [2023-02-03 16:35:13,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 14: [2023-02-03 16:35:13,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 14: [2023-02-03 16:35:13,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 14: [2023-02-03 16:35:13,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 4: [2023-02-03 16:35:13,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 14: [2023-02-03 16:35:13,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 14: [2023-02-03 16:35:13,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 14: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 12: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 0: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 4: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 4: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 4: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 4: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 12: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 12: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 4: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 4: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 4: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 9: [2023-02-03 16:35:13,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_73_mp_rank_00_optim_states.pt... 9: [2023-02-03 16:35:13,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_74_mp_rank_00_optim_states.pt... 9: [2023-02-03 16:35:13,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_76_mp_rank_00_optim_states.pt... 9: [2023-02-03 16:35:13,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_75_mp_rank_00_optim_states.pt... 9: [2023-02-03 16:35:13,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_78_mp_rank_00_optim_states.pt... 9: [2023-02-03 16:35:13,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_79_mp_rank_00_optim_states.pt... 9: [2023-02-03 16:35:13,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_77_mp_rank_00_optim_states.pt... 12: [2023-02-03 16:35:13,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 14: [2023-02-03 16:35:13,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 4: [2023-02-03 16:35:13,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 4: [2023-02-03 16:35:13,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 14: [2023-02-03 16:35:13,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 14: [2023-02-03 16:35:13,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 14: [2023-02-03 16:35:13,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 9: [2023-02-03 16:35:13,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_72_mp_rank_00_optim_states.pt... 14: [2023-02-03 16:35:13,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 7: [2023-02-03 16:35:13,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 14: [2023-02-03 16:35:13,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 14: [2023-02-03 16:35:13,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 7: [2023-02-03 16:35:13,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 7: [2023-02-03 16:35:13,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 14: [2023-02-03 16:35:13,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 7: [2023-02-03 16:35:13,520] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 14: [2023-02-03 16:35:13,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 14: [2023-02-03 16:35:13,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 14: [2023-02-03 16:35:13,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 14: [2023-02-03 16:35:13,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 3: [2023-02-03 16:35:13,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 3: [2023-02-03 16:35:13,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 3: [2023-02-03 16:35:13,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 7: [2023-02-03 16:35:13,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 12: [2023-02-03 16:35:13,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 7: [2023-02-03 16:35:13,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 12: [2023-02-03 16:35:13,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 12: [2023-02-03 16:35:13,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 12: [2023-02-03 16:35:13,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 7: [2023-02-03 16:35:13,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 7: [2023-02-03 16:35:13,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 7: [2023-02-03 16:35:13,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 7: [2023-02-03 16:35:13,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 7: [2023-02-03 16:35:13,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 7: [2023-02-03 16:35:13,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 12: [2023-02-03 16:35:13,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 10: [2023-02-03 16:35:13,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_80_mp_rank_00_optim_states.pt... 10: [2023-02-03 16:35:13,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_82_mp_rank_00_optim_states.pt... 10: [2023-02-03 16:35:13,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_83_mp_rank_00_optim_states.pt... 10: [2023-02-03 16:35:13,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_85_mp_rank_00_optim_states.pt... 10: [2023-02-03 16:35:13,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_81_mp_rank_00_optim_states.pt... 10: [2023-02-03 16:35:13,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_87_mp_rank_00_optim_states.pt... 10: [2023-02-03 16:35:13,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_86_mp_rank_00_optim_states.pt... 10: [2023-02-03 16:35:13,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_84_mp_rank_00_optim_states.pt... 12: [2023-02-03 16:35:13,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 12: [2023-02-03 16:35:13,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 12: [2023-02-03 16:35:13,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 12: [2023-02-03 16:35:13,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 12: [2023-02-03 16:35:13,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 12: [2023-02-03 16:35:13,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 12: [2023-02-03 16:35:13,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 12: [2023-02-03 16:35:13,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 12: [2023-02-03 16:35:13,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 12: [2023-02-03 16:35:13,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 12: [2023-02-03 16:35:13,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 7: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 11: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_90_mp_rank_00_optim_states.pt... 11: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_88_mp_rank_00_optim_states.pt... 11: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_94_mp_rank_00_optim_states.pt... 11: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_93_mp_rank_00_optim_states.pt... 11: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_89_mp_rank_00_optim_states.pt... 11: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_92_mp_rank_00_optim_states.pt... 11: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_91_mp_rank_00_optim_states.pt... 11: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_95_mp_rank_00_optim_states.pt... 7: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 7: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 7: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 5: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_44_mp_rank_00_optim_states.pt... 5: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_46_mp_rank_00_optim_states.pt... 3: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 5: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_47_mp_rank_00_optim_states.pt... 5: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_42_mp_rank_00_optim_states.pt... 5: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_40_mp_rank_00_optim_states.pt... 5: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_41_mp_rank_00_optim_states.pt... 5: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_45_mp_rank_00_optim_states.pt... 5: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_43_mp_rank_00_optim_states.pt... 7: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 7: [2023-02-03 16:35:13,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 7: [2023-02-03 16:35:13,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 7: [2023-02-03 16:35:13,531] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 2: [2023-02-03 16:35:13,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_16_mp_rank_00_optim_states.pt... 2: [2023-02-03 16:35:13,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_23_mp_rank_00_optim_states.pt... 2: [2023-02-03 16:35:13,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_19_mp_rank_00_optim_states.pt... 2: [2023-02-03 16:35:13,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_22_mp_rank_00_optim_states.pt... 2: [2023-02-03 16:35:13,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_17_mp_rank_00_optim_states.pt... 2: [2023-02-03 16:35:13,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_20_mp_rank_00_optim_states.pt... 2: [2023-02-03 16:35:13,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_18_mp_rank_00_optim_states.pt... 2: [2023-02-03 16:35:13,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_21_mp_rank_00_optim_states.pt... 3: [2023-02-03 16:35:13,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 3: [2023-02-03 16:35:13,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 3: [2023-02-03 16:35:13,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 3: [2023-02-03 16:35:13,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 0: [2023-02-03 16:35:13,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 0: [2023-02-03 16:35:13,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 0: [2023-02-03 16:35:13,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 0: [2023-02-03 16:35:13,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 0: [2023-02-03 16:35:13,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_30-model_00-model_states.pt. 13: [2023-02-03 16:35:13,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_105_mp_rank_00_optim_states.pt... 13: [2023-02-03 16:35:13,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_106_mp_rank_00_optim_states.pt... 13: [2023-02-03 16:35:13,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_109_mp_rank_00_optim_states.pt... 13: [2023-02-03 16:35:13,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_104_mp_rank_00_optim_states.pt... 13: [2023-02-03 16:35:13,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_107_mp_rank_00_optim_states.pt... 13: [2023-02-03 16:35:13,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_110_mp_rank_00_optim_states.pt... 13: [2023-02-03 16:35:13,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_108_mp_rank_00_optim_states.pt... 13: [2023-02-03 16:35:13,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_111_mp_rank_00_optim_states.pt... 0: [2023-02-03 16:35:13,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 3: [2023-02-03 16:35:13,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 0: [2023-02-03 16:35:13,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 0: [2023-02-03 16:35:13,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 0: [2023-02-03 16:35:13,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 0: [2023-02-03 16:35:13,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 3: [2023-02-03 16:35:13,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 3: [2023-02-03 16:35:13,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 6: [2023-02-03 16:35:13,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_52_mp_rank_00_optim_states.pt... 6: [2023-02-03 16:35:13,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_50_mp_rank_00_optim_states.pt... 6: [2023-02-03 16:35:13,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_48_mp_rank_00_optim_states.pt... 6: [2023-02-03 16:35:13,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_53_mp_rank_00_optim_states.pt... 6: [2023-02-03 16:35:13,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_54_mp_rank_00_optim_states.pt... 6: [2023-02-03 16:35:13,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_49_mp_rank_00_optim_states.pt... 6: [2023-02-03 16:35:13,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_55_mp_rank_00_optim_states.pt... 6: [2023-02-03 16:35:13,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_51_mp_rank_00_optim_states.pt... 3: [2023-02-03 16:35:13,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 3: [2023-02-03 16:35:13,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 3: [2023-02-03 16:35:13,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 3: [2023-02-03 16:35:13,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 3: [2023-02-03 16:35:13,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 3: [2023-02-03 16:35:13,546] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 3: [2023-02-03 16:35:13,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 3: [2023-02-03 16:35:13,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 15: [2023-02-03 16:35:13,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_121_mp_rank_00_optim_states.pt... 15: [2023-02-03 16:35:13,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_122_mp_rank_00_optim_states.pt... 15: [2023-02-03 16:35:13,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_126_mp_rank_00_optim_states.pt... 15: [2023-02-03 16:35:13,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_120_mp_rank_00_optim_states.pt... 15: [2023-02-03 16:35:13,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_124_mp_rank_00_optim_states.pt... 3: [2023-02-03 16:35:13,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 15: [2023-02-03 16:35:13,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_123_mp_rank_00_optim_states.pt... 15: [2023-02-03 16:35:13,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_125_mp_rank_00_optim_states.pt... 15: [2023-02-03 16:35:13,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_127_mp_rank_00_optim_states.pt... 3: [2023-02-03 16:35:13,550] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 3: [2023-02-03 16:35:13,550] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 3: [2023-02-03 16:35:13,550] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 3: [2023-02-03 16:35:13,550] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 3: [2023-02-03 16:35:13,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 3: [2023-02-03 16:35:13,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 3: [2023-02-03 16:35:13,550] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 0: > using checkpoint value 0.0002 for learning rate 3: [2023-02-03 16:35:13,550] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 0: > using checkpoint value 2e-05 for minimum learning rate 0: > using checkpoint value 321098 for warmup iterations 0: > using checkpoint value 32109839 for total number of iterations 0: > using checkpoint value cosine for decay style 3: [2023-02-03 16:35:13,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 3: [2023-02-03 16:35:13,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 3: [2023-02-03 16:35:13,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 3: [2023-02-03 16:35:13,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 3: [2023-02-03 16:35:13,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 3: [2023-02-03 16:35:13,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 14: [2023-02-03 16:35:13,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_116_mp_rank_00_optim_states.pt... 14: [2023-02-03 16:35:13,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_117_mp_rank_00_optim_states.pt... 14: [2023-02-03 16:35:13,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_113_mp_rank_00_optim_states.pt... 14: [2023-02-03 16:35:13,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_112_mp_rank_00_optim_states.pt... 14: [2023-02-03 16:35:13,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_115_mp_rank_00_optim_states.pt... 14: [2023-02-03 16:35:13,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_118_mp_rank_00_optim_states.pt... 14: [2023-02-03 16:35:13,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_119_mp_rank_00_optim_states.pt... 14: [2023-02-03 16:35:13,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_114_mp_rank_00_optim_states.pt... 3: [2023-02-03 16:35:13,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 3: [2023-02-03 16:35:13,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 3: [2023-02-03 16:35:13,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 3: [2023-02-03 16:35:13,553] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 3: [2023-02-03 16:35:13,553] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 3: [2023-02-03 16:35:13,553] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 0: [2023-02-03 16:35:13,553] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 0: [2023-02-03 16:35:13,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 0: [2023-02-03 16:35:13,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 0: [2023-02-03 16:35:13,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 0: [2023-02-03 16:35:13,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 0: [2023-02-03 16:35:13,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 0: [2023-02-03 16:35:13,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 0: [2023-02-03 16:35:13,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 0: [2023-02-03 16:35:13,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 0: [2023-02-03 16:35:13,560] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt... 0: [2023-02-03 16:35:13,560] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/layer_32-model_00-model_states.pt. 12: [2023-02-03 16:35:13,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_96_mp_rank_00_optim_states.pt... 12: [2023-02-03 16:35:13,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_101_mp_rank_00_optim_states.pt... 12: [2023-02-03 16:35:13,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_99_mp_rank_00_optim_states.pt... 12: [2023-02-03 16:35:13,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_102_mp_rank_00_optim_states.pt... 12: [2023-02-03 16:35:13,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_100_mp_rank_00_optim_states.pt... 12: [2023-02-03 16:35:13,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_98_mp_rank_00_optim_states.pt... 12: [2023-02-03 16:35:13,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_97_mp_rank_00_optim_states.pt... 12: [2023-02-03 16:35:13,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_103_mp_rank_00_optim_states.pt... 7: [2023-02-03 16:35:13,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_58_mp_rank_00_optim_states.pt... 7: [2023-02-03 16:35:13,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_62_mp_rank_00_optim_states.pt... 7: [2023-02-03 16:35:13,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_57_mp_rank_00_optim_states.pt... 7: [2023-02-03 16:35:13,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_60_mp_rank_00_optim_states.pt... 7: [2023-02-03 16:35:13,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_59_mp_rank_00_optim_states.pt... 7: [2023-02-03 16:35:13,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_56_mp_rank_00_optim_states.pt... 7: [2023-02-03 16:35:13,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_61_mp_rank_00_optim_states.pt... 7: [2023-02-03 16:35:13,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_63_mp_rank_00_optim_states.pt... 4: [2023-02-03 16:35:13,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_33_mp_rank_00_optim_states.pt... 4: [2023-02-03 16:35:13,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_34_mp_rank_00_optim_states.pt... 4: [2023-02-03 16:35:13,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_36_mp_rank_00_optim_states.pt... 4: [2023-02-03 16:35:13,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_35_mp_rank_00_optim_states.pt... 4: [2023-02-03 16:35:13,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_38_mp_rank_00_optim_states.pt... 4: [2023-02-03 16:35:13,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_37_mp_rank_00_optim_states.pt... 4: [2023-02-03 16:35:13,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_32_mp_rank_00_optim_states.pt... 4: [2023-02-03 16:35:13,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_39_mp_rank_00_optim_states.pt... 3: [2023-02-03 16:35:13,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_24_mp_rank_00_optim_states.pt... 3: [2023-02-03 16:35:13,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_27_mp_rank_00_optim_states.pt... 3: [2023-02-03 16:35:13,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_25_mp_rank_00_optim_states.pt... 3: [2023-02-03 16:35:13,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_31_mp_rank_00_optim_states.pt... 3: [2023-02-03 16:35:13,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_29_mp_rank_00_optim_states.pt... 3: [2023-02-03 16:35:13,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_26_mp_rank_00_optim_states.pt... 3: [2023-02-03 16:35:13,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_30_mp_rank_00_optim_states.pt... 3: [2023-02-03 16:35:13,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_28_mp_rank_00_optim_states.pt... 0: [2023-02-03 16:35:13,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt... 0: [2023-02-03 16:35:13,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_1_mp_rank_00_optim_states.pt... 0: [2023-02-03 16:35:13,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_2_mp_rank_00_optim_states.pt... 0: [2023-02-03 16:35:13,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_5_mp_rank_00_optim_states.pt... 0: [2023-02-03 16:35:13,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_4_mp_rank_00_optim_states.pt... 0: [2023-02-03 16:35:13,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_3_mp_rank_00_optim_states.pt... 0: [2023-02-03 16:35:13,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_6_mp_rank_00_optim_states.pt... 0: [2023-02-03 16:35:13,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_7_mp_rank_00_optim_states.pt... 8: [2023-02-03 16:35:13,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_65_mp_rank_00_optim_states.pt. 8: [2023-02-03 16:35:13,867] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 65 5: [2023-02-03 16:35:13,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_40_mp_rank_00_optim_states.pt. 5: [2023-02-03 16:35:13,937] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 40 4: [2023-02-03 16:35:13,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_32_mp_rank_00_optim_states.pt. 2: [2023-02-03 16:35:13,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_16_mp_rank_00_optim_states.pt. 4: [2023-02-03 16:35:13,938] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 32 2: [2023-02-03 16:35:13,939] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 16 1: [2023-02-03 16:35:13,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_12_mp_rank_00_optim_states.pt. 1: [2023-02-03 16:35:13,941] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 12 9: [2023-02-03 16:35:13,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_76_mp_rank_00_optim_states.pt. 9: [2023-02-03 16:35:13,947] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 76 1: [2023-02-03 16:35:13,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_14_mp_rank_00_optim_states.pt. 1: [2023-02-03 16:35:13,953] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 14 2: [2023-02-03 16:35:13,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_17_mp_rank_00_optim_states.pt. 2: [2023-02-03 16:35:13,956] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 17 9: [2023-02-03 16:35:13,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_79_mp_rank_00_optim_states.pt. 9: [2023-02-03 16:35:13,958] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 79 0: [2023-02-03 16:35:13,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_7_mp_rank_00_optim_states.pt. 0: [2023-02-03 16:35:13,970] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 7 9: [2023-02-03 16:35:13,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_78_mp_rank_00_optim_states.pt. 9: [2023-02-03 16:35:13,972] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 78 8: [2023-02-03 16:35:13,972] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 65 4: [2023-02-03 16:35:13,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_38_mp_rank_00_optim_states.pt. 4: [2023-02-03 16:35:13,973] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 38 5: [2023-02-03 16:35:13,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_45_mp_rank_00_optim_states.pt. 5: [2023-02-03 16:35:13,976] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 45 14: [2023-02-03 16:35:13,984] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_112_mp_rank_00_optim_states.pt. 14: [2023-02-03 16:35:13,984] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 112 6: [2023-02-03 16:35:13,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_55_mp_rank_00_optim_states.pt. 6: [2023-02-03 16:35:13,985] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 55 2: [2023-02-03 16:35:13,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_21_mp_rank_00_optim_states.pt. 2: [2023-02-03 16:35:13,988] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 21 5: [2023-02-03 16:35:13,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_44_mp_rank_00_optim_states.pt. 5: [2023-02-03 16:35:13,991] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 44 6: [2023-02-03 16:35:13,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_54_mp_rank_00_optim_states.pt. 6: [2023-02-03 16:35:13,993] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 54 4: [2023-02-03 16:35:14,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_33_mp_rank_00_optim_states.pt. 4: [2023-02-03 16:35:14,005] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 33 1: [2023-02-03 16:35:14,009] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_11_mp_rank_00_optim_states.pt. 1: [2023-02-03 16:35:14,009] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 11 2: [2023-02-03 16:35:14,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_22_mp_rank_00_optim_states.pt. 2: [2023-02-03 16:35:14,014] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 22 8: [2023-02-03 16:35:14,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_64_mp_rank_00_optim_states.pt. 5: [2023-02-03 16:35:14,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_43_mp_rank_00_optim_states.pt. 5: [2023-02-03 16:35:14,016] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 43 8: [2023-02-03 16:35:14,016] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 64 9: [2023-02-03 16:35:14,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_72_mp_rank_00_optim_states.pt. 9: [2023-02-03 16:35:14,022] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 72 0: [2023-02-03 16:35:14,023] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 7 1: [2023-02-03 16:35:14,023] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 14 6: [2023-02-03 16:35:14,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_50_mp_rank_00_optim_states.pt. 6: [2023-02-03 16:35:14,025] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 50 5: [2023-02-03 16:35:14,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_47_mp_rank_00_optim_states.pt. 5: [2023-02-03 16:35:14,025] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 47 14: [2023-02-03 16:35:14,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_118_mp_rank_00_optim_states.pt. 14: [2023-02-03 16:35:14,028] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 118 11: [2023-02-03 16:35:14,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_88_mp_rank_00_optim_states.pt. 11: [2023-02-03 16:35:14,029] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 88 4: [2023-02-03 16:35:14,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_35_mp_rank_00_optim_states.pt. 4: [2023-02-03 16:35:14,035] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 35 14: [2023-02-03 16:35:14,037] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_114_mp_rank_00_optim_states.pt. 14: [2023-02-03 16:35:14,037] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 114 6: [2023-02-03 16:35:14,038] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_52_mp_rank_00_optim_states.pt. 6: [2023-02-03 16:35:14,038] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 52 8: [2023-02-03 16:35:14,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_66_mp_rank_00_optim_states.pt. 8: [2023-02-03 16:35:14,039] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 66 8: [2023-02-03 16:35:14,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_70_mp_rank_00_optim_states.pt. 8: [2023-02-03 16:35:14,043] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 70 1: [2023-02-03 16:35:14,043] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 12 8: [2023-02-03 16:35:14,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_67_mp_rank_00_optim_states.pt. 8: [2023-02-03 16:35:14,043] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 67 5: [2023-02-03 16:35:14,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_42_mp_rank_00_optim_states.pt. 14: [2023-02-03 16:35:14,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_116_mp_rank_00_optim_states.pt. 14: [2023-02-03 16:35:14,046] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 116 5: [2023-02-03 16:35:14,046] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 42 9: [2023-02-03 16:35:14,047] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 76 6: [2023-02-03 16:35:14,048] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_53_mp_rank_00_optim_states.pt. 9: [2023-02-03 16:35:14,048] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_73_mp_rank_00_optim_states.pt. 6: [2023-02-03 16:35:14,048] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 53 9: [2023-02-03 16:35:14,049] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 73 1: [2023-02-03 16:35:14,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_9_mp_rank_00_optim_states.pt. 4: [2023-02-03 16:35:14,049] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 38 1: [2023-02-03 16:35:14,049] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 9 8: [2023-02-03 16:35:14,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_71_mp_rank_00_optim_states.pt. 8: [2023-02-03 16:35:14,051] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 71 6: [2023-02-03 16:35:14,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_48_mp_rank_00_optim_states.pt. 6: [2023-02-03 16:35:14,054] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 48 0: [2023-02-03 16:35:14,054] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_2_mp_rank_00_optim_states.pt. 0: [2023-02-03 16:35:14,055] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 2 0: [2023-02-03 16:35:14,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_1_mp_rank_00_optim_states.pt. 0: [2023-02-03 16:35:14,056] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 1 2: [2023-02-03 16:35:14,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_19_mp_rank_00_optim_states.pt. 2: [2023-02-03 16:35:14,057] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 19 9: [2023-02-03 16:35:14,057] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 79 14: [2023-02-03 16:35:14,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_119_mp_rank_00_optim_states.pt. 14: [2023-02-03 16:35:14,058] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 119 4: [2023-02-03 16:35:14,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_34_mp_rank_00_optim_states.pt. 4: [2023-02-03 16:35:14,065] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 34 5: [2023-02-03 16:35:14,072] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 40 5: [2023-02-03 16:35:14,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_41_mp_rank_00_optim_states.pt. 5: [2023-02-03 16:35:14,073] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 41 14: [2023-02-03 16:35:14,078] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_115_mp_rank_00_optim_states.pt. 14: [2023-02-03 16:35:14,078] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 115 5: [2023-02-03 16:35:14,084] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 44 12: [2023-02-03 16:35:14,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_96_mp_rank_00_optim_states.pt. 15: [2023-02-03 16:35:14,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_125_mp_rank_00_optim_states.pt. 11: [2023-02-03 16:35:14,079] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 88 12: [2023-02-03 16:35:14,087] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 96 15: [2023-02-03 16:35:14,086] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 125 11: [2023-02-03 16:35:14,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_95_mp_rank_00_optim_states.pt. 6: [2023-02-03 16:35:14,089] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 54 11: [2023-02-03 16:35:14,089] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 95 8: [2023-02-03 16:35:14,091] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 64 8: [2023-02-03 16:35:14,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_68_mp_rank_00_optim_states.pt. 8: [2023-02-03 16:35:14,095] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 68 0: [2023-02-03 16:35:14,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_5_mp_rank_00_optim_states.pt. 0: [2023-02-03 16:35:14,097] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 5 1: [2023-02-03 16:35:14,097] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_15_mp_rank_00_optim_states.pt. 2: [2023-02-03 16:35:14,098] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_20_mp_rank_00_optim_states.pt. 1: [2023-02-03 16:35:14,098] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 15 2: [2023-02-03 16:35:14,098] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 20 14: [2023-02-03 16:35:14,099] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_117_mp_rank_00_optim_states.pt. 14: [2023-02-03 16:35:14,099] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 117 2: [2023-02-03 16:35:14,100] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 16 10: [2023-02-03 16:35:14,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_80_mp_rank_00_optim_states.pt. 10: [2023-02-03 16:35:14,102] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 80 8: [2023-02-03 16:35:14,103] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_69_mp_rank_00_optim_states.pt. 8: [2023-02-03 16:35:14,103] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 69 6: [2023-02-03 16:35:14,107] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 55 5: [2023-02-03 16:35:14,110] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 45 13: [2023-02-03 16:35:14,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_104_mp_rank_00_optim_states.pt. 13: [2023-02-03 16:35:14,113] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 104 1: [2023-02-03 16:35:14,125] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 9 0: [2023-02-03 16:35:14,126] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_3_mp_rank_00_optim_states.pt. 0: [2023-02-03 16:35:14,126] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 3 15: [2023-02-03 16:35:14,130] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 125 1: [2023-02-03 16:35:14,130] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_13_mp_rank_00_optim_states.pt. 1: [2023-02-03 16:35:14,131] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 13 0: [2023-02-03 16:35:14,131] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 2 13: [2023-02-03 16:35:14,132] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_110_mp_rank_00_optim_states.pt. 13: [2023-02-03 16:35:14,132] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 110 1: [2023-02-03 16:35:14,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_10_mp_rank_00_optim_states.pt. 1: [2023-02-03 16:35:14,137] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 10 9: [2023-02-03 16:35:14,139] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 78 6: [2023-02-03 16:35:14,142] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 50 1: [2023-02-03 16:35:14,142] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 11 15: [2023-02-03 16:35:14,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_120_mp_rank_00_optim_states.pt. 15: [2023-02-03 16:35:14,145] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 120 12: [2023-02-03 16:35:14,145] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 96 9: [2023-02-03 16:35:14,149] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_75_mp_rank_00_optim_states.pt. 9: [2023-02-03 16:35:14,150] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 75 10: [2023-02-03 16:35:14,148] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 80 6: [2023-02-03 16:35:14,151] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_49_mp_rank_00_optim_states.pt. 6: [2023-02-03 16:35:14,151] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 49 15: [2023-02-03 16:35:14,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_123_mp_rank_00_optim_states.pt. 15: [2023-02-03 16:35:14,153] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 123 10: [2023-02-03 16:35:14,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_85_mp_rank_00_optim_states.pt. 10: [2023-02-03 16:35:14,155] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 85 11: [2023-02-03 16:35:14,157] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 95 5: [2023-02-03 16:35:14,157] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 41 0: [2023-02-03 16:35:14,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt. 0: [2023-02-03 16:35:14,161] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 0 13: [2023-02-03 16:35:14,165] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_109_mp_rank_00_optim_states.pt. 13: [2023-02-03 16:35:14,165] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 109 0: [2023-02-03 16:35:14,165] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_4_mp_rank_00_optim_states.pt. 0: [2023-02-03 16:35:14,166] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 4 10: [2023-02-03 16:35:14,166] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_84_mp_rank_00_optim_states.pt. 10: [2023-02-03 16:35:14,166] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 84 14: [2023-02-03 16:35:14,167] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 112 14: [2023-02-03 16:35:14,170] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 116 14: [2023-02-03 16:35:14,171] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 118 11: [2023-02-03 16:35:14,171] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_90_mp_rank_00_optim_states.pt. 11: [2023-02-03 16:35:14,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_94_mp_rank_00_optim_states.pt. 11: [2023-02-03 16:35:14,172] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 90 11: [2023-02-03 16:35:14,172] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 94 0: [2023-02-03 16:35:14,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_6_mp_rank_00_optim_states.pt. 7: [2023-02-03 16:35:14,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_56_mp_rank_00_optim_states.pt. 0: [2023-02-03 16:35:14,173] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 6 13: [2023-02-03 16:35:14,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_105_mp_rank_00_optim_states.pt. 7: [2023-02-03 16:35:14,173] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 56 13: [2023-02-03 16:35:14,173] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 105 12: [2023-02-03 16:35:14,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_99_mp_rank_00_optim_states.pt. 12: [2023-02-03 16:35:14,175] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 99 14: [2023-02-03 16:35:14,177] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 114 5: [2023-02-03 16:35:14,175] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 43 13: [2023-02-03 16:35:14,180] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 104 12: [2023-02-03 16:35:14,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_102_mp_rank_00_optim_states.pt. 12: [2023-02-03 16:35:14,181] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 102 4: [2023-02-03 16:35:14,184] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 35 12: [2023-02-03 16:35:14,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_101_mp_rank_00_optim_states.pt. 12: [2023-02-03 16:35:14,185] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 101 7: [2023-02-03 16:35:14,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_60_mp_rank_00_optim_states.pt. 7: [2023-02-03 16:35:14,193] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 60 1: [2023-02-03 16:35:14,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_8_mp_rank_00_optim_states.pt. 1: [2023-02-03 16:35:14,197] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 8 4: [2023-02-03 16:35:14,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_37_mp_rank_00_optim_states.pt. 4: [2023-02-03 16:35:14,198] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 37 10: [2023-02-03 16:35:14,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_83_mp_rank_00_optim_states.pt. 10: [2023-02-03 16:35:14,200] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 83 0: [2023-02-03 16:35:14,202] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 3 9: [2023-02-03 16:35:14,206] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 72 1: [2023-02-03 16:35:14,207] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 10 10: [2023-02-03 16:35:14,209] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_82_mp_rank_00_optim_states.pt. 10: [2023-02-03 16:35:14,209] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 82 4: [2023-02-03 16:35:14,213] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 32 3: [2023-02-03 16:35:14,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_31_mp_rank_00_optim_states.pt. 3: [2023-02-03 16:35:14,220] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 31 3: [2023-02-03 16:35:14,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_27_mp_rank_00_optim_states.pt. 3: [2023-02-03 16:35:14,220] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 27 13: [2023-02-03 16:35:14,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_108_mp_rank_00_optim_states.pt. 13: [2023-02-03 16:35:14,221] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 108 12: [2023-02-03 16:35:14,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_97_mp_rank_00_optim_states.pt. 12: [2023-02-03 16:35:14,221] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 97 12: [2023-02-03 16:35:14,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_100_mp_rank_00_optim_states.pt. 12: [2023-02-03 16:35:14,222] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 100 15: [2023-02-03 16:35:14,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_127_mp_rank_00_optim_states.pt. 15: [2023-02-03 16:35:14,222] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 127 10: [2023-02-03 16:35:14,223] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 85 2: [2023-02-03 16:35:14,226] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 22 14: [2023-02-03 16:35:14,226] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 119 5: [2023-02-03 16:35:14,226] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 42 12: [2023-02-03 16:35:14,226] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 99 15: [2023-02-03 16:35:14,229] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 120 4: [2023-02-03 16:35:14,231] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 33 7: [2023-02-03 16:35:14,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_63_mp_rank_00_optim_states.pt. 7: [2023-02-03 16:35:14,238] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 63 7: [2023-02-03 16:35:14,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_61_mp_rank_00_optim_states.pt. 7: [2023-02-03 16:35:14,238] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 61 10: [2023-02-03 16:35:14,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_86_mp_rank_00_optim_states.pt. 10: [2023-02-03 16:35:14,239] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 86 6: [2023-02-03 16:35:14,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_51_mp_rank_00_optim_states.pt. 6: [2023-02-03 16:35:14,243] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 51 13: [2023-02-03 16:35:14,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_111_mp_rank_00_optim_states.pt. 13: [2023-02-03 16:35:14,243] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 111 13: [2023-02-03 16:35:14,247] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_106_mp_rank_00_optim_states.pt. 13: [2023-02-03 16:35:14,247] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 106 15: [2023-02-03 16:35:14,247] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 123 10: [2023-02-03 16:35:14,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_81_mp_rank_00_optim_states.pt. 10: [2023-02-03 16:35:14,253] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 81 6: [2023-02-03 16:35:14,253] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 49 12: [2023-02-03 16:35:14,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_98_mp_rank_00_optim_states.pt. 12: [2023-02-03 16:35:14,254] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 98 12: [2023-02-03 16:35:14,257] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 102 7: [2023-02-03 16:35:14,259] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_62_mp_rank_00_optim_states.pt. 7: [2023-02-03 16:35:14,259] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 62 3: [2023-02-03 16:35:14,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_28_mp_rank_00_optim_states.pt. 3: [2023-02-03 16:35:14,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_30_mp_rank_00_optim_states.pt. 3: [2023-02-03 16:35:14,261] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 28 3: [2023-02-03 16:35:14,261] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 30 7: [2023-02-03 16:35:14,261] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 60 1: [2023-02-03 16:35:14,264] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 13 15: [2023-02-03 16:35:14,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_121_mp_rank_00_optim_states.pt. 15: [2023-02-03 16:35:14,266] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 121 7: [2023-02-03 16:35:14,270] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 56 8: [2023-02-03 16:35:14,271] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 66 12: [2023-02-03 16:35:14,271] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 101 14: [2023-02-03 16:35:14,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_113_mp_rank_00_optim_states.pt. 14: [2023-02-03 16:35:14,273] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 113 11: [2023-02-03 16:35:14,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_89_mp_rank_00_optim_states.pt. 11: [2023-02-03 16:35:14,279] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 89 3: [2023-02-03 16:35:14,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_24_mp_rank_00_optim_states.pt. 3: [2023-02-03 16:35:14,285] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 24 10: [2023-02-03 16:35:14,286] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 84 15: [2023-02-03 16:35:14,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_126_mp_rank_00_optim_states.pt. 15: [2023-02-03 16:35:14,289] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 126 9: [2023-02-03 16:35:14,291] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 75 8: [2023-02-03 16:35:14,294] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 68 7: [2023-02-03 16:35:14,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_57_mp_rank_00_optim_states.pt. 2: [2023-02-03 16:35:14,295] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 19 7: [2023-02-03 16:35:14,295] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 57 2: [2023-02-03 16:35:14,296] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 21 7: [2023-02-03 16:35:14,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_58_mp_rank_00_optim_states.pt. 7: [2023-02-03 16:35:14,299] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 58 3: [2023-02-03 16:35:14,300] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 27 10: [2023-02-03 16:35:14,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_87_mp_rank_00_optim_states.pt. 10: [2023-02-03 16:35:14,303] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 87 6: [2023-02-03 16:35:14,303] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 52 11: [2023-02-03 16:35:14,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_93_mp_rank_00_optim_states.pt. 11: [2023-02-03 16:35:14,305] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 93 3: [2023-02-03 16:35:14,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_25_mp_rank_00_optim_states.pt. 3: [2023-02-03 16:35:14,307] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 25 2: [2023-02-03 16:35:14,036] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 17 9: [2023-02-03 16:35:14,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_77_mp_rank_00_optim_states.pt. 5: [2023-02-03 16:35:14,312] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 47 9: [2023-02-03 16:35:14,314] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 77 3: [2023-02-03 16:35:14,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_29_mp_rank_00_optim_states.pt. 3: [2023-02-03 16:35:14,315] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 29 7: [2023-02-03 16:35:14,315] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 61 4: [2023-02-03 16:35:14,316] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 34 15: [2023-02-03 16:35:14,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_122_mp_rank_00_optim_states.pt. 15: [2023-02-03 16:35:14,317] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 122 11: [2023-02-03 16:35:14,321] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 94 12: [2023-02-03 16:35:14,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_103_mp_rank_00_optim_states.pt. 12: [2023-02-03 16:35:14,323] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 103 11: [2023-02-03 16:35:14,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_91_mp_rank_00_optim_states.pt. 11: [2023-02-03 16:35:14,326] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 91 8: [2023-02-03 16:35:14,327] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 69 9: [2023-02-03 16:35:14,327] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 73 11: [2023-02-03 16:35:14,328] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 90 4: [2023-02-03 16:35:14,331] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 37 0: [2023-02-03 16:35:14,333] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 6 3: [2023-02-03 16:35:14,343] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 28 3: [2023-02-03 16:35:14,344] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 30 15: [2023-02-03 16:35:14,347] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 121 1: [2023-02-03 16:35:14,348] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 8 0: [2023-02-03 16:35:14,348] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 5 14: [2023-02-03 16:35:14,351] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 113 3: [2023-02-03 16:35:14,352] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 31 13: [2023-02-03 16:35:14,352] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 108 1: [2023-02-03 16:35:14,356] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 15 15: [2023-02-03 16:35:14,357] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 127 0: [2023-02-03 16:35:14,358] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 1 12: [2023-02-03 16:35:14,359] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 98 5: [2023-02-03 16:35:14,361] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_46_mp_rank_00_optim_states.pt. 5: [2023-02-03 16:35:14,362] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 46 13: [2023-02-03 16:35:14,364] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 105 2: [2023-02-03 16:35:14,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_18_mp_rank_00_optim_states.pt. 2: [2023-02-03 16:35:14,366] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 18 11: [2023-02-03 16:35:14,366] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 93 13: [2023-02-03 16:35:14,367] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 111 13: [2023-02-03 16:35:14,369] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 106 9: [2023-02-03 16:35:14,370] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 77 8: [2023-02-03 16:35:14,374] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 67 8: [2023-02-03 16:35:14,374] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 70 14: [2023-02-03 16:35:14,375] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 117 15: [2023-02-03 16:35:14,377] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 126 11: [2023-02-03 16:35:14,394] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 89 10: [2023-02-03 16:35:14,402] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 87 7: [2023-02-03 16:35:14,406] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_59_mp_rank_00_optim_states.pt. 7: [2023-02-03 16:35:14,407] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 59 5: [2023-02-03 16:35:14,415] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 46 2: [2023-02-03 16:35:14,416] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 20 0: [2023-02-03 16:35:14,422] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 4 13: [2023-02-03 16:35:14,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_107_mp_rank_00_optim_states.pt. 13: [2023-02-03 16:35:14,422] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 107 0: [2023-02-03 16:35:14,422] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 0 0: checkpoint version 3.0 3: [2023-02-03 16:35:14,427] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 24 13: [2023-02-03 16:35:14,433] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 110 6: [2023-02-03 16:35:14,435] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 51 7: [2023-02-03 16:35:14,435] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 57 7: [2023-02-03 16:35:14,435] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 58 7: [2023-02-03 16:35:14,439] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 63 8: [2023-02-03 16:35:14,446] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 71 15: [2023-02-03 16:35:14,453] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 122 15: [2023-02-03 16:35:14,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_124_mp_rank_00_optim_states.pt. 15: [2023-02-03 16:35:14,458] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 124 10: [2023-02-03 16:35:14,459] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 81 7: [2023-02-03 16:35:14,463] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 59 7: [2023-02-03 16:35:14,468] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 62 13: [2023-02-03 16:35:14,483] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 109 12: [2023-02-03 16:35:14,492] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 100 6: [2023-02-03 16:35:14,495] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 48 14: [2023-02-03 16:35:14,500] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 115 12: [2023-02-03 16:35:14,502] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 97 6: [2023-02-03 16:35:14,511] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 53 11: [2023-02-03 16:35:14,516] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 91 2: [2023-02-03 16:35:14,535] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 18 13: [2023-02-03 16:35:14,546] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 107 10: [2023-02-03 16:35:14,558] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 82 3: [2023-02-03 16:35:14,574] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 29 12: [2023-02-03 16:35:14,578] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 103 10: [2023-02-03 16:35:14,586] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 83 3: [2023-02-03 16:35:14,589] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 25 10: [2023-02-03 16:35:14,661] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 86 15: [2023-02-03 16:35:14,665] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 124 3: [2023-02-03 16:35:16,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_26_mp_rank_00_optim_states.pt. 3: [2023-02-03 16:35:16,869] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 26 11: [2023-02-03 16:35:16,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_92_mp_rank_00_optim_states.pt. 11: [2023-02-03 16:35:16,883] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 92 11: [2023-02-03 16:35:16,931] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 92 3: [2023-02-03 16:35:16,942] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 26 9: [2023-02-03 16:35:17,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_74_mp_rank_00_optim_states.pt. 9: [2023-02-03 16:35:17,590] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 74 4: [2023-02-03 16:35:17,633] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_39_mp_rank_00_optim_states.pt. 4: [2023-02-03 16:35:17,633] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 39 9: [2023-02-03 16:35:17,682] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 74 4: [2023-02-03 16:35:17,750] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 39 2: [2023-02-03 16:35:17,772] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_23_mp_rank_00_optim_states.pt. 2: [2023-02-03 16:35:17,772] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 23 4: [2023-02-03 16:35:17,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-1b5-66b/global_step125429/bf16_zero_pp_rank_36_mp_rank_00_optim_states.pt. 4: [2023-02-03 16:35:17,785] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 128 ZeRO state_dicts for rank 36 4: [2023-02-03 16:35:17,875] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 36 2: [2023-02-03 16:35:17,903] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 128 zero partition checkpoints for rank 23 0: successfully loaded checkpoint from lm1-1b5-66b at iteration 125429 15: time (ms) | load-checkpoint: 14164.37 0: estimated model parameters: 1.517252608 0: estimated model parameters without embeddings: 1.410035712 0: [after model, optimizer, and learning rate scheduler are built] datetime: 2023-02-03 16:35:18 0: > building train, validation, and test datasets ... 0: > datasets target sizes (minimum size): 0: train: 32109839 0: validation: 3225600 0: test: 25600 0: > building train, validation, and test datasets for GPT ... 0: > building dataset index ... 0: reading sizes... 0: reading pointers... 0: reading document index... 0: creating numpy buffer of mmap... 0: creating memory view of numpy buffer... 0: > finished creating indexed dataset in 0.008226 seconds 0: number of documents: 25071777 0: > dataset split: 0: train: 0: document indices in [0, 25071777) total of 25071777 documents 0: > loading doc-idx mapping from /scratch/project_462000119/data/c4_subsampled/gpt2tok_c4_en_12B_text_document_train_indexmap_32109839ns_2048sl_1234s_doc_idx.npy 0: > loading sample-idx mapping from /scratch/project_462000119/data/c4_subsampled/gpt2tok_c4_en_12B_text_document_train_indexmap_32109839ns_2048sl_1234s_sample_idx.npy 0: > loading shuffle-idx mapping from /scratch/project_462000119/data/c4_subsampled/gpt2tok_c4_en_12B_text_document_train_indexmap_32109839ns_2048sl_1234s_shuffle_idx.npy 0: loaded indexed file in 0.098 seconds 0: total number of samples: 35145305 0: total number of epochs: 6 0: > building dataset index ... 0: reading sizes... 0: reading pointers... 0: reading document index... 0: creating numpy buffer of mmap... 0: creating memory view of numpy buffer... 0: > finished creating indexed dataset in 0.052438 seconds 0: number of documents: 364608 0: > dataset split: 0: validation: 0: document indices in [0, 364608) total of 364608 documents 0: > loading doc-idx mapping from /scratch/project_462000119/data/c4_validation/gpt2tok_c4validation_rerun_text_document_validation_indexmap_3225600ns_2048sl_1234s_doc_idx.npy 0: > loading sample-idx mapping from /scratch/project_462000119/data/c4_validation/gpt2tok_c4validation_rerun_text_document_validation_indexmap_3225600ns_2048sl_1234s_sample_idx.npy 0: > loading shuffle-idx mapping from /scratch/project_462000119/data/c4_validation/gpt2tok_c4validation_rerun_text_document_validation_indexmap_3225600ns_2048sl_1234s_shuffle_idx.npy 0: loaded indexed file in 0.081 seconds 0: total number of samples: 3229145 0: total number of epochs: 38 0: > finished creating GPT datasets ... 0: [after dataloaders are built] datetime: 2023-02-03 16:35:34 0: done with setup ... 0: training ... 15: time (ms) | model-and-optimizer-setup: 37089.67 | train/valid/test-data-iterators-setup: 15061.95 0: [after training is done] datetime: 2023-02-03 16:35:34 15: ----------------------------------------------------------------------------------------------------------------- 15: validation loss at the end of training for val data | lm loss value: 3.035192E+00 | lm loss PPL: 2.080496E+01 | 15: ----------------------------------------------------------------------------------------------------------------- END 2806053: Fri Feb 3 16:36:18 EET 2023